OAI-ORE as transfer syntax
From: Jerome <jmcdo...@uiuc.edu>
Date: Thu, 20 Aug 2009 13:46:02 -0700 (PDT)
Local: Fri, Aug 21 2009 6:46 am
Subject: OAI-ORE as transfer syntax
Howdy,

Next in my list of OAI-ORE practicalities questions:

As part of our work on the Preserving Virtual Worlds project, I'm
working on how to transfer a packaged up game between the Univ. of
Illinois and Stanford.  For various reasons, Stanford would like to
receive all data/metadata for the package using the BagIt
specification.  I've got a bunch of metadata in OAI-ORE that
identifies the various digital assets I want to go in the package
delivered to Stanford (the game itself, representation information for
the game, context information for the game, provenance information for
all of the above) as well as the relationships between the assets (not
only OAI-ORE relationships, but FRBR and OAIS relationships as well,
e.g., this asset is semantic representation information for that
asset).  Being nicely formed OAI-ORE, all references to assets are
protocol-based URIs.

The problem comes when I want to put this all in BagIt (this being the
digital assets and the OAI-ORE files), tar and gzip the whole caboodle
and ship it to them.  I don't want the OAI-ORE referencing the copies
of the assets at my site.  In fact, for a couple of reasons (the most
salient being I have to dark archive some of this material), I can't
make it available on the public web server.  I want the OAI-ORE
document to reference the copies of the assets in the BagIt package
using file:/// URIs.  But that's not a protocol-based URI, is it?  And
so, not well-formed OAI-ORE.
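For readers unfamiliar with the packaging step described above, here is a minimal sketch of building a bag and tarring it up. It assumes the BagIt 1.0 layout (a `data/` payload directory, a `bagit.txt` declaration, and a checksum manifest); the function name `make_bag` and the payload paths are hypothetical, and a real transfer would more likely use an existing BagIt tool.

```python
import hashlib
import tarfile
from pathlib import Path


def make_bag(bag_dir: Path, payload: dict[str, bytes]) -> Path:
    """Write payload files under data/, add tag files, then tar and gzip the bag."""
    data_dir = bag_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    manifest_lines = []
    for rel_path, content in sorted(payload.items()):
        target = data_dir / rel_path
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(content)
        # One manifest line per payload file: checksum, then path relative to the bag.
        digest = hashlib.sha256(content).hexdigest()
        manifest_lines.append(f"{digest}  data/{rel_path}\n")

    # Bag declaration (assumes BagIt version 1.0).
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    (bag_dir / "manifest-sha256.txt").write_text(
        "".join(manifest_lines), encoding="utf-8"
    )

    # Tar and gzip the whole bag directory next to it.
    archive = bag_dir.parent / (bag_dir.name + ".tar.gz")
    with tarfile.open(archive, "w:gz") as tar:
        tar.add(bag_dir, arcname=bag_dir.name)
    return archive
```

The OAI-ORE files would simply be additional entries in the payload; the sketch says nothing about which URIs they should contain, which is the open question here.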

My solution space for this at the moment seems to be: 1. ignore the
OAI-ORE requirement for protocol-based URIs and use file:/// URIs to
reference digital assets in the BagIt directory hierarchy; 2. go to a
certain amount of time and trouble instituting a one-time-use
authentication mechanism that ensures that only a designated archivist
at Stanford can get at the restricted assets, and use BagIt fetch.txt
to reference them; or 3. Base64-encode the digital assets, and treat
them as literals in the OAI-ORE RDF expressions.  Can't say I'm
thrilled about any of those options, but #1 probably has the most
appeal to someone who A. doesn't want to engage in additional
transformations of the underlying assets (i.e., Base64) and B. is
congenitally lazy.

My questions: 1. Am I missing some obvious fourth option in the
solution space; and 2. Was there any official discussion/
recommendation of how to use OAI-ORE with something like a tarball of
files to ship content between repository sites?


From: Benjamin O'Steen <bost...@gmail.com>
Date: Fri, 21 Aug 2009 10:05:53 +0100
Local: Fri, Aug 21 2009 7:05 pm
Subject: Re: OAI-ORE as transfer syntax

I would recommend storing the resources in a pairtree fashion:

http://www.cdlib.org/inside/diglib/pairtree/pairtreespec.html

which solves a number of hacks I added when I first considered this:

http://oxfordrepo.blogspot.com/2009/02/pushing-bagit-manifest-concept...

Ben


From: Erik Hetzner <erik.hetz...@ucop.edu>
Date: Fri, 21 Aug 2009 17:15:25 -0700
Local: Sat, Aug 22 2009 10:15 am
Subject: Re: OAI-ORE as transfer syntax

At Thu, 20 Aug 2009 13:46:02 -0700 (PDT), Jerome wrote:

I am not an OAI-ORE expert (or even a particularly well-informed
amateur), but I do have some knowledge of BagIt and Pairtree
(mentioned later in the thread).

I think there are two problems here. Laying the groundwork, you are
transferring an object between site A and site B. The first problem is
that because you are using OAI-ORE, either site A or site B needs to
lay claim to the URIs that will describe the object.

The second problem is getting the data from site A to site B.

I don’t think that you can get around the first problem. One site
needs to take responsibility for managing the URIs. This doesn’t
necessarily involve making them dereferenceable (at least not
immediately).

The second is not necessarily related to the first. If you choose to
use http://sitea/object that does not mean that site B needs to use
HTTP to transfer that object from site A.

What site A and site B do need to do is agree on a way of mapping
http://sitea/object to some bytestream (representation).

I think what Ben was suggesting is that you can use pairpath to
provide a mapping between an HTTP URI and a path on a filesystem. For
example, http://sitea/object would map to:

ht/tp/+=/=s/it/ea/=o/bj/ec/t

You could then “dereference” the URI http://sitea/object by generating
this pairpath from it and looking inside your bag to see if it
contained that path. If it does, you have the “dereferenced”
representation of that URI.
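As a rough illustration of that lookup, the sketch below turns a URI into a pairpath and checks whether a bag contains it. The set of hex-escaped characters and the single-character swaps follow one reading of the Pairtree 0.1 draft and should be checked against the spec; the function names and the `data/pairtree_root` location inside the bag are assumptions made for the example, not anything BagIt or OAI-ORE prescribes.

```python
from pathlib import Path

# Characters escaped as ^hh before the single-character swaps
# (assumption: list taken from a reading of the Pairtree 0.1 draft).
_HEX_ESCAPE = set('"*+,<=>?\\^|')
# Single-character swaps applied to everything that was not hex-escaped.
_SWAPS = {"/": "=", ":": "+", ".": ","}


def clean_id(identifier: str) -> str:
    """Make an identifier filesystem-safe, Pairtree style."""
    out = []
    for byte in identifier.encode("utf-8"):
        ch = chr(byte)
        if byte < 0x21 or byte > 0x7E or ch in _HEX_ESCAPE:
            out.append(f"^{byte:02x}")
        else:
            out.append(_SWAPS.get(ch, ch))
    return "".join(out)


def pairpath(identifier: str) -> str:
    """Split the cleaned identifier into two-character path segments."""
    cleaned = clean_id(identifier)
    return "/".join(cleaned[i:i + 2] for i in range(0, len(cleaned), 2))


def dereference_in_bag(bag_dir, uri):
    """Return the path for `uri` inside the bag, or None if it is absent.

    Assumes (hypothetically) that the pairtree lives at data/pairtree_root.
    """
    candidate = Path(bag_dir) / "data" / "pairtree_root" / pairpath(uri)
    return candidate if candidate.exists() else None
```

With these definitions, `pairpath("http://sitea/object")` yields the path shown above, `ht/tp/+=/=s/it/ea/=o/bj/ec/t`.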

In summary, just because it starts with http:// doesn’t mean you have
to use HTTP to get it.

best,
Erik Hetzner

From: Benjamin O'Steen <bost...@gmail.com>
Date: Tue, 25 Aug 2009 10:00:54 +0100
Local: Tues, Aug 25 2009 7:00 pm
Subject: Re: OAI-ORE as transfer syntax

Just to point out one of the devices of pairtree that may be of use for
handling objects with URI names:

(copy&pasted from
http://www.cdlib.org/inside/diglib/pairtree/pairtreespec.html)

current_directory/
|   pairtree_version0_1        [which version of pairtree]
|    ( This directory conforms to Pairtree Version 0.1. Updated spec: )
|    ( http://www.cdlib.org/inside/diglib/pairtree/pairtreespec.html  )
|
|   pairtree_prefix
|    ( http://n2t.info/ark:/13030/xt2                                 )
|
\--- pairtree_root/
     |--- aa/
     |    |--- cd/
     |    |    |--- foo/
     |    |    |    |   README.txt
     |    |    |    |   thumbnail.gif
     |    |    ...
     |    |--- ab/ ...
     |    |--- af/ ...
     |    |--- ag/ ...
     |    ...
     |--- ab/ ...
     ...
     \--- zz/ ...
          | ...

The "pairtree_prefix" contains a string that should be prepended to
every identifier inferred from the pairtree rooted at "pairtree_root".
This may be used to reduce path lengths when every identifier in a given
pairtree shares the same initial substring. In the example above, the
pairpath "/aa/cd/" would thus correspond to the identifier
"http://n2t.info/ark:/13030/xt2aacd".

-----

Personally, I am quite fond of this mechanism, both for interchange and
for on-disc storage - migration of self-contained objects (book page
scan collections for example) is made easier, as you might only need to
change the prefix file.
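A small sketch of how the prefix mechanism could be applied when reading a pairtree back: the identifier is recovered by joining the path segments, undoing the character cleaning, and prepending the prefix. The function name `id_from_pairpath` is hypothetical, and the cleaning rules it reverses follow one reading of the Pairtree 0.1 draft.

```python
import re

# Reverse of the single-character swaps used when cleaning identifiers
# (assumption: swap table taken from a reading of the Pairtree 0.1 draft).
_UNSWAPS = {"=": "/", "+": ":", ",": "."}


def id_from_pairpath(ppath: str, prefix: str = "") -> str:
    """Recover an identifier from a pairpath, prepending the pairtree prefix."""
    cleaned = ppath.replace("/", "")
    swapped = "".join(_UNSWAPS.get(ch, ch) for ch in cleaned)
    # Decode ^hh escapes back to raw byte values, then read them as UTF-8.
    raw = re.sub(
        r"\^([0-9a-fA-F]{2})",
        lambda m: chr(int(m.group(1), 16)),
        swapped,
    )
    return prefix + raw.encode("latin-1").decode("utf-8")


old_prefix = "http://n2t.info/ark:/13030/xt2"
print(id_from_pairpath("/aa/cd/", old_prefix))
```

Moving the same tree to another naming authority would then mean calling the function with a different prefix string, while the directories on disc stay untouched.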

Ben

