NOTICE: This version of the NSF Unidata web site (archive.unidata.ucar.edu) is no longer being updated.
Current content can be found at unidata.ucar.edu.
To learn about what's going on, see About the Archive Site.
Oops, forgot to send this one to the list.. Just for completeness here it is again: Hi John, I didn't respond directly to all the questions you asked but I hope that what I wrote is sufficient... John Caron wrote: snip.. > >>Thus I agree with benno that there is not a very >>meaningful distinction between them (and reconsider my listing of them >>as orthogonal concepts in my previous message). >> >>I wonder if it would be a good idea to merge these concepts and use a >>less loaded word, say "entry", to refer to an entity that has meaning to >>THREDDS and to end users, but not to a data access protocol, i.e. >> >><catalog> >><service name="X"/> >><service name="Y"/> >>... >> >><entry name="my_dataset"> >> >> <metadata name="global-metadata" url="..."/> >> <access name="global-X-access"/> >> >> <entry name="monthly-data"> >> <metadata name="monthly-metadata" url="..."/> >> <access name="X-with-COARDS" serviceType="X" url="..."/> >> <access name="X-with-no-COARDS" serviceType="X" url="..."/> >> <access name="X-flattened-to-2D" serviceType="X" url="http://..."/> >> <access name="Y" serviceType="Y" url="..."/> >> .... >> </entry> >> >> >></entry> >> > > Ok so an "entry" meets meaning 1), while an "access" meets meaning 3) (we > dont need to worry about meaning 2) here). > > Some questions: > > 1) Should we understand that all the access elements within an entry are > different versions of the same dataset? Should we disallow: > > <entry name="monthly-data"> > <metadata name="monthly-metadata" url="..."/> > <access name="monthly-data from MARS" serviceType="X" url="..."/> > <access name="monthly-data from VENUS" serviceType="X" url="..."/> > </entry> > No, I was not implying that for an <entry> tag. I would allow your example. > 2) is there any relationship between peer elements, in your example > > <access name="global-X-access"/> > <entry name="monthly-data"> Not necessarily. I think what I am trying to suggest is while it may be useful for humans to think of some consistent object being accessed via different services, this really does not translate it to anything meaningful at the machine level. Unless we actually try to define some machine-readable relationship between the accesses (e.g. Type 1 aggregation, etc - which gets into the whole data model can of worms) the only thing a machine can understand is a named and described hierarchy of access objects. Of course, something is being lost here from the human's point of view. Humans seem to want to make a distinction that is not significant to machines: "a collection of accesses to some single underlying object" vs "a collection of accesses to different underlying objects, that share some common theme" Is this is what <dataset> and <collection> have been intended to mean? If this is the case then I would suggest that a) this distinction be preserved by allowing both tags to be used(possibly renamed if it would clarify things); and b) data providers should be encouraged to mark up their catalogs appropriately using the two tags, so that THREDDS client UI's can take advantage of this to present catalogs in an intuitive way; but c) these tags should be completely interchangeable in all other ways (i.e. same type in the DTD/Schema, and same API calls, any tag that can go in a dataset can also go in a collection), since they are semantically equivalent at a machine level. Does that make any sense? Benno, would that satisfy you? - Joe (ready for a checkup with my ontologist) > > > > > -- Joe Wielgosz joew@xxxxxxxxxxxxx / (707)826-2631 --------------------------------------------------- Center for Ocean-Land-Atmosphere Studies (COLA) Institute for Global Environment and Society (IGES) http://www.iges.org
From owner-thredds@xxxxxxxxxxxxxxxx Fri 7 2002 Jun 13:10:01
Message-ID: <1023469801.3d00e8e927da3@xxxxxxxxxxxxxxxx> Date: Fri, 7 Jun 2002 13:10:01 -0400 From: Benno Blumenthal <benno@xxxxxxxxxxxxxxxx> In-Reply-To: <07d501c20da0$6c5da4c0$568c7580@xxxxxxxxxxxxxxxx> To: thredds@xxxxxxxxxxxxxxxx Subject: collection vs dataset EOD? Re: orthogonality (was Re: New attempt) Received: (from majordo@localhost) by unidata.ucar.edu (UCAR/Unidata) id g57HA3l06049 for thredds-out; Fri, 7 Jun 2002 11:10:03 -0600 (MDT) Received: from beluga2.ldgo.columbia.edu (beluga2.ldgo.columbia.edu [129.236.110.182]) by unidata.ucar.edu (UCAR/Unidata) with ESMTP id g57HA1J06034 for <thredds@xxxxxxxxxxxxxxxx>; Fri, 7 Jun 2002 11:10:02 -0600 (MDT) Organization: UCAR/Unidata Keywords: 200206071710.g57HA1J06034 Received: (from nobody@localhost) by beluga2.ldgo.columbia.edu (8.11.6/8.11.6/d: iri.mc,v 1.4 2001/12/06 15:09:45 root Exp root $) id g57HA1526726 for thredds@xxxxxxxxxxxxxxxx; Fri, 7 Jun 2002 13:10:01 -0400 (EDT) Received: from 66.92.97.91 ( [66.92.97.91]) as user benno@xxxxxxxxxxxxxxxx by iri.columbia.edu with HTTP; Fri, 7 Jun 2002 13:10:01 -0400 References: <3CF64D51.ABC712D7@xxxxxxxxxxxxxxxx> <053b01c20be8$fbf2d970$568c7580@xxxxxxxxxxxxxxxx> <3CFD05AA.93D111B7@xxxxxxxxxxxxxxxx> <057e01c20bff$9c8c94a0$568c7580@xxxxxxxxxxxxxxxx> <3CFD1E87.1DC51C44@xxxxxxxxxxxxxxxx> <3CFD34EF.20705@xxxxxxxxxxxxx> <060601c20caf$4f64a040$568c7580@xxxxxxxxxxxxxxxx> <3CFE43B3.224C33D6@xxxxxxxxxxxxxxxx> <3CFE4F8D.1BAA7D42@xxxxxxxxxxx> <3CFE7B10.6030806@xxxxxxxxxxxxx> <076101c20d90$db945ba0$568c7580@xxxxxxxxxxxxxxxx> <3CFFB59B.4080502@xxxxxxxxxxxxx> <07d501c20da0$6c5da4c0$568c7580@xxxxxxxxxxxxxxxx> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit User-Agent: Internet Messaging Program (IMP) 3.0 Sender: owner-thredds@xxxxxxxxxxxxxxxx Precedence: bulk Hi John and Joe, Since I was asked, I am answering, not that I am adding anything useful. Yes, if collections and datasets are completely interchangable in all machine-type ways, that works for me. I think John gives the definitive summary below. Of course, if I have a dataset that temporarily does not have any functioning access methods on a particular server, one may not always feel the need to relabel it a collection... Benno Quoting John Caron and Joe <caron@xxxxxxxxxxxxxxxx>:
> If this is the case then I would suggest that > > a) this distinction be preserved by allowing both tags to be > used(possibly renamed if it would clarify things); and > > b) data providers should be encouraged to mark up their catalogs > appropriately using the two tags, so that THREDDS client UI's can take > advantage of this to present catalogs in an intuitive way; but > > c) these tags should be completely interchangeable in all other ways > (i.e. same type in the DTD/Schema, and same API calls, any tag that can > go in a dataset can also go in a collection), since they are > semantically equivalent at a machine level. > > Does that make any sense? Benno, would that satisfy you? > > - Joe (ready for a checkup with my ontologist)
Quoting John:
Actually Im inclined to take it a bit further. Currently a collection is just some collection of datasets that share some common theme. If we allow it also to be a dataset (meaning it has a URL, can be selected, etc) then I think it should have the meaning that contained datasets are subsets or specializations of it. Because if they are not it seems to me that you might as well just represent the collection-as-dataset as a contained dataset element. [Maybe in this whole discussion I have been trying to convince myself of that :^] Does everyone agree with that meaning of nested datasets inside of collection-as-dataset? PS: There are still semantic difference between collections and datasets: A dataset has one or more access elements, a collection 0 or more. Collections contain datasets and nested collections. OTOH, datasets and collections look so similar already in the XML, its tempting to combine them (which i was playing with earlier in http://www.unidata.ucar.edu/projects/THREDDS/xml/InvCatalog.0.6a.dtd)
thredds
archives: