Re: [thredds] ncml aggregation 200 times slower in thredds 4.6.1

  • To: Heiko Klein <heiko.klein@xxxxxx>
  • Subject: Re: [thredds] ncml aggregation 200 times slower in thredds 4.6.1
  • From: John Caron <caron@xxxxxxxx>
  • Date: Thu, 21 May 2015 20:39:52 -0600
ok, so looking closer, I see that there is a single file to cache the
results, so 0.1 seconds is more likely.

make sure all files have been cached, by requesting the time coordinate
values in a dods request (see post earlier today).
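
A minimal sketch of that check in Python: it builds an OPeNDAP request for a single variable and times repeated fetches, so the first request can be compared with later ones. The host in `BASE` is a placeholder, the dataset path is the one from the request log quoted further down, and the variable name `time` is assumed from the aggregation's `dimName`. The helper names are made up for illustration; this is not TDS code.

```python
import time
import urllib.request
from typing import Callable, List

# Placeholder host (assumption); the path is the one seen in the quoted request log.
BASE = "http://localhost:8080/thredds/dodsC/lustreMnt/heikok/FAUNA/mbr000_all.ncml"


def dods_url(base: str, variable: str = "time") -> str:
    """Return the OPeNDAP binary-data (.dods) URL for a single variable."""
    return f"{base}.dods?{variable}"


def http_fetch(url: str) -> bytes:
    """Fetch a URL and return the full response body."""
    with urllib.request.urlopen(url) as resp:
        return resp.read()


def time_requests(url: str, repeats: int = 3,
                  fetch: Callable[[str], bytes] = http_fetch) -> List[float]:
    """Fetch `url` `repeats` times; return the elapsed seconds of each fetch."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        fetch(url)
        timings.append(time.perf_counter() - start)
    return timings
```

Calling `time_requests(dods_url(BASE))` against each TDS version gives one duration per request; the first entry is the one that has to populate or read the cache.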

if that's the case, and subsequent accesses are still slow, send me the
mbr000_all.ncml file.

ps, i am gone until monday, so we can resume then...
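
For comparing the two versions from the server side, the durations can also be pulled out of threddsServlet log entries like the ones quoted below. This is a rough sketch under two assumptions that the thread does not confirm: that the second bracketed number is a request id shared by the "Request:" and "Request Completed" entries, and that the last field of "Request Completed" is the elapsed time in milliseconds. The function name is made up for illustration.

```python
import re
from typing import Dict, Tuple

# Each log entry begins with an ISO-style timestamp.
ENTRY_START = re.compile(r"(?=\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})")
# "[ <ms since startup>][ <request id>]" (meaning of the numbers is assumed).
IDS = r"\[\s*\d+\]\[\s*(\d+)\]"
START = re.compile(IDS + r'.*Request: "GET (\S+)')
# Assumed field order: status, bytes, elapsed milliseconds.
DONE = re.compile(IDS + r".*Request Completed - (\d+) - (-?\d+) - (\d+)")


def request_durations(log_text: str) -> Dict[int, Tuple[str, int, int]]:
    """Map request id -> (path, HTTP status, elapsed ms)."""
    text = " ".join(log_text.split())  # undo line wrapping
    paths: Dict[int, str] = {}
    result: Dict[int, Tuple[str, int, int]] = {}
    for entry in ENTRY_START.split(text):
        started = START.search(entry)
        if started:
            paths[int(started.group(1))] = started.group(2)
            continue
        done = DONE.search(entry)
        if done:
            req_id = int(done.group(1))
            result[req_id] = (paths.get(req_id, "?"),
                              int(done.group(2)), int(done.group(4)))
    return result
```

The input is expected to be raw log text, without the `>` quote prefixes that the mail archive adds.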



On Thu, May 21, 2015 at 6:58 PM, John Caron <caron@xxxxxxxx> wrote:

> reading 4575 files in 0.1 seconds seems a bit too fast. I'm guessing that
> the dataset is actually getting cached in memory, and you are seeing that
> performance. Then the question might be why isn't that happening as fast in
> 4.6?
>
> If you have "remote management" enabled, you can see what files are in the
> cache, and also clear the cache.
>
>
> http://www.unidata.ucar.edu/software/thredds/v5.0/tds/reference/RemoteManagement.html
>
> so, how long does it take in each version:
>
> 1) the first time that the aggregation is called, when there's nothing in
> the disk cache (e.g. lustre/mnt/heikok/FAUNA/mbr000_all.ncml)
> 2) the first time that the aggregation is called after the TDS starts up,
> when the disk cache is populated, but the dataset is not in memory (or it
> has been cleared from memory)
> 3) how long it takes after it's in memory.
>
> I believe that 2) could take 7 secs, and 3) takes 0.1 second, and maybe 1)
> takes 10-120 secs.
>
> it's possible that for 4.6, 2) has slowed down to 20 secs, and maybe it's
> not getting memory cached so 3) never happens. I will investigate that
> possibility.
>
> if you get a chance to experiment with checking/clearing memory cache with
> the 2 versions, let me know the results.
>
> John
>
> On Wed, May 20, 2015 at 10:41 AM, John Caron <caron@xxxxxxxx> wrote:
>
>> ok, we'll see if we can reproduce the problem.
>>
>>
>>
>> On Wed, May 20, 2015 at 10:25 AM, Heiko Klein <heiko.klein@xxxxxx> wrote:
>>
>>> Hi John,
>>>
>>> in 4.6.1, request-time stays at ~20s each time I try it. Only in 4.3.23
>>> I see a huge performance-gain (from 7s to 0.1s) after the first fetch.
>>>
>>> Heiko
>>>
>>> ----- Original Message -----
>>> > Hi Heiko:
>>> >
>>> > Can you check whether the times are fast again the second time you
>>> > access the dataset?
>>> >
>>> > thanks,
>>> > John
>>> >
>>> > On Wed, May 20, 2015 at 2:07 AM, Heiko Klein <Heiko.Klein@xxxxxx> wrote:
>>> >
>>> > > Hi,
>>> > >
>>> > > I have some performance problems after upgrading to thredds 4.6.1.
>>> > >
>>> > > I'm aggregating a large dataset with a joinExisting aggregation. Reading
>>> > > the metadata from the aggregation took, with thredds 4.3.23, about 0.1s
>>> > > (first time up to 7s). After upgrading to 4.6.1, the same request takes
>>> > > 20s (second time, first time not measured) and is unusably slow. A
>>> > > 'ncview' of the aggregated dataset is no longer possible.
>>> > >
>>> > > Suspecting some caching problems, I followed the guidelines in
>>> > >
>>> > > http://www.unidata.ucar.edu/software/thredds/current/tds/reference/ThreddsConfigXMLFile.html
>>> > > The aggregation-cache contains two files
>>> > >
>>> > > $ ls -l file-lustre-mnt-heikok-FAUNA-mbr000_all.ncml lustre/mnt/heikok/FAUNA/mbr000_all.ncml
>>> > > -rw-r--r-- 1 tomcat7 tomcat7 74339 May 20 09:43 file-lustre-mnt-heikok-FAUNA-mbr000_all.ncml
>>> > > -rw-r--r-- 1 tomcat7 tomcat7 74131 May 20 10:01 lustre/mnt/heikok/FAUNA/mbr000_all.ncml
>>> > >
>>> > > (I guess the first one is thredds 4.3, while the second one is thredds 4.6)
>>> > >
>>> > >
>>> > > The ncml-file is:
>>> > >
>>> > > <netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
>>> > >   <aggregation dimName="time" type="joinExisting">
>>> > >     <scan location="." regExp=".*snapMet000\.nc$" subdirs="true"/>
>>> > >   </aggregation>
>>> > > </netcdf>
>>> > >
>>> > >
>>> > > It is aggregating 4575 files, each about 1.3G.
>>> > >
>>> > >
>>> > > The requests according to the logs are:
>>> > > 2015-05-20T09:49:17.111 +0200 [    372240][     281] INFO  -
>>> > > threddsServlet - Remote host: 157.249.113.42 - Request: "GET
>>> > > /thredds/dodsC/lustreMnt/heikok/FAUNA/mbr000_all.ncml.dds HTTP/1.1"
>>> > > 2015-05-20T09:49:17.116 +0200 [    372245][     281] INFO  -
>>> > > threddsServlet - Request Completed - 200 - -1 - 5
>>> > > 2015-05-20T09:49:17.117 +0200 [    372246][     282] INFO  -
>>> > > threddsServlet - Remote host: 157.249.113.42 - Request: "GET
>>> > > /thredds/dodsC/lustreMnt/heikok/FAUNA/mbr000_all.ncml.das HTTP/1.1"
>>> > > 2015-05-20T09:49:17.131 +0200 [    372260][     282] INFO  -
>>> > > threddsServlet - Request Completed - 200 - -1 - 14
>>> > > 2015-05-20T09:49:17.134 +0200 [    372263][     283] INFO  -
>>> > > threddsServlet - Remote host: 157.249.113.42 - Request: "GET
>>> > > /thredds/dodsC/lustreMnt/heikok/FAUNA/mbr000_all.ncml.dds HTTP/1.1"
>>> > > 2015-05-20T09:49:17.138 +0200 [    372267][     283] INFO  -
>>> > > threddsServlet - Request Completed - 200 - -1 - 4
>>> > >
>>> > >
>>> > > Best regards,
>>> > >
>>> > > Heiko
>>> > >
>>> > >
>>> > > --
>>> > > Dr. Heiko Klein                   Norwegian Meteorological Institute
>>> > > Tel. + 47 22 96 32 58             P.O. Box 43 Blindern
>>> > > http://www.met.no                 0313 Oslo NORWAY
>>> > >
>>> > > _______________________________________________
>>> > > thredds mailing list
>>> > > thredds@xxxxxxxxxxxxxxxx
>>> > > For list information or to unsubscribe,  visit:
>>> > > http://www.unidata.ucar.edu/mailing_lists/
>>> > >
>>> >
>>>
>>
>>
>