
Re: [conduit] Conduit Outage 4/15 into 4/16

Good afternoon everyone,

There hasn't been an update message from NCEP since yesterday morning, but
as of this writing here's where things stand:

Data has been flowing on CONDUIT for a few hours now, though I do not know
if it's everything or if it'll stay online.  It came back up shortly before
the 12Z GFS and currently the 18Z NAM is coming in.  I'll try not to look
at it the wrong way.

On Wednesday morning a Critical Weather Day was issued to help ensure NCEP
et al. have all the resources needed for the recovery efforts.  Originally
the CWD was scheduled to end Saturday morning, but earlier today it was
extended to Monday evening.  My read is that this says something about their
confidence level, and there's a chance CONDUIT and other impacted services
could still go up and down; an all clear would be premature.

Model data continues to be available on nomads.ncep.noaa.gov, and other
missing data may exist in other locations too.  Please reach out if you need
help finding data.

Here is the latest ADMSDM update on the restoration efforts:
https://mesonet.agron.iastate.edu/wx/afos/p.php?pil=ADMSDM&e=202404180955

Here is the latest CWD statement:
https://mesonet.agron.iastate.edu/wx/afos/p.php?pil=ADASDM&e=202404191520

And here is that CONDUIT RTSTATS graph, showing there's life on the
feed again:
https://rtstats.unidata.ucar.edu/cgi-bin/rtstats/iddstats_num_nc?CONDUIT+conduit.unidata.ucar.edu

We will continue to monitor and I'll update these lists as more
information comes out, and if nothing else I'll send another update Monday.

Best,
-Mike


*Mike Zuranski*
Data Engineer II
NSF Unidata Program Center
University Corporation for Atmospheric Research


On Wed, Apr 17, 2024 at 9:47 AM Mike Zuranski <mzuranski@xxxxxxxx> wrote:

> Good morning all,
>
> The situation doesn't seem to have changed much from last night.  CONDUIT
> is still down, numerous impacts to NCEP web sites and services, cats & dogs
> living together...
>
> We continue to monitor but that's about all we can do.  The link I'm
> smashing the F5 button on is this rtstats chart; it's probably the easiest
> "Is it still down?" check at this time for CONDUIT:
>
> https://rtstats.unidata.ucar.edu/cgi-bin/rtstats/iddstats_num_nc?CONDUIT+conduit.unidata.ucar.edu
>
> If you are having a hard time finding model data that is normally on
> CONDUIT, nomads.ncep.noaa.gov has remained unaffected by all of this.
> Also, please reach out if you need assistance finding other sources of data
> or anything else.
>
> Below is a notice that sums things up nicely; this and other notices can
> be found at https://www.nco.ncep.noaa.gov/status/messages/ (luckily that
> site has come back).  We will update with any changes to the situation.
>
> SENIOR DUTY METEOROLOGIST NWS ADMINISTRATIVE MESSAGE
> NWS NCEP CENTRAL OPERATIONS COLLEGE PARK MD
> 1019Z WED APR 17 2024
>
>
> ...UPDATES TO RECENT NWS OPERATIONAL OUTAGES...
>
>
> WIDESPREAD WFO NETWORK OUTAGES...
> WFOs internet and AWIPS connections have remained stable since
> the circuit was restored in College Park Tuesday morning. NCO's
> network team will continue to work towards mitigating impacts
> during the recurring circuit outages in College Park.
>
>
> NWS BROADCAST REPEATED MESSAGES...
> The problem with NWS products being broadcasted multiple times
> was traced to Monday's efforts to mitigate impacts from the
> temperature spike in the College Park Data Center. NCO
> implemented a fix to correct the issue at 2:00pm EDT Tuesday.
>
>
> MRMS...
> CONUS QPE data continues to not update on MRMS
> (https://mrms.ncep.noaa.gov/data/). The problem has been linked
> to the College Park Data Center Outage. Efforts to restore the
> data will resume early Wednesday.
>
>
> RECOVERY EFFORTS IN THE COLLEGE PARK DATA CENTER...
> No significant updates during overnight restoration efforts. NCEP
> Center's (OPC, CPC, and WPC) operations remain severely degraded
> due to downed NetApp systems in College Park. No ETR.
>
> Current Known Impacts include:
> -NCEP Centers' websites hosted in CP that remain inaccessible
> include EMC and NCWCP intranet sites.
> -WPC, OPC, and CPC's operational product suites' status, range
> from being degraded to down.
> -FTPPRD is inaccessible in CP (Customers are able to use
> nomads.ncep.noaa.gov as a viable backup in the meantime)
> -NCO operations personnel are unable monitor NWS networks and
> circuits.
> -CONUS QPE data is not updating on MRMS
> (https://mrms.ncep.noaa.gov/data/)
> -Several layers are not updating on NWS Cloud Services (GIS and
> Map Viewer)
> -Multiple outside datasets are not available/delayed (UKMET data,
> ECMWF data, Canadian METARS, ACARS aircraft data)
>
>
>
>
> Gerhardt/SDM/NCO/NCEP
>
>
>
> Best,
> -Mike
>
>
> *Mike Zuranski*
> Data Engineer II
> NSF Unidata Program Center
> University Corporation for Atmospheric Research
>
>
> On Tue, Apr 16, 2024 at 10:05 PM Mike Zuranski <mzuranski@xxxxxxxx> wrote:
>
>> The time is now 03:05 UTC, do you know where your data is?
>>
>> At this time CONDUIT is still down.  I had an exchange with NCEP Ops
>> earlier this evening so I know they were still working on it then.  They
>> are fully aware of the breadth of the situation and are working with the
>> applicable parties to resolve this as soon as possible.  Given that, I do
>> not plan on reaching out to them again on this unless the symptoms change.
>>
>> I haven't seen any admin notices or similar pertaining to this, but we
>> are watching closely.  I'll keep you posted with any updates I hear, and I
>> plan on making another status update tomorrow morning.
>>
>> In the meantime, the data that's missing on CONDUIT may well be found at
>> nomads.ncep.noaa.gov, which has remained unaffected during these
>> troubling times.  Feel free to reach out if we can help you find another
>> source of data or anything else.
>>
>> Best,
>> -Mike
>>
>>
>> *Mike Zuranski*
>> Data Engineer II
>> NSF Unidata Program Center
>> University Corporation for Atmospheric Research
>>
>>
>> On Tue, Apr 16, 2024 at 10:52 AM Mike Zuranski <mzuranski@xxxxxxxx>
>> wrote:
>>
>>> Greetings all,
>>>
>>> It appears the CONDUIT feed was down from yesterday afternoon until just
>>> moments ago, when it began to transmit data again.  It looks like this is
>>> being actively worked on, but I'm keeping a close eye on the situation.
>>>
>>> While I haven't seen any notifications on the subject, the source of the
>>> outage is upstream from us so we don't have much control over it.  If data
>>> stops flowing again I'll reach out to the appropriate parties.
>>>
>>> Sorry for any inconvenience,
>>> -Mike
>>>
>>>
>>> *Mike Zuranski*
>>> Data Engineer II
>>> NSF Unidata Program Center
>>> University Corporation for Atmospheric Research
>>>
>>