Dispatches From The Geeks

News and Announcements from the MCS Systems Group

Author Archive

Data Center and systems outage, Monday, March 6, 2017.

In order to install a new breaker panel in the building 240 data center, electric power needs to be taken down for a subsection of the data center – unfortunately that section comprises the bulk of the computers in the room.

This outage will effectively take down the computers in the 240 data center. The outage window is expected to be from 9AM until 3PM on that day. The CELS IT systems affected will be:

* The CELS Linux/Unix services, including: file servers (project directories, home directories), login.mcs.anl.gov, building 240 linux desktops and linux print services, compute servers (see https://wiki.mcs.anl.gov/IT/index.php/General_MCS_Questions#computeservers for the list of compute servers), jenkins.cels.anl.gov, buildbot.mcs.anl.gov

* accounts.mcs.anl.gov

* BIO divisional servers (Y Drive, X Drive, print server – use CIS server instead)

* SVN repositories hosted at repo.anl-external.org.

* The license servers for PGI, NAG, starcd, mathematica, totalview, idl, and accelrys.

* other project-specific systems and services not provided by CELS IT – expect to get announcements from the systems administrators of those systems.

The following will *not* be affected:

* Most websites hosted by CELS systems (including wikis, confluence, wordpress)

* E-mail list servers

* License servers for Intel, esgee, matlab

* ANL Authentication (anl.gov domain accounts, Single Sign On, etc.)

* ANL or externally provided apps (Exchange, Box, Workday, etc.)

* Building 240 office-side operations. Only the data center will be affected.

We apologize for the inconvenience. This outage was scheduled to coincide with scheduled downtimes already lined up for the primary systems in the room. Due to the scope of the work being provided, there is no way to perform this work in a safe manner without this level of power outage.

Written by Craig Stacey

February 13, 2017 at 9:39 am

Posted in Uncategorized

E-Mail Service upgrades at Argonne

In the coming months, Argonne will be moving its email service from Microsoft Exchange servers hosted on-site running an old version of Microsoft Exchange to a hosted solution run by Microsoft. I’ve been on this new service for about a year at least, and it’s great – much more capable than the current offering at Argonne.

You’ll get more communications directly from CIS about this when you’re scheduled to upgrade, including pointers to documentation and letting you know what you can expect while the migration is happening.

The actual migration process for this is really painless, as well. In the end, as a user, you’d notice a 5-10 minute period where you couldn’t connect to your mailbox. Also, depending on what method you use to read your mail (POP/IMAP especially), you might need a little extra assistance getting your settings changed. But most people the change will be near seamless.

Due to some technical reasons, it’s easier if we migrate divisions together (shared calendars and mailboxes being a big part of this). These migrations will be starting this month, and run through September at the latest. If your division has some sort of event or project in this time frame that you think would be negatively affected by your division moving during that time frame, let me know and I’ll block it out for the division.

Thanks!


Craig

Written by Craig Stacey

February 1, 2017 at 8:54 am

Posted in Uncategorized

Endnote X8 site licensing

We have updated packages for Endnote, now at version X8. The updated packages can be found at the links below.

Mac users: http://www.cels.anl.gov/macos#Mac_OS_X_Software

Windows users: http://www.cels.anl.gov/windows/Software#Endnote_X8

Please note these links are for CELS (MCS, ALCF, EVS, BIO, CELS & associated institutes) employees only. Do not distribute these files or licenses beyond your own use. The distribution links will require you to login to your Box account to retrieve the files.

Written by Craig Stacey

January 19, 2017 at 5:28 pm

Posted in Uncategorized

Systems Announce File Server Maintenance, Saturday Dec 17

Greetings,

The file server migration and maintenance project is complete.

We had to reboot the login servers and all the compute nodes, our apologies if any of your jobs were interrupted.

You may need to reboot your workstation if you are not able to write into your home directory.

Contact the CELS Help Desk if you notice any issues.

Thanks for your patience.

Written by Craig Stacey

December 17, 2016 at 4:25 pm

Posted in Uncategorized

File Server Maintenance, Saturday Dec 17

The file server maintenance we postponed in October is now taking place and will continue from 9AM onward (all day) to finally migrate the rest of our unix homes off the aging file server onto our new one. To do this, we have functionally taken the home filesystem offline. During this period, all home filesystems will be read-only, and could be unavailable for stretches of time.

This should not affect e-mail, web, or other services beyond the MCS linux computing infrastructure (https://wiki.mcs.anl.gov/IT/index.php/Linux). During this period, you will not be able to save any files to your home directory, you should be able to login to MCS workstations and compute servers, including login.mcs.anl.gov, but this is not guaranteed. After the migration is complete, all linux workstations and compute servers that use this home directory server will be rebooted.

Thanks!

Written by Craig Stacey

December 17, 2016 at 9:12 am

Posted in Uncategorized

File Server Maintenance, Saturday Dec 17

The file server maintenance we postponed in October will be taking place on this coming Saturday, Dec 17 from 9AM onward (all day) to finally migrate the rest of our unix homes off the aging file server onto our new one. To do this, we need to functionally take the home filesystem offline. During this period, all home filesystems will be read-only, and will be unavailable for stretches of time.

This will not affect e-mail, web, or other services beyond the MCS linux computing infrastructure (https://wiki.mcs.anl.gov/IT/index.php/Linux). During this period, you will not be able to save any files to your home directory, and may not be able to login to any MCS workstations or compute servers, including login.mcs.anl.gov. After the migration is complete, all linux workstations and compute servers that use this home directory server will be rebooted.

Thanks!

Written by Craig Stacey

December 16, 2016 at 4:49 pm

Posted in Uncategorized

Reduced coverage tomorrow for CELS Service Desk and Support

Due to the lab’s recommendation on telecommuting tomorrow, our in-person presence will be reduced tomorrow. We’ll all be online and responding to requests. If something urgent comes up that requires an in-person presence, a number of us are prepared to come in and deal with it, but there will be obvious delays due to travel time.

If you need in-person help and are unable to use e-mail, please feel free to call the central CIS help desk at x-9999 who can open a ticket for you and assign it to us. Just ask them to assign it to the CELS Service Desk.

Thanks, and stay warm and safe!


Craig

Written by Craig Stacey

December 14, 2016 at 3:17 pm

Posted in Uncategorized

File Server Maintenance, Saturday Dec 17

The file server maintenance we postponed in October will be taking place on this coming Saturday, Dec 17 from 9AM onward (all day) to finally migrate the rest of our unix homes off the aging file server onto our new one. To do this, we need to functionally take the home filesystem offline. During this period, all home filesystems will be read-only, and will be unavailable for stretches of time.

This will not affect e-mail, web, or other services beyond the MCS linux computing infrastructure (https://wiki.mcs.anl.gov/IT/index.php/Linux). During this period, you will not be able to save any files to your home directory, and may not be able to login to any MCS workstations or compute servers, including login.mcs.anl.gov. After the migration is complete, all linux workstations and compute servers that use this home directory server will be rebooted.

If this poses an inconvenience to you that would require us to reschedule this, please let us know now. I will make another announcement on Thursday unless we reschedule due to your requests or issues we encounter in the prep for this.

Thanks!

Written by Craig Stacey

December 12, 2016 at 11:06 am

Posted in Uncategorized

COMPLETE: 11/12/2016 File Server Reboot

Greetings,

This is the ALL-CLEAR

The file server reboot completed without incident. All filesystems should be available again.

If you are having trouble do this first:

Restart your terminal session.
Reboot your workstation.

If this doesn’t seem to help then contact the CELS Helpdesk.

Thanks for your patience

Written by Craig Stacey

November 12, 2016 at 10:59 am

Posted in Uncategorized

REMIDER: 11/12/2016 File Server Reboot

File Server Reboot on Saturday, November 12th from 10:AM until 12 Noon

In order to apply important system updates and security patches we will be rebooting the file server called sto10.mcs.anl.gov this Saturday morning. We expect the work to take no more than an hour to complete but are scheduling a 2 hour “outage window” so we have time to take care of any unanticipated problems.

What do you need to do?
Before the outage begins you should suspend any automated scripts or processes that would access any of the subdirectories of the affected NFS mountpoints, (listed below). You should save any work you are doing in them and try to get any active terminal sessions out of the affected mountpoints. If you need advice on how to do this contact the CELS Help Desk
After the outage is over (indicated by our “all-clear” messages) you may need to reboot your work stations or log out and back in to your open ssh or terminal sessions.

If you feel that this outage window is unacceptable get in touch with us immediately so we can try to address your needs.

What are we doing?
We are shutting down and rebooting the file server named sto10.mcs.anl.gov

Why are we doing this?
There are a number of important operating system, security and service software updates available for the system.

When are we doing this?
We will begin work at 10:00 AM, Saturday, November 16, 2016

What services will be affected?
The UNIX NFS mounted “project directories” (listed below) will be offline for the duration of the outage.

How long will the outage last?
We anticipate that the outage will only last for about an hour, but it could be offline for up to 2 hours.

How will you know that the service is back online?
We will send an “all clear” message to this list, as well as a notice via twitter ( follow us at @mcssys ) and an update to the “Dispatches from the Geeks” blog https://mcssys.wordpress.com/

Thank you for your patience and we apologize for the inconvenience

List of NFS mountpoints that will be OFFLINE during the outage:

/nfs/UncertaintyClimate
/nfs/alcf-admins
/nfs/cels-media
/nfs/cels-systems
/nfs/geassm
/nfs/gtr.globus.org
/nfs/mcs-proj-climate
/nfs/mcs-proj-dynamics
/nfs/mcs-proj-magnetic
/nfs/mcs-proj-magnetic2
/nfs/mcs-proj-source
/nfs/ms-software
/nfs/ms-users
/nfs/noaa
/nfs/proj-climate1
/nfs/proj-climate2
/nfs/proj-davidk
/nfs/proj-dsl
/nfs/proj-emconsta
/nfs/proj-fischer00
/nfs/proj-fischer01
/nfs/proj-fischer02
/nfs/proj-fischer03
/nfs/proj-flash
/nfs/proj-fluids
/nfs/proj-genogrp
/nfs/proj-genomics
/nfs/proj-lans1
/nfs/proj-mpich
/nfs/proj-swat
/nfs/proj-sysbio
/nfs/proj-tpeterka
/nfs/sharp
/nfs/uso
/nfs/proj-phasor

Written by Craig Stacey

November 11, 2016 at 11:03 am

Posted in Uncategorized