Tuesday, March 25, 2014

Announcing OSG Software version 3.2.7

We are pleased to announce OSG Software version 3.2.7. This is a high-
priority update to the OSG 3.2 release series.

A few sites encountered a situation where the Globus GRAM Job Manager
would repeatedly crash. The conditions that trigger the initial crash
are rare, but once the Job Manager has crashed, it cannot recover. This
problem is present in OSG versions 3.2.4 through 3.2.6. We applied a
patch from the Globus developers that addresses both the crash and the
failure to recover.

Release notes and pointers to more documentation can be found at:

https://www.opensciencegrid.org/bin/view/Documentation/Release3/Release327

Need help? Let us know:

https://www.opensciencegrid.org/bin/view/Documentation/Release3/HelpProcedure

We welcome feedback on this release!

Wednesday, March 19, 2014

OSG Software Release 3.2.7 - March 25th - Globus Job Manager Fix - GOC Ticket # 20220

Some sites have discovered an issue with the "globus-gram-job-manager" package distributed in OSG 3.2. The symptoms are: repeated crashes of the "globus-job-manager" process with a segmentation fault, new jobs not getting queued, and the subdirectories under "/var/lib/globus/gram_job_state" filling up with unusually small state files. The issue is caused by globus-job-manager not correctly handling certain kinds of incomplete state files.
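As a rough aid for spotting the state-file symptom described above, the following Python sketch lists files under the state directory that fall below a size threshold. The function name and the `max_bytes` threshold are hypothetical; this announcement does not say what a "usual" state file size is, so the threshold would have to be chosen by comparing against known-good state files on the CE in question.

```python
import os


def small_state_files(root, max_bytes):
    """Return files under root whose size is below max_bytes.

    max_bytes is an assumption supplied by the admin; the announcement
    does not define a normal state file size.
    """
    small = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                size = os.path.getsize(path)
            except OSError:
                # File disappeared or is unreadable; skip it.
                continue
            if size < max_bytes:
                small.append(path)
    return sorted(small)
```

For example, `small_state_files("/var/lib/globus/gram_job_state", 1024)` would list state files smaller than 1 KiB, where 1024 is only an illustrative value.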

The affected versions of the package are 13.53-1.2.osg32.el5 and 13.53-1.2.osg32.el6. These versions were first released in OSG 3.2.4. CEs running the 3.1 series or those that have not yet upgraded to one of these versions should not be affected.

We have a new version of the globus-gram-job-manager package available in the OSG testing repositories. The new versions are 13.53-1.3.osg32.el5 and 13.53-1.3.osg32.el6. Admins of CEs running the affected version of globus-gram-job-manager are strongly encouraged to upgrade via the following command:

yum upgrade --enablerepo=osg-testing globus-gram-job-manager
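To decide whether a CE needs this upgrade, the installed package version can be compared against the affected versions listed above. The sketch below is a minimal illustration, not an official OSG tool: the function names are hypothetical, and it assumes the version strings in this announcement correspond to the RPM "version-release" fields as reported by `rpm -q --queryformat`.

```python
import subprocess

# Affected version-release strings, taken from the announcement.
AFFECTED = {"13.53-1.2.osg32.el5", "13.53-1.2.osg32.el6"}


def is_affected(version_release):
    """Return True if the given version-release string is an affected one."""
    return version_release.strip() in AFFECTED


def installed_version_release(package="globus-gram-job-manager"):
    """Ask rpm for the installed version-release, or None if not installed.

    Assumes an RPM-based host with the rpm command on the PATH.
    """
    result = subprocess.run(
        ["rpm", "-q", "--queryformat", "%{VERSION}-%{RELEASE}", package],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return None
    return result.stdout.strip()
```

On a CE, calling `is_affected(installed_version_release() or "")` would indicate whether the upgrade command applies.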

OSG will be making a 3.2.7 release containing this fix on March 25th.

Please see ticket 20220 at:
https://ticket.grid.iu.edu/20220

Tuesday, March 18, 2014

GOC Service Update - Tuesday, March 25th at 13:00 UTC

The GOC will upgrade the following services beginning Tuesday, March 25, 2014 at 13:00 UTC.
The GOC reserves 8 hours in the unlikely event that unexpected problems are encountered.
We encourage users to test affected services before the production release.

GLIDEIN
* Apply a patch to glideinwms factory to fix a multicore glidein memory bug causing glideins to fail on startup.

OASIS
* Replace the oasis.opensciencegrid.org Virtual Machine with a new one that has a rebuilt repository with much smaller but more numerous catalogs.
* Change the osg-oasis-update command on oasis-login.opensciencegrid.org to first verify that all files are readable by "other" before publishing changes.
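The kind of pre-publish permission check described for osg-oasis-update can be illustrated with a short Python sketch. This is not the actual osg-oasis-update implementation; the function name is hypothetical, and the extra requirement that directories also carry the "other" execute bit (so they can be traversed) is an assumption beyond what the announcement states.

```python
import os
import stat


def unreadable_by_other(root):
    """Return paths under root that lack read permission for "other"."""
    problems = []
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            mode = os.lstat(path).st_mode
            if stat.S_ISLNK(mode):
                # Permission bits on symlinks themselves are not meaningful.
                continue
            required = stat.S_IROTH
            if stat.S_ISDIR(mode):
                # Assumption: directories must also be traversable by "other".
                required |= stat.S_IXOTH
            if mode & required != required:
                problems.append(path)
    return sorted(problems)
```

A publishing script could refuse to proceed whenever this function returns a non-empty list and print the offending paths so they can be fixed with chmod.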

Monday, March 17, 2014

Announcing the 2014 OSG User School

ANNOUNCING THE 2014 OPEN SCIENCE GRID USER SCHOOL!

If you could access thousands, maybe millions of hours of computing, how
would it transform your research? What discoveries would you make?

We are looking for qualified students to attend the 2014 Open Science Grid
(OSG) User School, where they will learn how to use high throughput
computing (HTC) to harness vast amounts of computing power for research.

Using lectures, discussions, roleplays, and lots of hands-on work with OSG
experts in high throughput computing, students will learn how HTC systems
work, how to run and manage many jobs and huge datasets to implement a full
scientific computing workflow, and where to turn for help and more info.

Worried about costs? Successful applicants will get financial support to
attend the OSG School (July 7-10) at the beautiful University of Wisconsin
in Madison.

Ideal candidates are science, technology, engineering, and mathematics
(STEM) graduate students whose research demands large-scale computing.
Also, we will consider applications from faculty, staff, and advanced
undergraduates, so make a good case for yourself!

IMPORTANT DATES

Application Period: March 10 - April 4
OSG User School: July 7-10

MORE INFORMATION AND APPLICATIONS

Web: http://www.opensciencegrid.org/UserSchool
Email: osg-school-2014@opensciencegrid.org
Facebook: https://www.facebook.com/OSGUserSchool
Twitter: https://twitter.com/OSGUserSchool

Please forward this announcement to help us reach potential students. And
consider posting our flyer where appropriate:

https://twiki.opensciencegrid.org/twiki/pub/Education/OSGUserSchool2014/2014-osg-user-school-flyer.pdf

Wednesday, March 12, 2014

OSG All Hands Meeting - Registration

Dear Colleagues,

We are just a few weeks away from the OSG All Hands Meeting! We encourage you to register and participate in this important annual event for the Consortium. We have a very full and interesting agenda – the details of which are being built out at https://indico.fnal.gov/conferenceDisplay.py?confId=7207

If you plan on attending, please register and make your lodging arrangements soon.

The SLAC National Accelerator Laboratory is hosting this event to reflect its increasing activity as a member of OSG Communities and the OSG Consortium itself, and we are using the opportunity to invite and include more of our local researchers and scientists from Stanford University as well as SLAC itself. The scheduled topics include:
* Monday: US ATLAS and US CMS computing workshops
* Tuesday: The OSG Campus Infrastructures Community: campus distributed computing infrastructures and the national cyber ecosystem
* Wednesday: Plenary presentations from Scientists benefiting from the OSG as well as the OSG Project Leads giving the status and future plans
* Thursday: OSG New Technologies; Federated Storage Workshop (USA and Europe)
* Friday: Federated Storage Workshop (USA and Europe)

We invite you to visit the 2014 OSG AHM site at http://app.certain.com/profile/web/index.cfm?PKwebID=0x5948342f2c&varPage=home soon and register.

Amber Boehnlein, SLAC Division Director, Scientific Computing Applications
Lothar Bauerdick, OSG Executive Director
Ruth Pordes, OSG Council Chair

Tuesday, March 11, 2014

Announcing OSG Software version 3.1.31 and 3.2.6

We are pleased to announce OSG Software versions 3.1.31 and 3.2.6.

OSG 3.2.6 contains:

* Update to HTCondor-CE 0.6.3
(now works with PBS, plus SLURM through SLURM's PBS emulation layer)
* Update to GlideinWMS 3.2.3 (several bug fixes)
* Update to HTCondor 8.0.6 (several bug fixes)

Both 3.1.31 and 3.2.6 contain:

* Many updated Gratia probes (e.g., htcondor, slurm, psacct, dcache-transfer)
* Disable certinfo check on probes that don't need it
* Update BeStMan to support "root" SRM transfer protocol
* Update to XRootD 3.3.6 (several minor bug fixes)

OSG 3.1.31 contains:
* Improved integration between osg-configure, osg-info-services and gip
(Previously released in OSG 3.2.5)

Release notes and pointers to more documentation can be found at:

https://www.opensciencegrid.org/bin/view/Documentation/Release3/Release3131
https://www.opensciencegrid.org/bin/view/Documentation/Release3/Release326

Need help? Let us know:

https://www.opensciencegrid.org/bin/view/Documentation/Release3/HelpProcedure

We welcome feedback on this release!

Thursday, March 6, 2014

GOC Services Update - Tuesday, March 11th at 13:00 UTC

The GOC will upgrade the following services beginning Tuesday, March 11, 2014 at 13:00 UTC. The GOC reserves 8 hours in the unlikely event that unexpected problems are encountered. We encourage users to test affected services before the production release.

OIM 3.27
* Updated host certificate request records that have a NULL approver_vo_id, setting it to the current *best* matching VO ID.
* Removed /goc URL reference for goc ticket submitter
* Fixed CNEditor input width issue.
* Removed old bootstrap libs. Updated divrep.jar (DivRepUpload / Base64 encoding), stripped old js/css, and updated bootheaders to use CDNs.

MyOSG 2.21
* Removed ReSS service from operations status overview page.
* OSG Display (git/master 0554f5b)
* Fixing typo for “Month” instead of “Months”.

GOC-TX 1.35
* Modifying the way GOC-TX pulls ticket comments from ServiceNow (per Mike Baker)

GOC Ticket 1.73
* Added a null check to suppress false alerts from search controller
* Updated GOC ticket assignment configuration
* Removed Dan F. from campusgrid form assignment.

Monday, March 3, 2014

RESOLVED: Problems Issuing Certificates via OSG PKI

The previously reported issue with issuing OSG PKI certificates has been resolved. OSG Operations and DigiCert restored service at approximately 5:40pm Eastern time. It is now safe to continue issuing certificates as you have done in the past.

Users affected by this issue will be contacted and working certificates reissued.

We apologize for any inconvenience this may have caused.

Problems Issuing Certificates via OSG PKI

The OSG PKI is currently having an issue with new certificates not being signed properly. OSG Operations is working with DigiCert (issuer of OSG certificates) to investigate the cause. We will send an update as soon as we have more information. Meanwhile, please do not issue any certificates using the OSG PKI tools or the OIM web interface until further notice.

We apologize for any inconvenience this may cause.