Report of the Run Coordination Task Force
Version 2.2
March 18, 1999
Task Force Members
Vince Cianciolo, Achim Franz, Steve Pate, Michael Sivertz (Chair)
The Task Force met during the December Core Week to go over the charge, and to collect points of importance which must be included in the recommendations.
Every member of the Task Force brought strong opinions about various aspects of Run Coordination, but there was unanimous agreement that the most important aspect was a clear chain of command. The collaboration needs to select competent people to fill the positions in Run Coordination. This should be easy to do with a base of over 400 collaborators. Then the collaboration must grant authority to those people and make clear where the responsibility rests for all of the tasks associated with running the experiment.
From this starting point, all of the other recommendations are details. What follows are the detailed recommendations of the Task Force.
The person responsible for directing the operation of the detector is the Run Coordinator (RC).
The person supervising the day to day operations of the detector is the Period Coordinator (PC).
The person responsible for activities of the Shift Crew during a shift is the Shift Leader (SL).
The Institutional Board (IB), in consultation with the Executive Council (EC), shall determine the algorithm by which the shift commitment of PHENIX collaborating institutions is calculated. This applies to shift assignments for the PC, SL and the Shift Crew. It shall be the responsibility of the IB members or their designates to communicate with the RC regarding shift assignments. The Task Force requests that the shift assignment algorithm be made as simple as possible in order to reduce the bookkeeping load on the RC.
The Shift Crew commitment shall be one full week of shifts, and shifts shall be assigned in one week blocks. The PC shift commitment shall be for two weeks.
The shift commitment shall be allocated irrespective of all other service tasks.
The Run Coordinator shall be appointed by the Spokesperson in consultation with the EC. This appointment will be for a period of one year, beginning approximately 12 weeks prior to each 37 week running period. This will allow the RC time to prepare for the activities associated with the up-coming run.
The RC shall be resident at BNL during the appointment.
It is the duty of the RC to report to the Spokesperson and EC on a monthly basis, or as requested by same.
Before each 37 week running period starts, the RC will send out shift assignments to the collaborating institutions via the IB members. The shift assignments will be a calendar with shifts allocated to institutions. There will be three specific roles to be assigned; Period Coordinator, Shift Leader, and Shift Crew. The IB members will respond to the RC by giving the names of people for each assigned shift. In case of scheduling conflicts, it is up to the institution, not the RC, to arrange shift trades. If IB members do not respond to the RC request in a timely manner, the Spokesperson will take action to ensure compliance.
Each Period Coordinator (PC) will serve for a two week term of duty.
Each IB member shall identify at least one person from their institute who can act as Period Coordinator. The RC will approve the list of PC nominees submitted by the IB members. The RC shall draw up a PC Shift Schedule allocating two week slots to the collaborating institutions, according to the algorithm determined by the EC. The RC will send this PC Shift List to the IB members. The IB members shall sign up their nominee on the PC Shift Schedule. If a collaborating institution does not have a qualified nominee, then a representative from that institution shall come to PHENIX and be trained for the position by the RC.
Each Shift Leader (SL) will serve for a one week term of duty.
The RC shall draw up a SL Shift Schedule allocating one week slots to the collaborating institutions. The IB members will sign up their nominee on the SL Shift Schedule, pending the approval of the RC. The RC will approve the list of SL nominees submitted by the IB members. If a collaborating institution does not have a qualified nominee, then a representative from that institution shall come to PHENIX and be trained for the position by the RC.
A SL will be on hand at all times during running periods to direct activities on the shift. The person selected as SL will remain SL for the duration of the shift period (one week). Each person who serves as SL will also serve as the Alternate PIC for Building 1008.
The RC shall have the following responsibilities:
The PC shall have the following responsibilities during each two week shift period:
The SL shall have the following responsibilities:
During running periods there shall be a minimum of four people on shift at all times. This will include a SL and three other members of the shift crew. For later reference these shift members will be called "Shift Leader", "Shift Assistant 1", "Shift Assistant 2", and "Shift Assistant 3". During watch periods, such as when there are machine studies or during start up, there shall be a minimum of two people on shift at all times.
The Shift Crew shall have the following responsibilities:
If a shift crew member does not show up (or arrives late for shift) the PC shall determine the most appropriate way of dealing with the situation.
The shift quantum is one full week of shifts. A person coming to BNL to take shifts should expect to take them in one week (7 shifts) blocks. The persons taking the same shift for one week will therefore serve together for that week and form a single "shift crew".
The role of Shift Assistant 3 is a special one in that people taking this shift can split their assignment. Some collaborators will not always be able to commit to a week of shifts at a time, so when they are given a week of shifts they can trade part of their shift with other collaborators in order to make a more convenient schedule for both people. There will be a web site managed by the RC dedicated to facilitating shift trading.
Cycle A: The Shift Leader and Shift Assistant 1 shall take shifts on Cycle A.
"Owl" Shift 00:00 08:00
"Day" Shift 08:00 16:00
"Evening" Shift 16:00 24:00
Cycle B: The Shift Assistant 2 and Shift Assistant 3 shall take shifts on Cycle B.
"Rooster" Shift 04:00 12:00
"Physicist" Shift 12:00 20:00
"Vampire" Shift 20:00 04:00
In order to keep the detector from collapsing in chaos every Monday as the new group of shifters replaces the old group, it is proposed that the new shift cycles be staggered. There are two methods of staggering under consideration. The decision of which method to implement will be left to the RC, as more running experience is gained.
Method A: In the first method, only one member of the three-person shift crew will be replaced at any particular time. Each week-long period of shifts for the "Shift Leader" will begin on Monday. The new week of shifts for "Shift Assistant 1" will begin on Wednesday. The new week of shifts for "Shift Assistant 2" will begin on Thursday. The new week of shifts for "Shift Assistant 3" will begin on Friday.
Method B: In the second method, only one shift crew will be replaced at any particular time. The Owl (and Rooster) Shift Crew will be replaced on Mondays, Day (and Physicist) Shift Crew will be replaced Wednesday, and Evening (and Vampire) Shift Crew will be replaced on Fridays.
At any given moment, the following persons are in charge of the immediate operations of the experiment:
Run Coordinator
Period Coordinator
Shift Leader
All others persons, including experts who have been called in, must act in accordance with the instructions of these people. All major changes to the running conditions of the experiment must be cleared with the RC in advance. Such changes should be discussed and planned at the weekly and daily meetings, if time permits.
During the operation of PHENIX there will be situations in which the shift crew, following documented experimental operating procedures, is instructed to "call an expert." There will be other situations when the proper course of action is not even documented. Such situations (which may or may not constitute an emergency) cannot and should not be handled solely by the shift crew.
The task force recommends that the Run Coordinator maintain a list of On-Call Experts. This list should include at least two persons for each subsystem (the primary and an alternate) and the task force can foresee additional on-call requirements not strictly related to any given subsystem (e.g., the Period Coordinator, infrastructure, etc.). Each person on the list must be locally available to respond to such situations involving their area of expertise. In addition, the task force recommends that PHENIX provide cellular phones to each primary On-Call Expert.
Even with this list carefully maintained, situations may arise in which neither the primary nor the alternate On-Call Expert can be contacted. For this reason, the Task Force recommends that the Run Coordinator maintain a second list - Off-Call Experts. This list would have the same organization as the On-Call Experts list, broken down by subsystem with additional categories as required. Persons on this list should be listed in order of contact preference so that, e.g., the first person is effectively the second on-call alternate, the second person is the third on-call alternate, etc. Persons on this list would not have any responsibility to be locally available but could provide back up expertise in emergencies.
Every PHENIX Shift Crew member will take all required BNL and RHIC training courses. The PHENIX Safety Coordinator shall supply the RC with a list of all BNL and RHIC Training Requirements for people on shift. PHENIX as a whole will take responsibility for preparing materials sufficient to train Shift Crew members in the safe and productive operation of PHENIX. The RC shall maintain a web-accessible list of all required BNL, RHIC and PHENIX-specific training requirements. All shifters are advised to check the RC web site well in advance of their shifts to ensure that they have completed all the required training. Each PHENIX Shift Crew member is ultimately responsible for realizing the great responsibility that being on shift entails and for taking the shift training seriously.
Each subsystem will supply thoroughly documented operating procedures. The call-list protocol recommended by the task force will provide significant incentive to subsystems to make these procedures comprehensive and easy to understand since any situation not found in the procedures will require calling in the designated On-Call Expert. Training for shift duty shall include a thorough reading of the documentation pertaining to normal shift duties and a perfunctory understanding of possible non-standard situations and procedures.
This documentation shall:
In addition to studying the documentation provided, new Shift Crew members are encouraged to audit shifts prior to start of their first official shift. "Auditing" is the practice of sitting in on an active shift and watching everything that goes on during the shift. Advance arrangement should be made with the RC to avoid crowds in the counting room.
The task force recommends that the PC and SL make a point of mentoring Shift Crew members not trained for these jobs. This will increase the pool of candidates from which new nominees can be picked.
RHIC is a new machine and currently no detailed time-scale for the startup is available. This makes it difficult to set up a formal procedure.
In the coming months each subsystem will define its role, i.e. what detector in which configuration, and what will be their needs in the ER, such as gas, HV, and LV. This will require prior approval by the DC and RHIC/PHENIX Safety.
The RC for the ER (Achim Franz) will soon submit a request to IB members for a list of people available for the ER shifts (PC, SL, and Shift Crew). On a similar time scale a list of experts for each subsystem will be defined so that the On-Call Expert list can be made up.
As soon as detectors are put on operating gas, round the clock shift coverage will start. There will be a minimum of two person per shift in watch mode. All PHENIX groups shall cover these shifts, not only the subsystems using gaseous detectors. Subsystems shall be allowed at this time to setup and monitor ancillary systems, HV, LV, cooling, etc. Full documentation about emergency procedures is required prior to operating the detectors in full running mode.
If RHIC is close to getting beam in the machine DAQ and GL-1 experts shall be on call. They will be given priority to set up a GL-1 trigger and timing. Priorities shall be set in advance during the daily meeting, allowing one subsystem to change run, trigger or timing configurations for their tests, which may influence the other systems running in parallel.
The broad recommendations of the Task Force can be summarized as follows: