<html>
<head><title>ATLAS FullChainTest</title></head>
<body>
<TABLE BORDER=0 CELLSPACING=0 CELLPADDING=5><TR>
<TD><h2>ATLAS FullChainTest</h2>
<p><b>Author: <a href="http://cern.ch/Seth.Zenz">Seth Zenz</a></b> (Email: Seth dot Zenz at cern dot ch)
<p>Welcome to the results page for the automated ATLAS Full Chain Test. The test aims to quickly catch errors that occur only once every few events in the full production chain, before the production system runs into them. It runs on the RTT queues at CERN for each completed nightly and is started by a cron job that checks for new releases once per hour.
<p><b>Status</b> <i>(Last update: March 21, 13h00 CET)</i>
<ul>
<li>Running automatically on 12.0.6 bugfix nightlies (reopened)
<ul>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_0.html">rel_0</a>: ran out of disk; all ok in quick test rerun
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_1.html">rel_1</a>: all ok <b><i>except</i></b>
<ul>
<li>TAG failed as above. Problem now reproducible, under investigation</li>
<li>"Oracle error ORA-02391: exceeded simultaneous SESSIONS_PER_USER limit" in one job; under investigation</li>
</ul></li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_2.html">rel_2</a>: all ok (including pileup) except TAG as above</li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_3.html">rel_3</a>: pileup accidentally killed; all ok otherwise except TAG
<ul><li>ERROR Unable to set property PrepareAll of TriggerPrepLooper.L2TrigPrep_L2JetPreparator</li></ul></li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_4.html">rel_4</a>: pileup still running, <b>all</b> non-pileup ok!
</ul></li>
<li>Now also running automatically on release 13 dev nightlies
<ul>
<li>All jobs have failed so far, due to unignored ERRORs</li>
</ul></li>
<li>Pileup digitization and reconstruction tested starting in rel_6
<ul><li>Running only two events as of rel_1 due to large CPU time (~11 min/event)
</ul></li>
</ul>
</TD><TD><img src="doc/Flowchart.jpg">
<p><TABLE BORDER=3 CELLPADDING=5 CELLSPACING=0><TR><TD><b>Links</b>
<ul>
<li>ERROR output: <a href="bugfix/error_output">bugfix</a> <a href="dev/error_output">dev</a></li>
<li>checkFile.py Output: <a href="bugfix/check_file">bugfix</a> <a href="dev/check_file">dev</a></li>
<li>Last good root file from each stage: <a href="bugfix/last_good_root">bugfix</a> <i>dev</i> <a href="12.0.6.1/root_files">12.0.6.1</a></li>
<li>Directories where transforms were run: <a href="bugfix/full_output">bugfix</a> <a href="dev/full_output">dev</a>
<ul><li>From here, navigate to the individual run directories, labelled by date, release and job type, to find the original log files</li></ul></li>
<li>AFS access
<ul>
<li>/afs/cern.ch/atlas/offline/external/FullChainTest</li>
<li>Links to last good root file - /{branch}/last_good_root</li>
<li>Run directories - /{branch}/full_output</li>
</ul></li> </ul> </TD></TR></TABLE>
</TD></TR></TABLE>
<b>Talks</b>
<ul>
<li><a href="doc/Zenz-Validation-FullChainTest-27Feb2007.pdf">27 February 2007</a></li>
<li><a href="doc/Zenz-LBLSoftware-FullChainTest-13Mar2007.pdf">13 March 2007</a></li>
</ul>
<b>Details</b>
<p>The test does the following:
<ul>
<li>Automatically checks the stamp link once per hour and starts jobs if it has changed.
<ul>
<li>bugfix - /afs/cern.ch/atlas/software/builds/nightlies/12.0.X/AtlasProduction/latest_copied_releaseBF32BS3ProdOpt</li>
<li>dev - /afs/cern.ch/atlas/software/builds/nightlies/dev/AtlasProduction/latest_copied_releaseDev32BS4ProdOpt</li>
</ul></li>
<li>Current jobs:
<ul><li>DC3.005200.T1_McAtNlo_Jimmy - 5000 events generated, 50 events simulated/digitized</li>
<li>DC3.005300.PythiaH130zz4l - 5000 events generated, 50 events simulated/digitized</li>
<li>DC3.006384.PythiaH120gamgam - 5000 events generated, 50 events simulated/digitized</li>
<li>DC3.005188.A3_Ztautau_filter - 5000 events generated, 50 events simulated/digitized</li>
<li>RecoAll - basic reconstruction of the output from all of the above in one job (200 events)
<ul><li>ESD, AOD, TAG</li></ul></li>
<li>BackRecoAll - same as RecoAll, but reconstructs using older release (currently 12.0.6.1)
<li>PileupDigi - Digitize, with pileup, hits from DC3.005200.T1_McAtNlo_Jimmy (50 events)
<ul>
<li>Uses luminosity of 2*10^33, cavern background 10x nominal
<li><a href="pileup_input">Input files for cavern background and min bias</a></li>
</ul></li>
<li>Pileup reconstruction using pileup digi output
<ul><li>ESD, AOD, TAG</li></ul></li>
</ul></li>
<li>If any step in production fails, use the last good version of the output for the next step.</li>
<li>Make the "last good version" for each step publicly available.</li>
<li>Post-processing
<ul><li>ERROR file
<ul><li>First lists job transform summary, including all un-ignored ERRORs</li>
<li>Next lists all ERRORs in context</li></ul></li>
<li>Output from checkFile.py available for all ESD and AOD files produced</li></ul></li>
<li>Key results are accessible from this website, and the entire directory can be browsed on AFS:
<ul><li>/afs/cern.ch/atlas/offline/external/FullChainTest</li></ul></li>
</ul>
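<p>The checking, fallback and post-processing steps listed above can be sketched roughly as follows. This is only an illustration, not the test's actual scripts: it assumes the stamp is a symbolic link whose target changes with each nightly, and the function names, the state file and the "non-empty file" criterion are all hypothetical.

```python
import os
from pathlib import Path


def stamp_changed(stamp_link, state_file):
    """Return True when the stamp symlink target differs from the one seen last time.

    Assumption: the stamp is a symlink; state_file is a hypothetical local
    file that remembers the previously seen target between cron runs.
    """
    current = os.readlink(stamp_link)
    state = Path(state_file)
    previous = state.read_text().strip() if state.exists() else None
    if current == previous:
        return False
    state.write_text(current + "\n")  # remember the new target for the next check
    return True


def pick_input(fresh_output, last_good):
    """Return this run's output if it exists and is non-empty, else the last good file."""
    if os.path.isfile(fresh_output) and os.path.getsize(fresh_output):
        return fresh_output
    return last_good


def errors_in_context(log_lines, context=2):
    """Return (line_number, surrounding_lines) for every line containing 'ERROR'."""
    hits = []
    for index, line in enumerate(log_lines):
        if "ERROR" in line:
            start = max(0, index - context)
            hits.append((index + 1, log_lines[start:index + context + 1]))
    return hits
```

<p>An hourly cron job could call <code>stamp_changed</code> with one of the stamp paths listed above and submit jobs only when it returns True; each later step would then take its input from <code>pick_input</code>.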
<p><b>In progress</b>
<ul>
<li>(See status.)</li>
</ul>
<b>To be added</b>
<ul>
<li>List selected warnings</li>
<li><a href="doc">Documentation</a></li>
</ul>
<b>Old status messages</b>
<ul>
<li>Week of March 5
<ul>
<li>rel_2: run on AOD and TAG fails</li>
<li>rel_3: TAG fails, everything else (including AOD) ok
<ul><li>Rerun due to intermittent environment problem - now fixed</li></ul></li>
<li>rel_4: Build late; reco not run due to setup error (see rel_5 instead)</li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_5.html">rel_5</a>: all ok </li>
</ul></li>
<li>Week of March 12
<ul>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_6.html">rel_6</a>: all ok except pileup AOD (pileup TAG untested)
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_0.html">rel_0</a>: all ok except pileup AOD (pileup TAG untested)
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_1.html">rel_1</a>: all ok except pileup AOD</li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_2.html">rel_2</a>: all ok (including pileup AOD), except...
<ul><li>pileup digi and non-pileup TAG not run due to script typos</li></ul></li>
<li>rel_2 rerun: non-pileup TAG ok... pileup digi exceeded queue time limit
<ul>
<li>Killed after 13134 CPU sec (queue = atlasrttmedium)</li>
<li>Pileup digi will use long queue starting in rel_4</li>
</ul></li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_3.html">rel_3</a>: all ok, except pileup digi exceeded time limit again</li>
<li>rel_4: non-pileup all ok <b>except TAG</b>, pileup ok
<ul><li>TAG OK in rerun; problem not reproducible</li></ul></li>
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_5.html">rel_5</a>: bad run, old packages in local test area
<li><a href="http://atlas-computing.web.cern.ch/atlas-computing/links/distDirectory/nightlies/projects/nicos_web_areaBF32BS3ProdOpt/nicos_content_6.html">rel_6</a>: all ok except TAG
<ul><li>"not reproducible" problem is back</li><li>under further investigation</li></ul></li>
</ul></li>
</ul>
</body></html>