Difference: DanielHaydenDataSamples (22 vs. 23)

Revision 2307 Mar 2011 - DanielHayden

Line: 1 to 1
 
META TOPICPARENT name="DanielHayden"

Data Samples

Line: 56 to 56
 You need to obtain the following two files via, although the latter most is really just an example file:

> get_files PDGTABLE.!MeV

Changed:
<
<
> get_files Evgen_trf.py (Importantly for MC10)

>
>
> get_files EvgenCorrAtlfast_trf.py
> get_files Evgen_trf.py

 
Changed:
<
<
Then you are ready to make your first sample. Using Evgen_trf.py, you can type this with the -h command after to see what the following tags mean, but to summarise in order they are; CentreOfMassE / RunNumber (Has to match your jobOption) / Starting Event / Number of Events / Random 10 Digit Seed / JobOptions / Output Filename.
>
>
Then you are ready to make your first sample. Using "EvgenCorrAtlfast_trf.py, you can type this with the -h command after to see what the following tags mean, but to summarise in order they are; CentreOfMassE / RunNumber (Has to match your jobOption) / Starting Event / Number of Events / Random 10 Digit Seed / JobOptions / Output Filename.
  In Reality if you ONLY want the Electron Container information etc without Jet Info and other things, then you only need to run the EvgenCorrAtlfast_trf.py.

> Evgen_trf.py 7000 0700000 1 5000 1234567812 MC10.0700000.Pythia_Gee01_1000.py Output.pool.root

> EvgenCorrAtlfast_trf.py 7000 0700000 1 5000 1234567812 MC10.0700000.Pythia_Gee01_1000.py Output.pool.root AOD.pool.root ntuple.pool.root

Changed:
<
<
Note here, that MC10.070000.template.py is your joboptions file, and 5000, is the number of events you are going to make in your sample. The other file names are your output. I would recommend redirecting these to your tmp directory on lxplus as they can be 100mb big or so. Just remember to send them to linappserv1 or somewhere before you next log off so they don't disappear!
>
>
Note here, that MC10.070000.Gee01_1000.py is your joboptions file, and 5000, is the number of events you are going to make in your sample. The other file names are your output. I would recommend redirecting these to your tmp directory on lxplus as they can be 100mb big or so. Just remember to send them to your institutes local server or somewhere before you next log off so they don't disappear!
 

Making Validation Plots & Collecting Information

Changed:
<
<
The next step is to take one of your no doubt already existing analysis programs, and strip it down so all it does it create plots of basic things like the, et spectrum, eta spectrum, and invariant mass, as an example. Apply no cuts, as you just want to see what the sample looks like. Save these plots, and include them with all the other information that you will eventually upload.
>
>
The next step is to take one of your no doubt already existing analysis programs, and strip it down so all it does it create plots of basic things like the, et spectrum, eta spectrum, and invariant mass, as an example. Apply no (or minimal) cuts, as you just want to see what the sample looks like. Save these plots, and include them with all the other information that you will eventually upload.
  Important! When the job finishes running, you will be left with a number of log files, namely EvgenCorrAtlfast_trf.log for Evgen.log, you will want to keep these files as they contain all of your cross section information!!! Also remember to take note of the event generator efficiency, which is usually set to about 0.90, and can be found on the last line of your joboptions file usually. You will also want to look through them to see exactly what version of Pythia you are using, if you haven't taken note of it before now.
Line: 96 to 97
 

Automated Process

Changed:
<
<
First of all I want to thank Peter Waller for his coding expertise which enabled me to realise this Process. I was faced with having to do all of the above, for 15 samples, and I thought there must be a simpler way of doing things, which led me to come up with this idea.
>
>
First of all I want to thank Peter Waller for his coding expertise which helped me realise this Process. I was faced with having to do all of the above, for 15 samples, and I thought there must be a simpler way of doing things, which led me to come up with this idea.
  First of all you have to have set up most of the above steps, i.e. you need to:
Line: 105 to 106
 
  • Copy this Macro to the run directory of the above.
  • Follow the steps below.
Changed:
<
<
Now you will need to make a folder (I put it on my Desktop) and inside the folder do two things; Make a folder for every sample (Named Accordingly), and download the files below into the main folder.
>
>
Now you will need to make a folder (I put it on my Desktop) and inside the folder do two things; Make a folder for every sample (Named Accordingly, i.e. for MC10.0700000.Pythia_Gee01_500.py, I make a subfolder called Gee01_500 and so on...), and download the files below into the main folder.
  run_one_job.sh
run_all.sh
parse_evgen_log.py
joblist.txt
Changed:
<
<
You will have to change the paths in these files (namely joblist.txt and run_one_job.sh) to match and make sense for your own setup, but the files are self explanatory and I have added comments where necessary to make it a fairly trivial process. Also I have named a jobOptions called MCProduction.py in the macro but have omitted it here. This file simply runs over the .root output file and outputs the histograms you require for validation (usually; Pt, Eta, and Invariant Mass) without any cuts. I have not included it here as everyones setup / needed plots will be slightly different, it is easy to make, and as different releases come out the code might change where as what I have provided should always be true.
>
>
You will have to change the paths in these files (namely joblist.txt and run_one_job.sh) to match and make sense for your own setup, but the files are self explanatory and I have added comments where necessary to make it a fairly trivial process. Also I have named a jobOptions called MCProduction.py in the macro but have omitted it here. This file simply runs over the .root output file and outputs the histograms you require for validation (usually; Pt, Eta, and Invariant Mass) without any (or minimal) cuts as I previously described. I have not included it here as everyones setup / needed plots will be slightly different, it is easy to make, and as different releases come out the code might change where as what I have provided should always be true.
 
Changed:
<
<
The final step is to put a JobOptions File into each respective folder, and then simply open a terminal, navigate to the folder and run with ./run_all.sh.
>
>
The final step is to put a JobOptions File into each respective subfolder, and then simply open a terminal, navigate to the main folder containing all of the downloaded pieces of code above, and your subfolders with jobOptions in, and then run with ./run_all.sh.
  The result should be that each sample folder is filled with the needed validation plots, and cross-section text file, thus simplifying and automating the whole process to the nth degree!

Line: 119 to 120
 The result should be that each sample folder is filled with the needed validation plots, and cross-section text file, thus simplifying and automating the whole process to the nth degree!

Deleted:
<
<
 

Links

Changed:
<
<
>
>
 
Line: 145 to 143
  -->
Changed:
<
<
META FILEATTACHMENT attachment="run_all.sh" attr="h" comment="" date="1267188942" name="run_all.sh" path="run_all.sh" size="51" stream="run_all.sh" user="Public.DanielHayden" version="1"
META FILEATTACHMENT attachment="run_one_job.sh" attr="h" comment="" date="1267189001" name="run_one_job.sh" path="run_one_job.sh" size="1922" stream="run_one_job.sh" user="Public.DanielHayden" version="1"
META FILEATTACHMENT attachment="parse_evgen_log.py.txt" attr="h" comment="" date="1267189014" name="parse_evgen_log.py.txt" path="parse_evgen_log.py" size="709" stream="parse_evgen_log.py" user="Public.DanielHayden" version="1"
META FILEATTACHMENT attachment="job_list.txt" attr="h" comment="" date="1267189030" name="job_list.txt" path="job_list.txt" size="2217" stream="job_list.txt" user="Public.DanielHayden" version="1"
>
>
META FILEATTACHMENT attachment="run_all.sh" attr="h" comment="" date="1299503341" name="run_all.sh" path="run_all.sh" size="52" stream="run_all.sh" user="Public.DanielHayden" version="2"
META FILEATTACHMENT attachment="run_one_job.sh" attr="h" comment="" date="1299503371" name="run_one_job.sh" path="run_one_job.sh" size="1758" stream="run_one_job.sh" user="Public.DanielHayden" version="2"
META FILEATTACHMENT attachment="parse_evgen_log.py.txt" attr="h" comment="" date="1299503400" name="parse_evgen_log.py.txt" path="parse_evgen_log.py" size="726" stream="parse_evgen_log.py" user="Public.DanielHayden" version="2"
META FILEATTACHMENT attachment="job_list.txt" attr="h" comment="" date="1299503415" name="job_list.txt" path="job_list.txt" size="1728" stream="job_list.txt" user="Public.DanielHayden" version="2"
 
META FILEATTACHMENT attachment="ValidationPlots.C" attr="h" comment="" date="1267189084" name="ValidationPlots.C" path="ValidationPlots.C" size="589" stream="ValidationPlots.C" user="Public.DanielHayden" version="1"
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding RHUL Physics Department TWiki? Send feedback