Mike Whalley Durham University M R Whalleydurham ac

  • Slides: 11
Download presentation
Mike Whalley Durham University M. R. Whalley@durham. ac. uk 10 th May 2007 SLAC-PPA

Mike Whalley Durham University M. R. Whalley@durham. ac. uk 10 th May 2007 SLAC-PPA Summit 1

The Durham HEP Database Group We are a small group based in the IPPP

The Durham HEP Database Group We are a small group based in the IPPP at Durham University in the UK, whose primary mission is the compilation of: “Products” HEP Reaction/Scattering Data Reaction Database & Data Reviews Personnel involved: Mike Whalley Joanne Bentham – DBM/Project Manager – Database Assistant Funded by PPARC(UK) STFC(UK) PLUS……. Since ~ 1975 – originally mainly 2 body final state data …now…. to compile ALL published data on HEP scattering cross sections etc. . UK Mirror Sites of…. . SLAC-SPIRES – (hep etc…) + LBNL Review of Particle Physics 10 th May 2007 SLAC-PPA Summit 2

Types of Particle Physics Data Particle Properties Masses Lifetimes Spin etc… LBNL PDG Bibliographic

Types of Particle Physics Data Particle Properties Masses Lifetimes Spin etc… LBNL PDG Bibliographic Experimental and Theory papers SLAC spires/hep & ar. Xiv archives CDF (Fermilab) jet cross section 10 th May 2007 Reaction (scattering) Cross Sections Polarizations Event Shapes etc…. ab cd… Durham “Reaction Data” Database H 1 (DESY) low-x F 2 measurements SLAC-PPA Summit 3

Present Journals ar. Xiv Experiments Durham Data Compilers Durham HEPDATA BDMS Reaction Database Users

Present Journals ar. Xiv Experiments Durham Data Compilers Durham HEPDATA BDMS Reaction Database Users DESY Keywords PDG UK Mirrors SPIRES HEP Berkeley SLAC BDMS PDG 10 th May 2007 SLAC-PPA Summit SPIRES HEP Reaction Database Mirror 4

Reaction/Scattering Data - The Durham Database Group WHY? Large amounts of money are spent

Reaction/Scattering Data - The Durham Database Group WHY? Large amounts of money are spent on experiments to collect the data therefore efforts should be made to make sure it is not lost and available in the long term. Such a data store is essential if, for example, earlier and maybe lower energy data, as well as current data, are to be used in data/theory comparisons, tuning Monte Carlos and in designing new experiments. To provide an easy, and consistent, way of locating data. Strengths & Weaknesses Strengths: • Long term commitment • Done by physicists • Comprehensive coverage Weaknesses: • Old DBMS • Limited output formats • Complicated search syntax • Lack of modern networking • Lack of personnel (see future …. ) Data from Journals – peer reviewed – (not prelim. or conf. ) – direct from experiments if only in plot form – verified by authors. ~10, 000 records(papers) -1970 s-present data – currently ~150/year 10 th May 2007 SLAC-PPA Summit 5

Data Reviews • Since 1984 the HEPDATA group has produced and published reviews of

Data Reviews • Since 1984 the HEPDATA group has produced and published reviews of “timely and topical” subsets of the data in the HEPDATA database. • Published in Io. P Journal of Physics G and also since ~1995 on-line as web pages. • Enlist the help of experts in the particular subject. • The purpose is to provide a comprehensive “one place” archive of the data. • The on-line version is kept up-todate as new data appear. • The process of producing the review also audits the database ensuring that it contains all the data on a particular topic. 10 th May 2007 SLAC-PPA Summit 6

The Durham-SPIRES connection < 1984 ppfs/ppas received as paper copies + QSPIRES (email) +

The Durham-SPIRES connection < 1984 ppfs/ppas received as paper copies + QSPIRES (email) + STAIRS (at RAL) 1984 Durham HEPDATA group produced a database – using BDMS, - weekly ppfs and merging in the ppas. Accessible by logging into remote machines (with guest account). ~1993 Moved to web based front end. Updating weekly, then eventually nightly – but just the ppf/ppa subset of the data. Added conference, hepnames, … 1999 Full mirror service developed – the cut-down version was not enough. 2006 Full rsync of all spires databases nightly. Uses ‘IRN’ as the link 10 th May 2007 SLAC-PPA Summit 7

Future Journals ar. Xiv Experiments ascii root aida xml Durham Data Compilers BDMS Users

Future Journals ar. Xiv Experiments ascii root aida xml Durham Data Compilers BDMS Users Old Reaction Database PDG 10 th May 2007 Java coded data model SPIRES HEP SLAC-PPA Summit My. SQL New Reaction Database UK Mirrors CEDAR MC validation Monte Carlos generate observed distributions Jet. Web Users BDMS Reaction Database 8

CEDAR HEPDATA - Durham JETWEB - UCL + = Combined E-science DAta Resource for

CEDAR HEPDATA - Durham JETWEB - UCL + = Combined E-science DAta Resource for HEPDATA – archive of HEP data. JETWEB – a “tool” developed to facilitate the comparison and tuning of Monte Carlo programs (eg) PYTHIA, HERWIG etc. . with real data £ 350 K over 3 years from the PPARC E-Science call to update and join these two together to make a powerful data/MC tuning resource for the start of the LHC 10 th May 2007 SLAC-PPA Summit 9

CEDAR workplan MC programs Experiments Inputting data directly publications HEPDATA Design new (relational) DB

CEDAR workplan MC programs Experiments Inputting data directly publications HEPDATA Design new (relational) DB schema. CEDAR network/grid JETWEB USERS Migrate data to the new DBMS – My. SQL JETWEB uses Grid technology to run fitting jobs remotely Modify JETWEB to take data directly from the new HEPDATA DB Network new DB to JETWEB (or to any user’s programme!) Re-write HZTOOL in C++ to handle new MCs for LHC Develop direct entry and maintenance of data by the experiments. Develop the GRID accessiblility 10 th May 2007 SLAC-PPA Summit 10

The Durham HEP Databse Group - Summary • Since 1975 – Data Compilation of

The Durham HEP Databse Group - Summary • Since 1975 – Data Compilation of all types of HEP scattering data • Products: Reaction Database Data Reviews • Maintain the UK Mirror sites of SLAC/SPIRES LBNL PDG web pages • Future: 1. 2. 3. 4. New Reaction Database – My. SQL with Java based model. Expand output types : graphics, ascii, root, aida, xml, etc… Improve input methods (eg direct maintainence by expts(? )… Involvement with CEDAR (MC validation) project. 10 th May 2007 SLAC-PPA Summit 11