Giving researchers what they want
SPIRES, High-energy physics and subject repositories
Travis Brooks
SLAC National Accelerator Laboratory
INSPIRE Collaboration
OAI 6, June 18 2009
T. Brooks OAI 6 18/6/09

Overview
a. History of Subject Repositories in High Energy Physics
   a. User driven
b. Current status and observations
   a. User driven
c. Future Plans
   a. User driven

Infrastructure
a. The basic facilities, services and installations needed for the functioning of a community or society (wiktionary.org)

Community: HEP
a. Questions like:
   a. What is the universe made of
   b. How does that stuff (us) get along with everything else
b. HEP Researchers
   a. About 20-30,000 worldwide
   b. Distinction between Theory and Experiment

Users
a. Theory
   a. 50% of the people
   b. 80% of the papers
   c. Small, global collaborations (<10 authors)
   d. Self-contained papers
b. Experiment
   a. 50% of the people
   b. 20% of the papers
   c. Large, global collaborations
      a. >2000 authors on CERN LHC papers
   d. Big centers of research
      a. SLAC, Fermilab, CERN, DESY, KEK

Community: HEP
a. Connections
   a. Labs connected to experiments
   b. People connected in collaborations
   c. Institutes connected to their papers
b. Information Needs
   a. Results as fast as possible
   b. New ideas shared rapidly
   c. Conversational
   d. Simplicity of discovery

Where do users look?

Read Journals?
a. Several places to look
b. Too Slow – Researchers read (and cite) preprints in the first few months

Preprint Culture
a. Connections + desire for speed -> Preprint culture
   a. driven at the researcher level
b. Rapid Communication
c. Self-contained papers
d. Self-contained community of experts

Search Institutional Repositories?
a. Not favored by HEP researchers
b. Too many places to look
   a. Search is complex
c. Many papers not in any IR
   a. Leaks, Institutions without IR, older papers, etc.

Where do users look?

SPIRES

SPIRES’ History
a. First HEP Institutional Repositories store papers
b. Distributed via postal mail to major centers
c. SPIRES catalogs (and distributes) preprints received at SLAC
d. Centralized, community-driven model
   a. Major lab libraries... essentially the world HEP preprint catalog.
e. Preprint list
   a. SPIRES distributes preprint list "what's new" on weekly basis (much faster than publication)
   b. Published papers get put on “anti-preprint” list (preprints that became published)
   c. Really Simple Syndication!

SPIRES’ History
a. Collaboration of DESY, Fermilab and SLAC
b. Community driven and defined
c. Currently 1-1.5 Million queries/month
d. Index to HEP literature for 35 years
   a. Via terminal login
   b. Via email
   c. Via web (1st U.S. Website/1st web database)

arXiv.org
a. Since 1991 - “Extension” of SPIRES to Fulltext
b. Electronic Preprint dissemination

User Satisfaction
a. No mandate, no debate, no advocacy:
   a. 100% Author driven
b. Author-formatted peer-reviewed revisions uploaded
c. (Almost) all publishers allow self-archiving.
[Figure: Fraction of articles posted to arXiv]

Where Do Physicists Search?
From 2007 survey of 2,000 physicists by CERN, DESY, Fermilab and SLAC.
Gentil-Beccot et al, Information Resources in High-Energy Physics: Surveying the Present Landscape and Charting the Future Course. J. Am. Soc. Inf. Sci. 60:150-160, 2009, arXiv:0804.2701

Benefits to Researchers
a. arXiv+SPIRES
   a. Centralized discipline-based repository with curated metadata/search
      a. Discovery is easy (1-stop)
      b. Includes peer-reviewed literature
         a. matching/joining if preprinted
      c. Access is easy
         a. DOIs, URLs, arXiv
         b. Links to every known copy
      d. Speed is instant for preprints, peer review follows after the necessary delay
   b. The best features of Journals and Repositories, combined

Researchers like speed
1. Articles as a mode of discussion
2. Rapidly advancing field

Benefits to Repositories
a. SPIRES + arXiv
   a. Authors motivated to submit... since they search there
   b. SPIRES/arXiv is where the HEP conversation takes place
      a. If you don't submit, you don't get read
   c. Affiliation search
      a. IRs can fill themselves from affiliation searches

Benefits to Publishers
a. Can reach all of HEP in one place
   a. SPIRES/arXiv directs eyeballs to the published versions
   b. Integrated services
      a. Cross-linking
      b. Submit papers from arXiv to journal
      c. Metadata feeds... in both directions

Why SPIRES + arXiv?
a. Grew from a community
   a. Global collaborations
   b. Connections with large research centers
   c. Researchers, Repositories, Publishers all involved
b. Evolved from user needs:
   a. Simplicity of discovery
   b. Speed of communication
   c. Published literature

Future of HEP Information
a. Continue to evolve
b. Conversations on arXiv
   a. Noting, but not waiting for peer review.
c. blog/wiki-like
a. Most of the everyday information research tasks in HEP are carried out on one of two sites
b. Freely accessible content
c. Community driven
d. Use technology to tighten this relationship further… with an existing community

Future of HEP Information
a. HEP becoming more interdisciplinary
   a. Particle astrophysics
b. Literature growing more complex
   a. Computer code
   b. Objects that aren’t papers, but are “information”
      a. “Datasets”, figures, tables
c. Advances in information systems
   a. Modern coding and design
   b. Mashups
   c. Web 2.0

Hidden 20 FTE – Can be utilized via interactive techniques
[Figure label: Hidden 20 FTE]
From 2007 survey of 2,000 physicists by CERN, DESY, Fermilab and SLAC.
Gentil-Beccot et al, Information Resources in High-Energy Physics: Surveying the Present Landscape and Charting the Future Course. J. Am. Soc. Inf. Sci. 60:150-160, 2009, arXiv:0804.2701

SPIRES’ Future?
a. SPIRES should grow with the field and with technology
b. SPIRES’ 35-year-old infrastructure cannot take advantage of new tools
   a. Needs a solid foundation on which to build
   b. 3-4 years ago SPIRES began looking for migration possibilities

INSPIRE
a. Joint Project of CERN, DESY, Fermilab and SLAC
b. Migrate SPIRES to CERN’s Invenio platform
c. Rollout: End 2009
d. SPIRES Community Organization transitions to INSPIRE
   a. Bring down rigidly defined walls
   b. Move to 21st century

Invenio: Modern System…
a. Stable, modern, extensible software stack (LAMP)
b. Fast, even with large (discipline) repository
c. Focused on search
d. Open Source (GPL) community
   a. Substantial HEP use (CERN, ILC, …)
   b. Over 20 production instances worldwide
e. Modular architecture
f. Based on open standards
   a. MARCXML, OAI-PMH, etc.
g. Flexible in every layer

Complementing SPIRES’ Strengths
a. Decades of trusted, curated content
b. Experience managing a discipline-wide information resource
c. Close relationship with worldwide user community
d. Operational resources at major labs
   a. Will move forward to INSPIRE

Opportunities
a. Understanding Authors
   a. Claim your papers
   b. Which J. Ellis? (Already have affiliation data)
   c. Assist in referee selection
   d. Standardizing formats for author list
b. Data Objects
   a. Index locations of large data stores
      a. Connect them to papers
   b. Hosting figures, tables, plots and other smaller data objects

Opportunities
a. Keywording/Tagging
   a. Automated extraction using taxonomy
   b. User tagging
      a. You tell your group
      b. You tell PDG
b. Closer work with other fields
c. Improved Jobs system for HEP

INSPIRE and Repositories
a. Define a consistent API
   a. Federating searches
   b. generating bibliometrics (on the grid, even!)
   c. metrics for organizations
b. Will use open standards for metadata exchange
   a. SWORD populating other repositories
   b. OAI-PMH for harvesting and exposing
   c. OAI-ORE for Tags/Comment, Data and other objects
   d. Start on preprints... continue through journal
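
To make the "OAI-PMH for harvesting and exposing" point concrete, here is a minimal sketch of how an institutional repository might build a ListRecords request and read the reply. It is not INSPIRE's API: the base URL, the identifiers and the sample response are hypothetical, and only the standard OAI-PMH 2.0 request arguments (verb, metadataPrefix, from, set, resumptionToken) are assumed. No network call is made; the response is a hand-written example.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Hypothetical endpoint, for illustration only.
BASE_URL = "https://repository.example.org/oai2d"


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None,
                     set_spec=None, resumption_token=None):
    """Build a ListRecords request URL.

    resumptionToken is an exclusive argument in OAI-PMH, so when one is
    given no other argument besides the verb is sent.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if from_date:
            params["from"] = from_date
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None)."""
    root = ET.fromstring(xml_text)
    identifiers = [
        header.findtext("oai:identifier", namespaces=OAI_NS)
        for header in root.iterfind(".//oai:record/oai:header", OAI_NS)
    ]
    token = root.findtext(".//oai:resumptionToken", namespaces=OAI_NS)
    # An empty or missing token means the list is complete.
    return identifiers, (token or None)


# Hand-written sample reply (made-up identifiers), trimmed to the headers.
SAMPLE_RESPONSE = """<?xml version="1.0" encoding="UTF-8"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:1</identifier>
        <datestamp>2009-06-01</datestamp>
      </header>
    </record>
    <record>
      <header>
        <identifier>oai:repository.example.org:2</identifier>
        <datestamp>2009-06-02</datestamp>
      </header>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
"""

if __name__ == "__main__":
    print(list_records_url(BASE_URL, from_date="2009-06-01"))
    ids, token = parse_list_records(SAMPLE_RESPONSE)
    print(ids, token)
    if token:
        print(list_records_url(BASE_URL, resumption_token=token))
```

A harvester would repeat the request with each returned resumption token until none comes back, which is how a repository could keep itself filled incrementally rather than re-fetching everything.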

INSPIREing Future
a. INSPIRE continues the tradition of discipline repositories in HEP
b. HEP discipline repositories are not add-ons or afterthoughts, but a part of the Infrastructure
   a. With users as active partners
   b. With user needs forefront in the design and operation
   c. Built by a community, for a community

Infrastructure
a. The basic facilities, services and installations needed for the functioning of a community or society (wiktionary.org)

Questions?
a. For more information on INSPIRE see http://www.projecthepinspire.net