Solr Wayback Demo of the Solr Wayback search
Solr. Wayback Demo of the Solr. Wayback search interface and playback engine for WARCs Anders Klindt Myrvoll Programme Manager – the Danish web archive NAS Workshop 22. February 2019 Madrid
12 -06 -2021
12 -06 -2021
INDEX SEARCH/ FRONT END INTERFACE British Library Webarchivediscovery/ Warc-indexer framework HARVEST WWW PLAYBACK ENGINE Build in socks proxy to prevent leaking TOOLS PWID WEB ARCHIVE WITH ARC/WARC FILES Out of the box, open source web-application for researchers to explore Arc/Warc files.
PWID/XML web archive content coverage Best poster i. PRES 2018 pwid: mia. oszk. hu: 2018 -04 -24 T 09: 06: 21: page: https: //mnm. hu/en/museum Eld Zierau time of archiving archived URL PWID IIPC, Wellington, 2018
Visualization of crawltimes
Domain development over time
Installing Solr. Wayback • Easy to install and use on Mac, Linux and Windows. Contains Webserver, Solr and warcindexing tool. Just drop Arc/Warcs into a folder and start exploring the corpus. • Github-link https: //github. com/netarchivesuite/solrwayback
More info The National Széchényi Library - Hungary http: //193. 6. 201. 202/solrwayback/ Gabor Vitez re-wrote the geo search form google maps to open streetmaps. Athens University of Economics and Business (older version of Solr. Wayback) http: //archive. aueb. gr/ • Toke Eskildsen has helped with sparring / performance tuning and a warc export. • Niels Gamborg has made 75% of the front end search interface and tools. Retired now. Abstract IIPC, Wellington, 2018
Contact Solr. Wayback - Thomas Egense teg@kb. dk @Thomas. Egense PWID - Eld Zierau elzi@kb. dk @Eld. Zierau General inquires – Anders Klindt Myrvoll ankm@kb. dk @Anders. Klindt IIPC, Wellington, 2018
Questions and discussion IIPC, Wellington, 2018
Search example showing hits. Images are shown in search-result.
Google like image search in the web-archive. 12 -06 -2021
SOLRWayback showing an archived webpage with an overlay statistics and further navigation options.
Page previews for different harvest times of a given url. Images are generated real-time and uses the build in socks proxy to prevent leaking to the live web.
Interactive domain link graph
Visualization of crawltimes
XML/PWID
Search by gps location for images having exif location information
Domain development over time
- Slides: 21