Research Publications
Publications 2024
Janek Bevendorff¹,∗, Matti Wiegmann²,∗, Martin Potthast¹,³, and Benno Stein²
Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines
In: Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹Leipzig University, ² Bauhaus-Universität Weimar, ³ ScaDS.AI, *equal contribution
Noor Afshan Fathima¹, Andreas Wagner¹, Martin Golasowski², Jon Truckenbrodt³, Katja Mankinen⁴, Stephan Hachinger⁵, Michael Granitzer⁶*
Launch of the Pilot Infrastructure
The document provides a detailed overview of the first deliverable of Work Package 5 (WP5) entitled “D5.1 Launch of the Pilot Infrastructure”, which is part of the OpenWebSearch.eu project. March 2024. Zenodo.
Link to publication
¹European Organization for Nuclear Research, ² Technical University of Ostrava, ³ German Aerospace Center, ⁴ CSC – IT Center for Science, Finland, ⁵ Leibniz Supercomputing Centre, ⁶ University of Passau, *Project leader
Sebastian Schmidt¹, Ines Zelch¹, Janek Bevendorff¹, Maik Fröbe¹, Martin Potthast¹, Michael Granitzer²
The OpenWebSearch WARC parsing & content analysis library Version 1.0
This publication reviews the status of Deliverable D2.1 “The OpenWebSearch WARC parsing and content analysis library” of the OpenWebSearch.eu project. March 2024. Zenodo.
Link to publication
¹Leipzig University, ² University of Passau
Publications 2023
Michael Granitzer et al.¹
Impact and Development of an Open Web Index for Open Web Search
In: JASIST, Wiley, August 2023
Link to publication
¹University of Passau
Maik Fröbe¹, Jan Heinrich Reimer¹, Sean MacAvaney², Niklas Deckers³, Simon Reich³, Janek Bevendorff⁴, Benno Stein⁴, Matthias Hagen¹, Martin Potthast³
The Information Retrieval Experiment Platform
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Friedrich-Schiller-University Jena, ² University of Glasgow, ³ Leipzig University, ⁴ Bauhaus-University Weimar
Janek Bevendorff¹, Sanket Gupta¹, Johannes Kiesel¹, Benno Stein¹
An Empirical Comparison of Web Content Extraction Algorithms
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Bauhaus-University Weimar
Harrisen Scells¹ and Martin Potthast¹
Pybool_ir: A Toolkit for Domain-Specific Search Experiments
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Leipzig University
Harrisen Scells¹, Ferdinand Schlatt², and Martin Potthast¹
Smooth Operators for Effective Systematic Review Queries
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Leipzig University, ² Friedrich-Schiller-University Jena
Jan Heinrich Reimer¹, Sebastian Schmidt², Maik Fröbe¹, Lukas Gienapp², Harrisen Scells², Benno Stein³, Matthias Hagen¹, Martin Potthast²
The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Friedrich-Schiller-University Jena, ² Leipzig University, ³ Bauhaus-University Weimar
Negin Ghasemi¹, Mohammad Aliannejadi², Hamed Bonab³, Evangelos Kanoulas², Arjen P. de Vries¹, James Allan⁴, and Djoerd Hiemstra¹
Cross-Market Product-Related Question Answering
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Radboud University, ² University of Amsterdam, ³ Amazon Inc., ⁴ University of Massachusetts Amherst
Chris Kamphuis¹, Aileen Lin², Siwen Yang², Jimmy Lin², Arjen P. de Vries¹, and Faegheh Hasibi¹
MMEAD: MS MARCO Entity Annotations and Disambiguations
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Radboud University, ² University of Waterloo
Miriam Louise Carnot¹, Lorenz Heinemann¹, Jan Braker¹, Tobias Schreieder¹, Johannes Kiesel², Maik Fröbe³, Martin Potthast¹, and Benno Stein²
On Stance Detection in Image Retrieval for Argumentation
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Leipzig University, ² Bauhaus-University Weimar, ³ Martin-Luther-Universität Halle-Wittenberg
Tim De Jonge¹, Djoerd Hiemstra¹
UNFair: Search Engine Manipulation, Undetectable by Amortized Inequity
In: Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT ’23), ACM, July 2023
Link to publication
¹ Radboud University
Sheikh Mastura Farzana¹, Tobias Hecking¹
Geoparsing at Web-scale – Challenges and Opportunities
In: GeoExT 2023: First International Workshop on Geographic Information Extraction from Texts at ECIR 2023, CEUR Workshop proceedings, April 2023
Link to publication
¹ German Aerospace Center (DLR)
Talks + Lectures
2023
Michael Granitzer¹, Christine Plote²
“NGIForum23”
Participation in Panel Discussion on “Open Web Search, Large Language Models and Beyond” at the NGI Forum, 15-16 November 2023
Link to recording
¹ University of Passau, ² Open Search Foundation
Christine Plote¹
“Der Traum vom freien Raum – Das Internet und seine Utopien”
Presentation & Talk at Akademie Tutzing, Tutzing, Germany, October 28, 2023
Link
¹ Open Search Foundation
Stefan Voigt¹
“5-year into the Open Search Initiative: state of play and future perspective”
Presentation & Talk at EnviroInfo 2023, Garching near Munich, Germany, October 11-13, 2023
Link
¹ Open Search Foundation
Dieter Kranzlmüller¹, Stephan Hachinger¹
“Open Search – new approaches towards a sustainable search infrastructure”
Presentation & Talk at EnviroInfo 2023, Garching near Munich, Germany, October 11-13, 2023
Link
¹ Leibniz Supercomputing Centre of the BAdW (LRZ)
Tobias Hecking¹
“OpenSearch@DLR project – new tools for scientific search and access to environmental data”
Presentation & Talk at EnviroInfo 2023, Garching near Munich, Germany, October 11-13, 2023
Link
¹ German Aerospace Center (DLR)
Michael Granitzer¹
“OpenWebSearch.eu – Building an Open Web Index for an open web search ecosystem”
Keynote at LWDA 2023 – “Learning, Knowledge, Data, Analysis”, Marburg, Germany, October 9-11, 2023
Link
¹ University of Passau
Janek Bevendorff¹, Matti Wiegmann¹, Martin Potthast², Benno Stein¹
“Product Spam on YouTube: A Case Study?”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Bauhaus University Weimar, ² WEBIS
Djoerd Hiemstra¹, Gijs Hendriksen¹, Chris Kamphuis¹, Arjen P. de Vries¹
“Challenges of index exchange for search engine interoperability?”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Radboud University
Maik Fröbe¹, Theresa Elstner², Harrisen Scells³, Benno Stein⁴, Martin Potthast³
“Prototyping Open Web Search Applications with TIRA: A Case Study in Research-oriented Teaching”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Friedrich Schiller Universität Jena, ² University of Leipzig, ³ WEBIS, ⁴ Bauhaus University Weimar
Michael Granitzer¹
“openwebsearch.eu – Where do we stand?”
Invited Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ University of Passau
Michael Dinzinger¹, Saber Zerhoudi¹, Michael Granitzer¹, Mohammed Al-Maamari¹, Mahmoud Istaiti¹, Jelena Mitrovic¹
“OWler: Preliminary results for building a Collaborative Open Web Crawler”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ University of Passau
Ines Zelch¹, Matthias Hagen¹, Martin Potthast²
“Commercialized Generative AI: A Critical Study of the Feasibility and Ethics of Generating Native Advertising Using Large Language Models in Conversational Web Search”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Friedrich-Schiller-University Jena, ² WEBIS
Mohammed Al-Maamari¹, Mahmoud Istaiti¹, Michael Granitzer¹, Michael Dinzinger¹, Saber Zerhoudi¹, Jelena Mitrovic¹
“A Comprehensive Dataset for Webpage Classification”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ University of Passau
Jasmin Tietgen¹, Maari Alanko²
“Governance Towards an Open Web Index”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Open Search Foundation, ² CSC
Alexander Nussbaumer¹, Sebastian Gürtl¹, Christian Gütl¹, Rohit Kaushik¹, G. Hendriksen²
“Conceptual Design and Implementation of a Prototype Search Application using the Open Web Search Index”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Graz University of Technology, ² Radboud University
Alexander Bondarenko¹, Maik Fröbe¹, Johannes Kiesel², Ferdinand Schlatt³, Valentin Barriere⁴, Brian Ravenet⁵, Léo Hemamou⁶, Simon Luck⁷, Jan Heinrich Reimer³, Benno Stein², Martin Potthast⁸, Matthias Hagen³
“Overview of Touché 2023: Argument and Causal Retrieval”
Presentation & Talk at CLEF 2023, Thessaloniki, Greece, September 18-21, 2023
¹ WEBIS, ² Bauhaus University Weimar, ³ Friedrich-Schiller-University Jena, ⁴ Centro Nacional de Inteligencia Artificial (CENIA), ⁵ University Paris-Saclay, ⁶ Sanofi R&D, ⁷ University of Bologna, ⁸ University of Leipzig
Janek Bevendorff¹, Ian Borrego-Obrador², Mara Chinea-Ríos², Marc Franco-Salvador², Maik Fröbe³, Annina Heini⁴, Krzysztof Kredens⁴, Maximilian Mayerl⁵, Piotr Pęzik⁴, Martin Potthast⁶, Francisco Rangel², Paolo Rosso⁴, Efstathios Stamatatos⁷, Benno Stein¹, Matti Wiegmann¹, Magdalena Wolska¹, Eva Zangerle⁵
“Overview of PAN 2023: Authorship Verification, Multi-Author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection Condensed Lab Overview”
Presentation & Talk at CLEF 2023, Thessaloniki, Greece, September 18-21, 2023
¹ Bauhaus University Weimar, ² Symanto Research, ³ WEBIS, ⁴ Aston University, ⁵ University of Innsbruck, ⁶ University of Leipzig, ⁷ University of the Aegean
Michael Granitzer¹
“OpenWebSearch.eu – Building an Open Web Index for an open web search ecosystem”
Keynote at Flexible Query and Answering Conference 2023, Palma de Mallorca, September 5-7, 2023
Link
¹ University of Passau
Maik Fröbe¹, Tim Gollub², Benno Stein², Matthias Hagen³, Martin Potthast⁴
“Clickbait Spoiling”
Presentation & Talk at SemEval2023@ACL23 in Toronto, Canada, July 9-14, 2023
¹ WEBIS, ² Bauhaus University Weimar, ³ Martin-Luther-University Halle-Wittenberg, ⁴ University of Leipzig
Michael Granitzer¹
“Foundation AI, LLMs and Generative AI”
Talk, BDVA Activity Group (AG54), online, 29 June 2023
¹ University of Passau
Michael Granitzer¹
“European Web Crawls and LLMs: the OpenWebSearch.eu Project.”
Invited Talk at Meta-Forum 2023, Brussels, Belgium, 27 June 2023
Link
¹ University of Passau
Christine Plote¹, Alexander Decker¹
“Digital Native oder Digital Naive? Was jeder über Google & Co wissen sollte …”
Invited Talk at HAW (Hamburg University of Applied Sciences), Hamburg, Germany, 7 June 2023
¹ Open Search Foundation
Michael Dinzinger¹
“Distributed and legally compliant crawling in the OpenWebSearch.EU project”
Presentation at IRIXYS workshop, INSA Lyon, France, May 10, 2023
¹ University of Passau
Maik Fröbe¹, Lukas Gienapp², Martin Potthast², Matthias Hagen³
“Bootstrapped nDCG Estimation”
Presentation & Talk at ECIR 2023 in Dublin, Ireland, April 2-6, 2023
¹ WEBIS, ² University of Leipzig, ³ Martin-Luther-University Halle-Wittenberg
Maik Fröbe¹, Matti Wiegmann², Nikolay Kolyada², Bastian Grahm³, Theresa Elstner³, Frank Loebe³, Matthias Hagen⁴, Benno Stein², Martin Potthast³
“Continuous Integration for Reproducible Shared Tasks with TIRA.io”
Presentation & Talk at ECIR 2023 in Dublin, Ireland, April 2-6, 2023
¹ WEBIS, ² Bauhaus University Weimar, ³ University of Leipzig, ⁴ Friedrich-Schiller-University Jena
Stefan Voigt¹
“Auf dem Weg zu einer offenen Europäischen Internetsuche – Wie ein offener Webindex Europas digitale Souveränität fördert.”
Webinar at Gesellschaft für Informatik, online, 23 March 2023
¹ Open Search Foundation
Michael Dinzinger¹, Aurora Gonzalez Vidal²
“Two sister initiatives for a paradigm change in open search and discovery on the internet”
Contributed Lightning Talk at FOSDEM 2023, Brussels, Belgium, February 5, 2023
Link
¹ University of Passau, ² NGI Search
Stefan Voigt¹, Michael Granitzer², Isabell Claus³, Christine Plote¹
“Towards an Open European Internet Search”
Online lecture at DG CONNECT University Online Session, January 19, 2023
Link
¹ Open Search Foundation, ² University of Passau, ³ thinkers.ai
Christine Plote¹, Stefan Voigt¹
“Die Internetsuche auf dem Prüfstand – Warum wir Alternativen zu Google & Co brauchen, und was jeder von uns tun kann”
Webinar at BayernLab, online, January 11, 2023
Link
¹ Open Search Foundation
2022
Michael Granitzer¹, Per Öster², Lukas Vojacek³
“Towards an Open Web Search Infrastructure”
Contributed talk at European Open Science Cloud Symposium, Prague, Czech Republic, November 11, 2022
Link
¹ University of Passau, ² CSC, ³ IT4I
Stefan Voigt¹
Invited Talk
EC High Level Meeting for the future of the internet in Prague, Czech Republic, November 2, 2022
¹ Open Search Foundation
Michael Granitzer¹
“Piloting Open Websearch: OpenWebSearch.eu”
Invited Talk at 4th Open Search Symposium #ossym22, Geneva, Switzerland, October 10, 2022
Link
¹ University of Passau
Tim Smith¹
“Search, Open Search and Science online”
TED Talk at TEDx Verbier, September 3, 2022
Link to Stream
¹ European Organization for Nuclear Research (CERN)
Other Presentations
2024
Christine Plote¹
Gut zu finden? Wie Suchmaschinen unseren Blick auf die Welt prägen.
Talk at ‘Orientierung in der digitalen Welt. Wohin führen uns Google, TikTok & Co?‘ at ‘Evangelische Akademie Tutzing‘, Tutzing, Germany, 2 March 2024
¹Open Search Foundation
Christine Plote¹
Google kennt Dich: Über Suchmaschinen, Privatsphäre und wie man es besser machen kann
Talk at ‘Das Otto‘, Neuburg, Germany, 15 January 2024
¹Open Search Foundation
2023
Christine Plote et al.¹
Maustag 2023
Booth with interactive program at the Open Door Event at LRZ, Garching, Germany, 3 October 2023
¹Open Search Foundation
Christine Plote¹
Lecture series „Herausforderungen und Kompetenzen in der Digitalisierung“
Invited talk at HAW Hamburg University of Applied Sciences, Hamburg, Germany, 7 June 2023
¹Open Search Foundation
Christine Plote¹
re:publica23
Panel participation at re:publica 2023 conference, Berlin, Germany, 6 June 2023
¹Open Search Foundation
Christine Plote¹, Stefan Voigt¹
Safer Internet Day 2023
Booth at Safer Internet Day Conference 2023, Berlin, Germany, 14 February 2023
¹Open Search Foundation
2022
Katerina Slaninova¹, Jan Martinovic¹, Vit Vondrak¹
SC 2022
Booth Presentation at SC 2022, Dallas, USA, November 13-18, 2022
¹IT4I