Quick Links
Research Publications
Publications 2025
Emma J. Gerritse¹, Faegheh Hasibi¹, Arjen P. de Vries¹
Embeddings to Empower Entity Retrieval
In: Information Retrieval Research, June 2025
Link to publication
¹Radboud University Nijmegen
Proceedings
Sebastian Gürtl1, Alexander Nussbaumer1, Christian Gütl1
Supporting Vertical Web Search and Customized Search Applications with the Modular and Open Framework MOSAIC
In: Proceedings of the 2nd International Workshop on Open Web Search Co-located With the 47th European Conference on Information Retrieval (ECIR 2025), December 2025
Link to publication
1Graz University of Technology
Zerhoudi Saber1, Granitzer Michael1
UXSim: Towards a Hybrid User Search Simulation
In: Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM ’25), November 2025
Link to publication
1University of Passau
Laura Caspari1, Michael Dinzinger1, Michael Granitzer1, Jelena Mitrovic1
Extracting and Utilizing Structured Data from the Open Web Index
In: 7th International Open Search Symposium (OSSYM2025), October 2026
Link to publication
1University of Passau
Noor A. Fathima1, Michael Dinzinger 2, Michael Granitzer 2, Andreas Wagner1
Architecting the Datastore for the URL Frontier of OpenWebSearch.eu
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1CERN Geneva, 2University of Passau
Jason Theodoropoulos1, & Joonas Kesäniemi1
Using The Open Web Index To Create New Search Applications For Research.fi
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1CSC – IT Center for Science
Elias Sandner1, 2, Daniel Scharf2, Thomas Wautischar2, Igor Jakovljevic1, Alice Simniceanu3, Luca Fontana3, Andre Henriques1 , Andreas Wagner1 , Christian Gütl2
Assessing the Reliability of Human and LLM-Based Screening in Systematic Reviews: A Study on First-Time Reviewers
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1 CERN Geneva, 2Graz University of Technology, 3WHO Geneva
Dmytro Zhuk1, Elias Sandner2,3, Igor Jakovljevic2, Alice Simniceanu3, Luca Fontana3, Andre Henriques2, Andreas Wagner2, Christian Gütl4
Automating License-Aware Full-Text Retrieval for Systematic Reviews: An End-To-End Scalable System to Reduce Reviewer Workload
In: 7th International Open Search Symposium (OSSYM2025), October 2026
Link to publication
1University of Vienna, 2CERN Geneva, 3WHO Geneva, 4Graz University of Technology
Christine Plote1, Alexander Nussbaumer2
A Charter For Public Interest Internet Search
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1Open Seatch Foundation e.V. Starnberg, 2Technical University Graz
Saber Zerhoudi1, Michael Granitzer1
In-Browser Agentic Web: a Decentralized Approach to Information Access
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1University of Passau
Felix Holz1, Daniel Scharf1, Alexander Nussbaumer1, Sebastian Gürtl1
Adding Retrieval Augmented Generation to the MOSAIC Framework
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1Graz University of Technology
Gijs Hendriksen1, Djoerd Hiemstra1, Arjen P. de Vries1
Efficient Session Search using Topical Index Shards
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1Radbound University Nijmegen
Kateřina Slaninová1, Pavlína Smolková1
Large-Scale Graph Visualisation of Open Web Index and its Evolution in Time
In: 7th International Open Search Symposium (OSSYM2025), October 2025
Link to publication
1IT4Innocationa, VBS – Technical University of Ostrava
Lukas Gienapp1, Christopher Schröder3,4, Stefan Schweter5, Christopher Akiki4,6, Ferdinand Schlatt7, Arden Zimmermann8, Phillipe Genêt9 and Martin Potthast1
The German Commons – 154 Billion Tokens of Openly Licensed Text for German Language Models
In: CoRR, October 2025
Link to publication
1University of Kassel; hessian.AI, ScaDS.AI Kassel, 3InfAI, 4ScaDS.AI Leipzig, 5Independent Researcher Holzkirchen, 6Leipzig University, 7Friedrich-Schiller University Jena, 8German National Library Leipzig, 9German National Library Frankfurt
Daria Alexander11, Maik Fröbe2 , Gijs Hendriksen1, Matthias Hagen2 , Djoerd Hiemstra1, Martin Potthast3, Arjen P. de Vries1
Team OpenWebSearch at LongEval: Using Historical Data for Scientific Search Working notes
In: CLEF 2025, September 2025
Link to publication
1Radbound University Nijmegen, 2 Friedrich-Schiller-University Jena, 3University of Kassel; hessian.AI, ScaDS.AI
Janek Bevendorff1,2, Yuxia Wang3, Jussi Karlgren4, Matti Wiegmann2,5, Maik Fröbe6, Akim Tsivgun7, Jinyan Su8, Zhuohan Xie3, Mervat Abassy9, Jonibek Mansurov3, Rui Xing3,10, Minh Ngoc Ta11, Kareem Ashraf Elozeiri12, Tianle Gu13, Raj Vardhan Tomar14, Jiahui Geng3, Ekaterina Artemova15, Artem Shelmanov3, Nizar Habash16, Efstathios Stamatatos17, Iryna Gurevych3,18, Preslav Nakov3, Martin Potthast5,19,Benno Stein2
Overview of the “Voight-Kampff” Generative AI Authorship Verification Task at PAN and ELOQUENT 2025 Working notes
In: CLEF 2025, September 2025
Link to publication
1Leipzig University, 2Bauhaus-University Weimar, 3Mohamed bin Zayed University of Artificial Intelligence, 4University of Helsinki, 5University of Kassel, 6Friedrich-Schiller-University Jena, 7Nebius AI; KU Leuven, 8Cornell University, 9Alexandria University, 10The University of Melbourne, 11BKAI Research Center – Hanoi University of Science and Technology, 12Zewail Citiy of Science and Technology, 13Tsinghua University, 14Cluster Innovation Center – University of Delhi, 15Toloka AI, 16New York University Abu Dhabi, 17University of the Aegean, 18TU Darmstadt, 19hessian.AI; ScaDS.Ai
Johannes Kiesel1, Çağrı Çöltekin2, Marcel Gohsen3, Sebastian Heineking4, Maximilian Heinrich3, Maik Fröbe5, Tim Hagen6,7, Mohammad Aliannejadi8, Sharat Anand3, Tomaž Erjavec9, Matthias Hagen5, Matyáš Kopp10, Nikola Ljubešić9, Katja Meden9, Nailia Mirzakhmedova3, Vaidas Morkevičius11, Harrisen Scells2, Moritz Wolter4, Ines Zelch4,5, Martin Potthast6,7,12, Benno Stein3
Overview of Touché 2025: Argumentation Systems
In: CLEF 2025, September 2025
Link to publiaction
1GESIS – Leibniz Institute for Social Sciences, 2University of Tübingen, 3Bauhaus-University Weimar, 4Leipzig University, 5Friedrich-Schiler-University Jena, 6University of Kassel, 7hessian.AI, 8University of Amsterdam, 9Jožef Stefan Institute, 10Charles University, 11Kaunas University of Technology, 12ScaDS.AI
Ines Zelch1,2, Matthias Hagen1, Benno Stein3, Johannes Kiesel4
Reproducing the Argument Quality Prediction of Project Debater
In: ArgMining 2025, July 2025
Link to publication
1Friedrich-Schiller-University Jena, 2Leipzig University, 3Bauhaus-University Weimar, 4GESIS – Leibniz Institute for Social Sciences
Ines Zelch1,2, Matthias Hagen1, Benno Stein3, Johannes Kiesel4
Segmentation of Argumentative Texts by Key Statements for Argument Mining from the Web
In: ArgMining 2025, July 2025
Link to publication
1Friedrich-Schiller-Universit Jena, 2Leipzig University, 3Bauhaus-University Weimar, 4GESIS – Leibniz Istitute for Social Sciences
Lukas Gienapp1,2,3, Niklas Deckers1,3, Martin Potthast1,2,3, Harrisen Scells4
Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins
In: 15th International Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR 2025), July 2025
Link to publication
1University of Kassel, 2ScaDS.AI, 3hessian.AI, 4University of Tübingen
Jan Heinrich Merker1, Maik Fröbe1, Benno Stein2, Martin Potthast3, and Matthias Hagen1
Axioms for Retrieval-Augmented Generation
In: 15th International Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR 2025), July 2025
Link to publication
1Friedrich-Schiller-University Jena, 2Bauhaus-University Weimar, 3University of Kassel & hessian.AI & ScaDS.AI
Janek Bevendorff1,2, Matti Wiegmann2,3, Emmelie Richter2, Martin Potthast3,4, Benno Stein2
The Two Paradigms of LLM Detection: Authorship Attribution vs. Authorship Verification
In: ACL Findings 2025, July 2025
Link to publication
1Leipzig University, 2Bauhaus-University Weimar, 3University of Kassel, 4hessian.AI & ScaDS.AI
Lukas Gienapp1,2, Tim Hagen3,4, Maik Fröbe5, Matthias Hagen5, Benno Stein6, Martin Potthast2,3,4,Harrisen Scells3,4
The Viability of Crowdsourcing for RAG Evaluation
In: 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), July 2025
Link to publication
1Leizig University, 2ScaDS.AI, 3University of Kassel, 4hessian.AI, 5Friedrich-Schiller-University Jena, 6Bauhaus-University Weimar
Michael Dinzinger1, Laura Caspari1, Kanishka Ghosh Dastidar1, Jelena Mitrović1, Michael Granitzer1
WebFAQ: A Multilingual Collection of Natural Q&A Datasets for Dense Retrieval
In: SIGIR 2025, July 2025
Link to publication
1University of Passau
Mohanna Hoveyda1, Harrie Oosterhuis1, Arjen P. de Vries1, Maarten de Rijke2, Faegheh Hasibi1
Adaptive Orchestration of Modular Generative Information Access Systems
In: SIGIR 2025, July 2025
Link to publication
1Radboud University Nijmegen, 2University of Amsterdam
Djoerd Hiemstra1
Score-Fitted Indexes and Constant Length Indexes for Information Retrieval (Short Papers)
In: SIGIR 2025, July 2025
Link to publication
1Radboud University Nijmegen
Daria Alexander1, Arjen P. de Vries1
In a Few Words: Comparing Weak Supervision and LLMs for Short Query Intent Classification (Short Papers)
In: SIGIR 2025, July 2025
Link to publication
1Radboud University Nijmegen
Janek Bevendorff1, Daryna Dementieva2, Maik Fröbe3, Bela Gipp4, André Greiner-Petter4, Jussi Karlgren5, Maximilian Mayerl6, Preslav Nakov7, Alexander Panchenko8, Martin Potthast9,10,11, Artem Shelmanov7, Efstathios Stamatatos12, Benno Stein13, Yuxia Wang7, Matti Wiegmann13, Eva Zangerle6
Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection: Extended Abstract
In: LNCS, ECIR 2025, April 2025
Link to publication
1Leipzig University, 2Technical University of Munich, 3Friedrich-Schiller-University Jena, 4Georg-August-University Göttingen, 5Silo AI Helsinki, 6University of Innsbruck, 7Mohamed bin Zayed University of Artificial Intelligence Abu Dhabi, 8Skoltech & AIRI Moscow, 9Kassel University, 10hessian.AI, 11ScaDS.AI, 12University of the Aegean, 13Bauhaus-University Weimar
Maik Fröbe1, Andrew Parry2, Harrisen Scells3, Shuai Wang4, Shengyao Zhuang4,5, Guido Zuccon4, Martin Potthast6, Matthias Hagen1
Corpus Subsampling: Estimating the Effectiveness of Neural Retrieval Models on Large Corpora
In: LNCS, ECIR 2025, April 2025
Link to publication
1Friedrich-Schiller-University Jena, 2University of Glasgow, 3University of Kassel, 4University of Queensland, 5CSIRO, 6University of Kassel & hessian.AI & ScaDS.AI
Jan Heinrich Merker1, Janek Bevendorff², Maik Fröbe1, Tim Hagen³, Harrisen Scells³, Matti Wiegmann², Benno Stein², Matthias Hagen1, Martin Potthast³,4
Web-Scale Retrieval Experimentation with chatnoir-pyterrier
In: LNCS, ECIR 2025, April 2025
Link to publication
1Friedrich-Schiller-University Jena, ²Bauhau-University Weimar, ³University of Kassel and hessian.AI, 4ScaDS.AI
Bogdan Ionescu¹, Henning Müller², Dan-Christian Stanciu¹, Ahmad Idrissi-Yaghir³, Ahmedkhan Radzhabov4, Alba García Seco de Herrera5, Alexandra Andrei¹, Andrea Storås6, Asma Ben Abacha7, Benjamin Bracke³, Benjamin Lecouteux8, Benno Stein9, Cécile Macaire8, Christoph M. Friedrich³, Cynthia Sabrina Schmidt10, Diandra Fabre8, Didier Schwab8, Dimitar Dimitrov11, Emmanuelle Esperança-Rodier8, Gabriel Constantin¹, Helmut Becker10, Hendrik Damm³, Henning Schäfer10, Ivan Rodkin12, Ivan Koychev11, Johannes Kiesel9, Johannes Rückert³, Josep Malvehy13, Liviu-Daniel Ștefan1, Louise Bloch³, Martin Potthast14, Maximilian Heinrich9, Michael A. Riegler6, Mihai Dogariu1, Noel Codella7, Pål Halvorsen6, Preslav Nakov12, Raphael Brüngel³, Roberto Adres Novoa15, Rocktim Jyoti Das16, Steven A. Hicks6, Sushant Gautam6, Tabea M.G. Pakull10, Vajira Thambawita6, Vassili Kovalev17,18, Wen-Wai Yim7, Zhuohan Xie12
ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications
In: LNCS, ECIR 2025, April 2025
Link to publication
¹National University of Science and Technology POLITEHNICA Bucharest, ²HES-SO Valais-Wallis, ³University of Applied Science and Arts Dortmund,
4Belarus State University, 5National University of Distance Education Spain, 6SimulaMet Norway, 7Microsoft USA, 8Grenoble Alpes University, 9Bauhaus-University Weimar, 10University Hospital Essen, 11Sofia University, 12Mohamed bin Zayed University of Artificial Intelligence United Arab Emirates, 13Hospital Clinic of Barcelona, 14University of Kassel, hessian.AI, and ScaDS.AI, 15Stanford University, 16Indian Institute of Technology Delhi, 17Belarus National Academy of Sciences, 18Belarus State University, Belarus
Ferdinand Schlatt1, Maik Fröbe1, Harrisen Scells2,6, Shengyao Zhuang3,4, Bevan Koopman3, Guido Zuccon4, Benno Stein5, Martin Potthast2,6,7, Matthias Hagen1
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-ranking
In: LNCS, ECIR 2025, April 2025
Link to publication
1Friedrich-Schiller-University Jena, 2University of Kassel, 3CSIRO,
4University of Queensland, 5Bauhaus-University Weimar, 6hessian.AI, 7ScaDS.AI
Sebastian Heineking1, Jonas Probst1, Daniel Steinbach1, Martin Potthast2,3, Harrisen Scells1,2
Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions
In: LNCS, ECIR 2025, April 2025
Link to publication
1University of Leipzig, 2University of Kassel and hessian.AI, 3ScaDS.AI
Ferdinand Schlatt1, Maik Fröbe1, Harrisen Scells2,6, Shengyao Zhuang3,4, Bevan Koopman3, Guido Zuccon4, Benno Stein5, Martin Potthast2,6,7, Matthias Hagen1
Set-Encoder: Permutation-Invariant Inter-passage Attention for Listwise Passage Re-ranking with Cross-Encoders
In: LNCS, ECIR 2025 submissions (Martin Potthast, Arjen de Vries), April 2025
Link to publication
¹Friedrich-Schiller-University Jena, ²University of Kassel, 3CSIRO, 4The University of Queensland, 5Bauhaus-University Weimar, 6hessian.Ai, 7ScaDS.AI
Michael Granitzer¹, Mohamad Hayek², Sebastian Heineking³, Gijs Hendriksen4, Martin Golasowski5, Michael Dinzinger¹, Saber Zerhoudi¹
OpenWebSearch.eu – Building an Open Web Index on EuroHPC JU Infrastructures
In: Proceedings of the Second EuroHPC user day, Procedia Computer Science, March 2025
Link to publication
¹University of Passau, 2 Leibniz Supercomputing Centre Munich,³University of Leipzig, 4Radbound University Nijmegen, 5IT4I National Supercomputing Center Ostrava
Publications 2024
Sheikh Mastura Farzana¹, Maik Fröbe², Michael Granitzer³, Gijs Hendriksen4, Djoerd Hiemstra4, Martin Potthast5,6,7, Arjen P. de Vries4, Saber Zerhoudi³
Report on the 1st International Workshop on Open Web Search (WOWS 2024) at ECIR 2024
In: SIGIR Forum, June 2024
Link to publication
¹German Aerospace Center, ²Friedrich-Schiller-University Jena, ³University of Passau, 4Radbound University Nijmegen, 5University of Kassen, 6hessian.AI, 7ScaDS.AI
Michael Granitzer¹ et al.
Impact and Development of an Open Web Index for Open Web Search
In: JASIST, May 2024
Link to publication
¹University Passau
Proceedings
Saber Zerhoudi¹, Michael Granitzer¹
Generative Agents Navigating Digital Libraries
In: ICADL 2024, December 2024
Link to publication
¹University of Passau
Mohammed Al-Maamari¹, Istaiti Mahmoud¹, Saber Zerhoudi¹, Michael Dinzinger¹, Michael Granitzer¹, Jelena Mitrović¹
Impact of Tokenization Techniques on URL Classification
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹University of Passau
Sarah Frank1,2, Sebastian Schäffer¹, Andreas Wagner², Alexander Steinmaurer³
Creating Explainable Summaries for Long Scientific Documents using Large Language Models
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹Graz University of Technology, ²CERN Geneva, ³Institute of Digital Sciences Austria
Alexander Nussbaumer¹, Sebastian Gürtl¹, Johannes Honeder¹, Tobias Hecking², Christian Gütl¹
Enriching Science Search with the Open Search Framework MOSAIC
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹Graz University of Technology, ²German Aerospace Center Cologne
Gijs Hendriksen¹, Djoerd Hiemstra¹, Arjen de Vries¹
An Open Source Implementation of Web Clustering Algorithms for Selective Search
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹Radbound University Nijmegen
Noor Afshan Fathima¹, Martin Golasowski², Michael Granitzer³, Andreas Wagner¹, Chris Ariyo4, Gijs Hendriksen5, John Truckenbrodt6, Katja Mankinen4, Michael Dinzinger³, Mikael Karlsson4, Mohamad Hayek7, Stavros Moiras¹, Lukas Vojacek², Stephan Hachinger7, JanMartinovič²
Federated Data Infrastructure for the Open Web Search
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹CERN Geneva, ²IT4I National Supercomputing Center Ostrava, ³University of Passau, 4CSC – IT Center for Science Espoo, 5Radbound University Nijmegen, 6German Aerospace Center Berlin,7Leibniz Supercomputing Centre Munich
Michael Dinzinger¹, Michael Granitzer¹, Jelena Mitrović¹, Saber Zerhoudi¹
OWLer: A Distributed and Collaborative Open Web Crawler
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹University of Passau
Noor Afshan Fathima¹, Michael Granitzer², Michael Dinzinger², Andreas Wagner¹
Architecting the Opensearch Service at CERN for OpenWebSearch.EU
In: 6th International Open Search Symposium (OSSYM2024), October 2024
Link to publication
¹CERN Geneva, ²University of Passau
Johannes Kiesel¹, Marcel Gohsen¹, Nailia Mirzakhmedova¹, Matthias Hagen², Benno Stein¹
Who Will Evaluate the Evaluators? Exploring the Gen-IR User Simulation Space
In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. 15th International Conference of the CLEF Association (CLEF 2024), September 2024
Link to publication
¹Bauhaus-University Weimar, ²Friedrich-Schiller-University Jena
Janek Bevendorff1,2, Matti Wiegmann², Jussi Karlgren³, Luise Dürlich4, Evangelia Gogoulou4, Aarne Talman5, Efstathios Stamatatos6, Martin Potthast7,8,9, Benno Stein²
Overview of the “Voight-Kampff” Generative AI Authorship Verification Task at PAN and ELOQUENT 2024
In: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), September 2024
Link to publication
¹Leipzig University, ²Bauhaus-University Weimar, ³Silo AI Helsinki, 4RISE Research Institutes of Sweden, 5University of Helsinki,6University of the Aegean, 7University of Kassel, 8hessian.AI, 9SCaDS.AI
Niklas Deckers1,2, Julia Peters, Martin Potthast1,2
Manipulating embeddings of stable diffusion prompts
In: IJCAI ’24: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, August 2024
Link to publication
¹Leipzig University, ²SacDS.AI
Maik Fröbe¹, Harrisen Scells², Theresa Elstner², Christopher Akik²i, Lukas Gienapp², Jan Heinrich Reimer³, Sean MacAvaney4, Benno Stein5, Matthias Hagen¹, Martin Potthast6,7,8
Resources for Combining Teaching and Research in Information Retrieval Coursework
In: SIGIR ’24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2024
Link to publication
¹Friedrich-Schiller-University Jena, ²Leipzig University,³Instirute for Computer Science, Friedrich-Schiller-University Jena, 4University of Glasgow, 5Bauhaus-University Weimar, 6University of Kassel, 7hessian.AI, 8SCaDS.AI
Maik Fröbe¹, Joel Mackenzie², Bhaskar Mitra³, Franco Maria Nardini4, Martin Potthast5,6,7
ReNeuIR at SIGIR 2024: The Third Workshop on Reaching Efficiency in Neural Information Retrieval
In: SIGIR ’24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2024
Link to publication
¹Friedrich-Schiller-University Jena, ²The University of Queensland, ³Microsoft Research Montreal,4ISTI-CRN Pisa, 5University of Kassel, 6hessian.AI, 7SCaDS.AI
Nandan Thakur¹, Luiz Bonifacio², Maik Fröbe³, Alexander Bondarenko3,4, Ehsan Kamalloo¹, Martin Potthast5,6,7, Matthias Hagen³, Jimmy Lin¹
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR
In: SIGIR ’24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2024
Link to publication
¹University of Waterloo, ²UNICAMP & University of Waterloo Campinas, ³Friedrich-Schiller-University Jena, 4Leipzig University, 5University of Kassel, 6hessian.AI, 7SCaDS.AI
Lukas Gienapp1,2, Harrisen Scells¹, Niklas Deckers1,2, Janek Bevendorff¹, Shuai Wang³, Johannes Kiesel4, Shahbaz Syed¹, Maik Fröbe5, Guido Zuccon³, Benno Stein4, Matthias Hagen5, and Martin Potthast6,7
Evaluating Generative Ad Hoc Information Retrieval
In: SIGIR ’24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2024
Link to publication
¹Leipzig University, ²SacDS.AI, ³The University of Queensland, 4Bauhaus-University Weimar, 5Friedrich-Schiller-University Jena, 6University of Kassel, 7hessian.AI
Saber Zerhoudi¹, Michael Granitzer¹
PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents
In: IR-RAG workshop @ SIGIR24, July 2024
Link to publication
¹University of Passau
Timon Ziegenbein¹, Shahbaz Syed², Martin Potthast, Henning Wachsmuth¹
Objective Argument Summarization in Search
In: Robust Argumentation Machines. RATIO 2024, July 2024
Link to publication
¹Leibniz University Hannover, ²Leipzig University
Laura Caspari¹, Kanishka Ghosh Dastidar¹, Saber Zerhoudi¹, Jelena Mitrovic¹, Michael Granitzer¹
Beyond Benchmarks: Evaluating Embedding Model Similarity for Retrieval Augmented Generation Systems
In: IR-RAG workshop @ SIGIR24, July 2024
Link to publication
¹University Passau
Nailia Mirzakhmedova¹, Johannes Kiesel¹, Milad Alshomary², Maximilian Heinrich¹, Nicolas Handke³, Xiaoni Cai¹, Valentin Barriere, Doratossadat Dastgheib, Omid Ghahroodi, MohammadAli SadraeiJavaheri, Ehsaneddin Asgari, Lea Kawaletz, Henning Wachsmuth², Benno Stein¹
The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments
In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
Link to publication
¹Bauhaus-University Weimar, ²Leibniz University Hannover, ³Leipzig University
Janek Bevendorff¹, Xavier Bonet Casals², Berta Chulvi³, Daryna Dementieva4, Ashaf Elnagar5, Dayne Freitag6, Maik Fröbe7, Damir Korenčić³, Maximilian Mayerl8, Animesh Mukherjee9, Alexander Panchenko10, Martin Potthast1,11, Francisco Rangel12, Paolo Rosso3,13, Alisa Smirnova14, Efstathios Stamatatos15, Benno Stein16, Mariona Taulé2, Dmitry Ustalov17, Matti Wiegmann16, Eva Zangerle8
Overview of PAN 2024: Multi-Author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification (extended abstract)
In: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, March 2024
Link to publication
¹Leipzig University, ²University of Barcelona, ³Techical University of Valencia, 4Technical University of Munich,5University of Sharjah, 6SRI International, 7Friedrich-Schiller-University Jena,8University of Innsbruck,9Indian Institute of Technology Kharagpur, 10Skolkovo Institute of Science and Technology, 11ScaDS.AI, 12Symanto Research, 13ValgrAI, 14Toloka,15University of the Aegean, 16Bauhaus-Unversity Weimar,17JetBrains
Bogdan Ionescu, Henning Müller, Ana Maria Drăgulinescu, Ahmad Idrissi-Yaghir, Ahmedkhan Radzhabov, Alba Garcia Seco de Herrera, Alexandra Andrei, Alexandru Stan, Andrea M. Storås, Asma Ben Abacha, Benjamin Lecouteux, Benno Stein³, Cécile Macaire, Christoph M. Friedrich, Cynthia Sabrina Schmidt, Didier Schwab, Emmanuelle Esperança-Rodier, George Ioannidis, Griffin Adams, Henning Schäfer, Hugo Manguinhas, Ioan Coman, Johanna Schöler, Johannes Kiesel, Johannes Rückert, Louise Bloch, Martin Potthast, Maximilian Heinrich³, Meliha Yetisgen, Michael A. Riegler, Neal Snider, Pål Halvorsen, Raphael Brüngel, Steven A. Hicks, Vajira Thambawita, Vassili Kovalev, Yuri Prokopchuk & Wen-Wai Yim
Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024 (conference paper)
In: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, March 2024
Link to publication
³Bauhaus-Unversity Weimar
Maik Fröbe¹, Theresa Elstner², Harrisen Scells², Benno Stein³, Martin Potthast2,4
Prototyping Open Web Search Applications with TIRA: A Case Study in Research-oriented Teaching
5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹Friedrich-Schiller-University Jena, ²Leipzig University, ³Bauhaus-University Weimar, 4ScaDS.AI
Simon Hitzginger¹, Alexander Nussbaumer¹, Christian Gütl¹, Chiara Ruß-Baumann¹
Understanding and Mitigating Cognitive Bias during Web Search
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹Graz University of Technology
Alexander Nussbaume¹, Rohit Kaushi1,2, Gijs Hendriksen³, Sebastian Gürtl¹, Christian Gütl¹
Conceptual Design and Implementation of a Prototype Search Application using the Open Web Search Index
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹Graz University of Technology, ²University of Waterloo, ³Radbound University Nijmegen
Ines Zelch¹,², Matthias Hagen¹, Martin Potthast²,³
Commercialized Generative AI: A Critical Study of the Feasibility and Ethics of Generating Native Advertising Using Large Language Models in Conversational Web Search
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹Friedrich-Schiller-University Jena, ²Leipzig University, ³ScaDS.AI
Djoerd Hiemstra¹, Gijs Hendriksen¹, Chris Kamphuis¹, Arjen P. de Vries¹
Challenges of Index Exchange for Search Engine Interoperability
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹Radbound University Nijmegen
Janek Bevendorff¹, Matti Wiegmann¹, Martin Potthast²,³, Benno Stein¹
Product Spam On YouTube: a Case Study
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹Bauhaus-University Weimar, ²Leipzig University, ³ScaDS.AI
Mohammed Al-Maamari¹, Mahmoud Istaiti¹, Saber Zerhoudi¹, Michael Dinzinger¹, Michael Granitzer¹, Jelena Mitrović¹
A Comprehensive Dataset for Webpage Classification
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹University of Passau
Michael Dinzinger¹, Mohammed Al-Maamari¹, Saber Zerhoudi¹, Mahmoud Istaiti¹, Jelena Mitrović¹, Michael Granitzer¹
OWler: Preliminary results for building a Collaborative Open Web Crawler
In: 5th International Open Search Symposium (OSSYM2023), March 2024
Link to publication
¹University of Passau
Andrew Parry¹, Maik Fröbe², Sean MacAvaney¹, Martin Potthast3,4, Matthias Hagen²
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models (full paper)
In: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, in March 2024
Link to publication
¹University of Glasgow, ²Friedrich-Schiller-University Jena, ³Leipzig University, 4ScaDS.AI
Johannes Kiesel¹, Çağrı Çöltekin², Maximilian Heinrich¹, Maik Fröbe³, Milad
Alshomary4, Bertrand De Longueville5, Tomaž Erjavec6, Nicolas Handke7, Matyáš Kopp8, Nikola Ljubešić6, Katja Meden6, Nailia Mirzakhmedova1, Vaidas Morkevičius9, Theresa Reitis-Munstermann5, Mario Scharfbillig5, Nicolas Stefanovitch5, Henning Wachsmuth4, Martin Potthast7,10, Benno Stein¹
Overview of Touché 2024: Argumentation Systems (full paper)
In: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, in March 2024
Link to publication
¹Bauhaus-University Weimar, ²University of Tübingen, ³Friedrich-Schiller-University Jena,4Leibniz University Hannover,5European Commission, Joint Research Centre,6Jožef Stefan Institute, 7Leipzig University, 8Charles University, 9Kaunas University of Technology, 10ScaDS.AI
Janek Bevendorff¹,∗ , Matti Wiegmann²,∗ , Martin Potthast¹³ , and Benno Stein²
Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹Leipzig University, ² Bauhaus-Universität Weimar, ³ ScaDS.AI, *equal contributation
Ferdinand Schlatt¹, Maik Fröbe¹, Matthias Hagen¹,
Investigating the Effects of Sparse Attention on Cross-Encoders
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹Friedrich-Schiller University, Jena
Andrew Parry¹, Maik Fröbe², Sean MacAvaney¹, Martin Potthast³, Matthias Hagen²,
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹University of Glasgow, ²Friedrich-Schiller University, Jena, ³Leipzig University
Gijs Hendriksen¹, Michael Dinzinger², Sheikh Mastura Farzana³, Noor Afshan Fathima⁴, Maik Fröbe5, Sebastian Schmidt6, Saber Zerhoudi², Michael Granitzer², Matthias Hagen5, Djoerd Hiemstra¹, Martin Potthast6 and Benno Stein7
The Open Web Index: Crawling and Indexing the Web for Public Use
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹Radboud University Nijmegen, ²University of Passau, ³German Aerospace Center, ⁴CERN, 5Friedrich-Schiller University, 6Leipzig University, 7Bauhaus University Weimar
Gijs Hendriksen¹, Djoerd Hiemstra¹, Arjen de Vries¹
Weighted AUReC: Handling Skew in Shard Map Quality Estimation for Selective Search
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹Radboud University Nijmegen
Johannes Kiesel¹, Marcel Gohsen¹, Nadia Mirzakhmedova¹, Matthias Hagen², Benno Stein¹
Simulating Follow-up Questions in Conversational Search
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Lecture Notes in Computer Science, March 2024. Springer.
Link to publication
¹Bauhaus University Weimar, ²Friedrich-Schiller University, Jena
Sheikh Farzana¹, Maik Fröbe², Michael Granitzer³, ³Saber Zerhoudi, Gijs Hendriksen⁴, Djoerd Hiemstra⁴, Martin Potthast5,
1st International Workshop on Open Web Search
In: In Advances in Information Retrieval. 46th European Conference on IR Research (ECIR 2024), Conference Paper, March 2024. Springer.
Link to publication
¹German Aerospace Center, ²Friedrich-Schiller University, Jena, ³University of Passau, ⁴Radboud University Nijmegen,5Leipzig University
Ines Zelch¹, Matthias Hagen¹, Martin Potthast²
A User Study on the Acceptance of Native Advertising in Generative IR
In: Proceedings of the 2024 Conference on Human Information Interaction and Retrieval (CHIIR ’24)
Link to publication
¹Friedrich-Schiller University, Jena, ²Leipzig University
Michael Dinzinger¹, Michael Granitzer¹
A longitudinal study of content control mechanisms
In: Proceedings of the Temporal Web Analytics Workshop ’24 (TempWeb)
Link to Publication
¹University of Passau
Sebastian Schmidt, Ines Zelch¹,², Janek Bevendorff³, Benno Stein³,
Matthias Hagen¹, Martin Potthast²
Detecting Generated Native Ads in Conversational Search
In: WWW ’24: Proceedings of the ACM Web Conference 2024, in February 2024
Link to publication
¹Friedrich-Schiller-University Jena, ²Leipzig University, ³Bauhaus-University Weimar
Michael Dinzinger¹, Florian Hess², Michael Granitzer¹
A Survey of Web Content Control for Generative AI
In: WOWS 2024, January 2024
Link to publication
¹University of Passau, ²Carl von Ossietzky Universität Oldenburg
Please also view the projects deliverables.
Publications 2023
Michael Granitzer et al.¹
Impact and Development of an Open Web Index for Open Web Search
In: JASIST, Willey, August 2023
Link to publication
¹University of Passau
Maik Fröbe¹, Jan Heinrich Reimer¹, Sean MacAvaney², Niklas Deckers³, Simon Reich³, Janek Bevendorff⁴, Benno Stein⁴, Matthias Hagen¹, Martin Potthast³
The Information Retrieval Experiment Platform
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Friedrich-Schiller-University Jena, ² University of Glasgow, ³ Leipzig University, ⁴ Bauhaus-University Weimar
Janek Bevendorff¹, Sanket Gupta¹, Johannes Kiesel¹, Benno Stein¹
An Empirical Comparison of Web Content Extraction Algorithms
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Bauhaus-University Weimar
Harrisen Scells¹ and Martin Potthast¹
Pybool_ir: A Toolkit for Domain-Specific Search Experiments
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Leipzig University
Harrisen Scells¹, Ferdinand Schlatt², and Martin Potthast¹
Smooth Operators for Effective Systematic Review Queries
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Leipzig University, ² Friedrich-Schiller-University Jena
Jan Heinrich Reimer¹, Sebastian Schmidt², Maik Fröbe¹, Lukas Gienapp², Harrisen Scells², Benno Stein³, Matthias Hagen¹, Martin Potthast²
The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Friedrich-Schiller-University Jena, ² Leipzig University, ³ Bauhaus-University Weimar
Negin Ghasemi¹, Mohammad Aliannejadi², Hamed Bonab³, Evangelos Kanoulas², Arjen P. de Vries¹, James Allan⁴, and Djoerd Hiemstra¹
Cross-Market Product-Related Question Answering
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Radboud University, ² University of Amsterdam, ³ Amazon Inc., ⁴ University of Massachusetts Amherst
Chris Kamphuis¹, Aileen Lin², Siwen Yang², Jimmy Lin², Arjen P. de Vries¹, and Faegheh Hasibi¹
MMEAD: MS MARCO Entity Annotations and Disambiguations
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Radboud University, ² University of Waterloo
Miriam Louise Carnot¹, Lorenz Heinemann¹, Jan Braker¹, Tobias Schreieder¹, Johannes Kiesel², Maik Fröbe³, Martin Potthast¹, and Benno Stein²
On Stance Detection in Image Retrieval for Argumentation
In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’23), ACM, July 2023
Link to publication
¹ Leipzig University, ² Bauhaus-University Weimar, ³ Martin-Luther-Universität Halle Wittenberg
Tim De Jonge¹, Djoerd Hiemstra¹
UNFair: Search Engine Manipulation, Undetectable by Amortized Inequity
In: Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT ’23), ACM, July 2023
Link to publication
¹ Radboud University
Sheikh Mastura Farzana¹, Tobias Hecking¹
Geoparsing at Web-scale – Challenges and Opportunities
In: GeoExT 2023: First International Workshop on Geographic Information Extraction from Texts at ECIR 2023, CEUR Workshop proceedings, April 2023
Link to publication
¹ German Aerospace Center (DLR)
Project Deliverables
Noor Afshan Fathima¹, Andreas Wagner¹, John Truckenbrodt², Mohamad Hayek³, Martin Golasowksi⁴, Michael Dinzinger5, Saber Zerhoudi5 , Sebastian Heineking6, Ines Zelch6, Gjis Hendriksen7, Michael Granitzer5*
Training Material Available for Partners
This document outlines the second deliverable of Work Package 5 (WP5) within the OpenWebSearch.eu project, titled “D5.2 Training Material for Partners”. November 2024. Zenodo.
Link to publication
¹European Organization for Nuclear Research, ²Deutsches Zentrum für Luft- und Raumfahrt e. V., ³Leibniz Supercomputing Centre, ⁴VSB – Technical University of Ostrava, University of Passau, Leipzig University, Radboud University, Nijmegen, 5University of Passau, 6Leipzig University, 7Radboud University, Nijmegen, * Project Lead
Alexander Nussbaumer¹, Chiara Ruß-Baumann¹, Izidor Mlaker²
Report of Privacy, Transparency, and Trust Models for Search Applications V2
This document describes the deliverable D4.3 “Report of privacy, transparency, and trust models for search applications V2”. November 2024. Zenodo.
Link to publication
¹Graz University of Technology, ²A1 Slovenia
Sebastian Gürtl¹, Alexander Nussbaumer¹, Christian Gütl¹, Izidor Mlaker², Rigon Sallauka², Roxanne El Baff³, Tobias Hecking⁴
Search Application Prototypes
This document describes the deliverable D4.1 “Search Application Prototypes”. It reports on initial developments of search applications in the context of tasks T4.1 and T4.2, including a a science search application and a restaurant recommender. November 2024. Zenodo.
Link to publication
¹Graz University of Technology, ²A1 Slovenia, ³Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR),⁴German Aerospace Center
Gjis Hendriksen¹, Djoerd Hiemstra¹, Arjen de Vries¹, Sebastian Gürtl², Alexander Nussbaumer², Sebastian Heineking³, Ines Zelch³, Martin Pottharst³, Michael Granitzer⁴*, Michael Dinzinger⁴, Saber Zerhoudi⁴, Noor Afshan Fathima5
The OpenWebSearch Hub and the Open Web Index Y2
This document describes the second version (out of three) of deliverable D3.3 “The OpenWebSearch Hub and the Open Web Index”. November 2024. Zenodo.
Link to publication
¹Radboud University, Nijmegen, ²Graz University of Technology, ³Leipzig University, ⁴University of Passau,5European Organization for Nuclear Research, *Project Lead
Sebastian Heineking¹, Ines Zelch¹, Janek Bevendorff¹, Sheik Mastura Farzana², Laura Caspari³, Martin Pottharst¹
Semantic Enrichment Algorithms and Models
This document reviews the status of Deliverable D2.3 “Semantic Enrichment Algorithms and Models”. The deliverable is reviewed in comparison to the objectives planned in the proposal, and with respect to how it integrates with other components of the project. November 2024. Zenodo.
Link to publication
¹Leipzig University, ² Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR), ³University of Passau
Saber Zerhoudi¹, Michael Dinzinger¹, Michael Granitzer¹*
Open Webmaster Console Software Stack & Services V1
This document provides a detailed overview of the first deliverable for D1.4 “Open Webmaster Console Software Stack & Services”. It outlines the current progress in the development and launch of the OWS Webmaster Console and its services. November 2024. Zenodo.
Link to publication
¹University of Passau, * Project Lead
Saber Zerhoudi¹, Michael Dinzinger¹, Michael Granitzer¹*
Open Website Index & Open Crawl Storage Index V1
This document provides a detailed overview of the first deliverable for D1.3 “Open Website Index & Open Crawl Storage Index”. It outlines the current progress in the development and launch of the Open Website Index & Open Crawl Storage Index. November 2024. Zenodo.
Link to publication
¹University of Passau, * Project Lead
Michael Dinzinger¹, Saber Zerhoudi¹, Tobias Hecking², Sebastian Heineking³, Martin Pottharst³, Gjis Hendriksen⁴, Noor Afshan Fathima5, Michael Granitzer¹*
Crawler Coordination Software Stack & Demonstrator V2
This document provides a detailed overview of the second deliverable for D1.2 “The OpenWebSearch Crawler and the Crawling Frontier”. It outlines the achievements in developing and launching the Open Web Crawler (OWLer) and its related software components. November 2024. Zenodo.
Link to publication
¹University of Passau, ² Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR), ³ Leipzig University, ⁴ Radboud University, Nijmegen, 5 European Organization for Nuclear Research, * Project Lead
Noor Afshan Fathima¹, Andreas Wagner¹, Martin Golasowski², Jon Truckenbrodt², Katja Mankinen⁴, Stephan Hachinger5, Michael Granitzer6*
Launch of the Pilot Infrastructure
The document provides a detailed overview of the first deliverable of Work Package 5 (WP5) entitled “D5.1 Launch of the Pilot Infrastructure”, which is part of the OpenWebSearch.eu project. March 2024. Zenodo.
Link to publication
¹European Organization for Nuclear Research, ² Technical University of Ostrava, ³ German Aerospace Center, ⁴ CSC – IT-Center for Sciene,Finland, 5 Leibniz Supercomputing Centre, 6 University of Passau, *Project Lead
Sebastian Schmidt¹, Ines Zelch¹, Janek Bevendorff¹, Maik Fröbe¹, Martin Pottharst¹, Michael Granitzer²*
The OpenWebSearch WARC parsing & content analysis library Version 1.0
This publication reviews the status of Deliverable D2.1 “The OpenWebSearch WARC parsing and content analysis library” of the OpenWebSearch.eu project. March 2024. Zenodo.
Link to publication
¹Leipzig University, ² University of Passau, *Project Lead
Talks + Lectures
2024
Christine Plote¹
Gut zu finden? Wie Suchmaschinen unseren Blick auf die Welt prägen.
Talk at ‘Orientierung in der digitalen Welt. Wohin führen uns Google, TikTok & Co?‘ at ‘Evangelische Akademie Tutzing‘, Tutzing, Germany, 2 March 2024
¹Open Search Foundation
Christine Plote¹
Google kennt Dich: Über Suchmaschinen, Privatsphäre und wie man es besser machen kann
Talk at ‘Das Otto‘, Neuburg, Germany, 15 January 2024
¹Open Search Foundation
2023
Michael Granitzer¹, Christine Plote²
“NGIForum23”
Participation in Panel Discussion on “Open Web Search, Large Language Models and Beyond” at the NGI Forum, 15-16 November 2023
Link to recording
¹ University of Passau, ² Open Search Foundation
Christine Plote¹
“Der Traum vom freien Raum – Das Internet und seine Utopien”
Presentation & Talk at Akademie Tutzing, Tutzing, Germany, October 28, 2023
Link
¹ Open Search Foundation
Stefan Voigt¹
“5-year into the Open Search Initiative: state of play and future perspective”
Presentation & Talk at EnviroInfo 2023, Garching near Munich, Germany, October 11-13, 2023
Link
¹ Open Search Foundation
Dieter Kranzlmüller¹, Stephan Hachinger¹
“Open Search – new approaches towards a sustainable search infrastructure”
Presentation & Talk at EnviroInfo 2023, Garching near Munich, Germany, October 11-13, 2023
Link
Leibniz Supercomputing Centre of the BAdW (LRZ)
Tobias Hecking¹
“OpenSearch@DLR project – new tools for scientific search and access to environmental data”
Presentation & Talk at EnviroInfo 2023, Garching near Munich, Germany, October 11-13, 2023
Link
¹ German Aerospace Center (DLR)
Michael Granitzer¹
“OpenWebSearch.eu – Building an Open Web Index for an open web search ecosystem”
Keynote at LWDA 2023 – “Learning, Knowledge, Data, Analysis”, Marburg, Germany, October 9-11, 2023
Link
¹ University of Passau
Janek Bevendorff¹, Matti Wiegmann¹, Martin Potthast², Benno Stein¹
“Product Spam on YouTube: A Case Study?”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Bauhaus University Weimar, ² WEBIS
Djoerd Hiemstra¹, Gijs Hendriksen¹, Chris Kamphuis¹, Arjen P. de Vries¹
“Challenges of index exchange for search engine interoperability?”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Radboud University
Maik Fröbe¹, Theresa Elstner², Harrisen Scells³, Benno Stein⁴, Martin Potthast³
“Prototyping Open Web Search Applications with TIRA: A Case Study in Research-oriented Teaching”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Friedrich Schiller Universität Jena, ² University of Leipzig, ³ WEBIS, ⁴ Bauhaus University Weimar
Michael Granitzer¹
“openwebsearch.eu – Where do we stand?”
Invited Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ University of Passau
Michael Dinzinger¹, Saber Zerhoudi¹, Michael Granitzer¹, Mohammed Al-Maamari¹, Mahmoud Istaiti¹, Jelena Mitrovic¹
“OWler: Preliminary results for building a Collaborative Open Web Crawler”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ University of Passau
Ines Zelch Leipzig¹, Matthias Hagen¹, Martin Potthas²
“Commercialized Generative AI: A Critical Study of the Feasibility and Ethics of Generating Native Advertising Using Large Language Models in Conversational Web Search”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Friedrich-Schiller-University Jena, ² WEBIS
Mohammed Al-Maamari¹, Mahmoud Istaiti¹, Michael Granitzer¹, Michael Dinzinger¹, Saber Zerhoudi¹, Jelena Mitrovic¹
“A Comprehensive Dataset for Webpage Classification”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ University of Passau
Jasmin Tietgen¹, Maari Alanko²
“Governance Towards an Open Web Index”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Open Search Foundation, ² CSC
Alexander Nussbaumer¹, Sebastian Gürtl¹, Christian Gütl¹, Rohit Kaushik¹, G. Hendriksen²
“Conceptual Design and Implementation of a Prototype Search Application using the Open Web Search Index”
Presentation & Talk at 5th International Open Search Symposium #OSSYM23, Geneva, Switzerland and Online, October 4-6, 2023
Link
¹ Graz University of Technology , ² Radboud University
Christine Plote et al¹
Maustag 2023
Booth with interactive program at the Open Door Event at LRZ, Garching, Germany, 3 October 2023
¹Open Search Foundation
Alexander Bondarenko¹, Maik Fröbe¹, Johannes Kiesel², Ferdinand Schlatt³, Valentin Barriere⁴, Brian Ravenet⁵, Léo Hemamou⁶, Simon Luck⁷, Jan Heinrich Reimer³, Benno Stein², Martin Potthast⁸, Matthias Hagen³
“Overview of Touché 2023: Argument and Causal Retrieval”
Presentation & Talk at CLEF 2023, Thessaloniki, Greece, September 18-21, 2023
¹ WEBIS, ² Bauhaus University Weimar, ³ Friedrich-Schiller-University Jena, ⁴ Centro Nacional de Inteligencia Artificial (CENIA), ⁵ University Paris-Saclay, ⁶ Sanofi R&D, ⁷ University of Bologna, ⁸ University of Leipzig
Janek Bevendorff¹, Ian Borrego-Obrador², Mara Chinea-Ríos², Marc Franco-Salvador², Maik Fröbe³, Annina Heini⁴, Krzysztof Kredens⁴, Maximilian Mayerl⁵, Piotr Pęzik⁴, Martin Potthast⁶, Francisco Rangel², Paolo Rosso⁴, Efstathios Stamatatos⁷, Benno Stein¹, Matti Wiegmann¹, Magdalena Wolska¹, Eva Zangerle⁵
“Overview of PAN 2023: Authorship Verification, Multi-Author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection Condensed Lab Overview”
Presentation & Talk at CLEF 2023, Thessaloniki, Greece, September 18-21, 2023
¹ Bauhaus University Weimar, ² Symanto Research, ³ WEBIS, ⁴ Aston University, ⁵ University of Innsbruck, ⁶ University of Leipzig, ⁷ University of the Aegean,
Michael Granitzer¹
“OpenWebSearch.eu – Building an Open Web Index for an open web search ecosystem”
Keynote at Flexible Query and Answering Conference 2023, Palma de Mallorca, September 5-7, 2023
Link
¹ University of Passau
Maik Fröbe¹, Tim Gollub², Benno Stein², Matthias Hagen³, Martin Potthast⁴
“Clickbait Spoiling”
Presentation & Talk at SemEval2023@ACL23 in Toronto, Canada, July 9-14, 2023
¹ WEBIS, ² Bauhaus University Weimar, ³ Martin-Luther-University Halle-Wittenberg, ⁴ University of Leipzig
Michael Granitzer¹
“Foundation AI, LLMs and Generative AI”
Talk, BDVA Activity Group (AG54), online, 29 June 2023
¹ University of Passau
Michael Granitzer¹
“European Web Crawls and LLMs: the OpenWebSearch.eu Project. ”
Invited Talk at Meta-Forum 2023, Brussles, Belgium, 27 June 2023
Link
¹ University of Passau
Christine Plote, Alexander Decker¹
“Digital Native oder Digital Naive? Was jeder über Google & Co wissen sollte …”
Invited Talk at HAW (Hamburg University of Applied Sciences) for the lecture series “Herausforderungen und Kompetenzen in der der Digitalisierung“, Hamburg, Germany, 7 June 2023
¹ Open Search Foundation
Christine Plote¹
re:publica23
Panel participation at re:publica 2023 conference, Berlin, Germany, 6 June 2023
¹Open Search Foundation
Michael Dinzinger¹
“Distributed and legally compliant crawling in the OpenWebSearch.EU project”
Presentation at IRIXYS workshop, INSA Lyon, France, May 10 2023
¹ University of Passau
Maik Fröbe¹, Lukas Gienapp², Martin Potthast², Matthias Hagen³
“Bootstrapped nDCG Estimation”
Presentation & Talk at ECIR 2023 in Dublin, Ireland, April 2-6, 2023
¹ WEBIS, ² University of Leipzig, ³ Martin-Luther-University Halle-Wittenberg
Maik Fröbe¹, Matti Wiegmann², Nikolay Kolyada², Bastian Grahm³, Theresa Elstner³, Frank Loebe³, Matthias Hagen⁴, Benno Stein², Martin Potthast³
“Continuous Integration for Reproducible Shared Tasks with TIRA.io”
Presentation & Talk at ECIR 2023 in Dublin, Ireland, April 2-6, 2023
¹ WEBIS, ² Bauhaus University Weimar, ³ University of Leipzig, ⁴ Friedrich-Schiller-University Jena
Stefan Voigt¹
“Auf dem Weg zu einer offenen Europäischen Internetsuche – Wie ein offener Webindex Europas digitale Souveränität fördert.”
Webinar at Gesellschaft für Informatik, online, 23 March 2023
¹ Open Search Foundation
Christine Plote¹ , Stefan Voigt¹
Safer Internet Day 2023
Booth at Safer Internet Day Conference 2023, Berlin, Germany, 14 February, 2023
¹Open Search Foundation
Michael Dinzinger¹ , Aurora Gonzalez Vidal²
“Two sister initiatives for a paradigm change in open search and discovery on the internet”
Contributed Lightning Talk at FOSDEM 2023, Brussles, Belgium, February 5, 2023
Link
¹ University of Passau, ² NGI Search
Stefan Voigt¹, Michael Granitzer², Isabell Claus³, Christine Plote¹
“Towards an Open European Internet Search”
Online lecture at DG CONNECT University Online Session, January 19, 2023
Link
¹ Open Search Foundation, ² University of Passau, ³ thinkers.ai
Christine Plote¹, Stefan Voigt¹
“Die Internetsuche auf dem Prüfstand – Warum wir Alternativen zu Google & Co brauchen,
und was jeder von uns tun kann”
Webinar at BayernLab, online, January 11, 2023
Link
¹ Open Search Foundation
2022
Katerina Slaninova¹, Jan Martinovic¹, Vit Vondrak¹
SC 2022
Booth Presentation at SC 2022, Dallas, USA, November 13-18, 2022
¹IT4I
Michael Granitzer¹, Per Öster², Lukas Vojacek³
“Towards an Open Web Search Infrastructure”
Contributed talk at European Open Science Cloud Symposium, Prague, Czech Republic, November 11 2022
Link
¹ University Passau, ² CSC, ³ IT4I
Stefan Voigt¹
Invited Talk
EC High Level Meeting for the future of the internet in Prague, Czech Republic, November 2, 2022
¹ Open Search Foundation
Michael Granitzer¹
“Piloting Open Websearch: OpenWebSearch.eu”
Invited Talk at 4th Open Search Symposium #ossym22, Geneva, Switzerland, October 10 2022
Link
¹ University Passau
Tim Smith¹
“Search, Open Search and Science online”
TED Talk at TEDx Verbier, September 3 2022
Link to Stream
¹ European Organization for Nuclear Research (CERN)



