OWS.EU Partner in Focus: Leibniz Supercomputing Centre

The Leibniz Supercomputing Centre (LRZ) is the second partner we are introducing following our portrait of the University of Passau. The LRZ forms part of the BADW (Bayerische Akademie der Wissenschaften), providing technical support and supercomputing power, delivers a robust infrastructure for the Open Web Index. The research team of LRZ is guided and supported by the chairman of the board of directors, Prof. Dr. Dieter Kranzlmüller. The team includes Research and Information Management Team leader Megi Sharikadze, Research Data Management Team leader Stephan Hachinger, Research Managers: Shahab Khormali, Jirathana Dittrich and Nana Gratiashvili, Research Associates Mohamad Hayek and Stuart Gordon, and Communications Manager Anita Schuffert. The LRZ team has multiple functions: coordinates the project management and research activities, takes care of financial support to third-party program of the project, contributes to infrastructure work-package, actively participates in dissemination, communication and exploration measures as well as in topics such as governance, legal and ethical aspects of OpenWebSearch.eu.

Shahab Khormali is in charge of project management, with a primary focus on Cascade funding, also known as Financial Support for Third Parties (FSTP), activities to be tackled within the project. We asked him about crucial milestones thus far as well as the outlook for the last project period and beyond.

Shahab Khormali, Leibniz Supercomputing Centre of the BAdW (LRZ), Research Manager

Please describe your tasks (LRZ) in the project. What is your field of expertise that you bring to the project?

Shahab: BADW-LRZ, as one of the foremost European computing centers, contributes to the OpenWebSearch.eu project in two key areas. The first is the technical domain, where, along with the other participating data centers in the consortium, we are responsible for providing a state-of-the-art storage and compute infrastructure. In this context, LRZ’s Research Data Management team, led by Dr. Stephan Hachinger, applies its expertise on developing and managing highly scalable, reliable and secure computing infrastructure, and supports the project in executing core services and storing core data products.

The second area concerns project management and coordination and communication. Within this framework, LRZ’s Research and Information Management (RIME) team, comprising experts in science management and communication and heading by Dr. Megi Sharikadze, leads the “Project Management and Coordination Office (PMCO)” and actively supports the “Open Web Search Ecosystem and Sustainability” and “Outreach and Communication” project goals.

In PMCO, the LRZ team works closely with the project coordinator and partners to oversee the overall project management. We together ensure that the project remains aligned with the project objectives, while progressing efficiently and successfully. We are responsible for delivering optimal project performance, managing resources efficiently, overseeing reporting, maintain effective communication and networking with the relevant stakeholders, supporting international cooperation, fostering innovation and capacity building. Additionally, we coordinate the entire process of third-party contributions, handling everything from the initial preparation and announcement of open calls to the evaluation, awarding, distribution of funds, and final closure of grants. The PMCO oversees that each step is carried out smoothly and efficiently, and maintains clear processes, especially, for continuous communication with all involved third-party partners.

How is the project progressing so far?

Shahab: The project is progressing very well being on track. We have achieved several key milestones, including the successful completion of the mid-term project review with extremely positive feedback from the reviewers. This also included the approval of all deliverables and milestones in the first periodic report. Furthermore, we conducted all three open calls for third-party contributors as planned. The selected third-parties have been successfully integrated into the project. Moreover, we are introducing the project to key stakeholders across Europe and enhancing the project’s visibility within the relevant networks and circles.

What are the challenges you are facing with regard to your tasks?

Shahab: My main task belongs to PMCO activities. In this context, I do not think we have faced any major challenges. However, one important topic for us and the consortium is the continuation of the project in the form of a follow-up project. This relates to our ecosystem and sustainability responsibilities. In this context, we are closely following up on this matter and actively working to address it by applying for and securing new funds.
Another challenge worth pointing out is establishing long-term and productive relationships with industrial stakeholders and policymakers. Engaging effectively with these groups and ensuring ongoing collaboration throughout the project is an area that requires continuous attention and improvement.

Which milestones do you plan to achieve in the next months?

Shahab: A key milestones in the coming months is the completion of the first group of third-party projects, meaning that all partners should timely finalize their projects, and submit their project reports. This will allow us to close these projects officially. In addition to this, we look forward to conducting the mid-term review for the second and third groups of third-party projects in March/April 2025. Another important consideration is that the new end date of the project (due to a six-month prolongation) must be integrated across all project tasks.

What makes the OWS project special?

Shahab: In my opinion, the OpenWebSearch.eu is special and important due to its focus on cutting-edge IT and internet search technologies, which are continuously evolving being highly dynamic. It brings together highly specialized partners with state-of-the-art infrastructure and technologies, ensuring innovation and tangible outcomes. Additionally, the involvement of third-party contributors adds diverse expertise, enriching the project’s outcomes.

Do you already have plans for the time after the project ends?

Shahab: Yes, we have plans for the continuation of the project’s themes and objectives beyond its end date, and we are actively pursuing new funding opportunities to continue the work.

Thank you for the interview!

Read more about LRZ: https://openwebsearch.eu/partners/leibniz-supercomputing-centre-lrz/

First third-party partners finalise their projects to bolster Open Web Search

In November 2024, the first third-party partners completed their projects funded under the Community Programme of OpenWebSearch.EU and delivered crucial results to bolster an European Open Web Search.

The Community Programme plays an essential role in the Horizon Europe project OpenWebSearch.EU. It strengthens community activities around Open Search and allows the integration of new aspects and expertise through a cascade funding scheme. In March 2023 the OpenWebSearch.EU project issued its first open call that asked for contributions tolegally compliant data gathering and identifying legal or economic aspects that enable or block the development and maintenance of an Open Web Index. A team of experts chose six third-party partners to realise their project plans, ranging from legal assessments to the development of software tools, with funding from the OpenWebSearch.EU project. Three partners recently finalized their work and presented the results to the OpenWebSearch.EU consortium. Another three projects were granted a prolongation until the end of March 2025.

Here is an overview of the projects which are already completed

Project MRC 

The MRC project delivered a thorough market potential assessment of the Open Web Search Initiative and an Open Web Index, aimed at creating a European open web search infrastructure. Therefore, Mücke Roth & Company (MRC)evaluated economic and societal impacts, using both top-down and bottom-up methods to ensure a comprehensive analysis across various scenarios, including qualitative feedback from future users. The final report presents a variety of significant benefits of an Open Web Index with broad applicability across Europe and also recommendations for strategic directions and business model adjustments to optimize economic and societal benefits. The team completed their work and the project in April 2024 and presented results to the public in September 2024. The study can be downloaded here: https://openwebsearch.eu/the-project/research-results/market-potential-assessment-of-an-european-open-web-index/

Project LOREN 

The LOREN project analysed the legal constraints and requirements of crawling, storing, aggregating, enriching, and sharing web data operationally in an Open Web Index. The conducted research focuses on the legal aspects of building and operating an Open Web Index, including data protection, copyright, and intellectual property rights. As a result, a report was provided with a comprehensive analysis of the legal constraints and requirements for building and operating an Open Web Index with actionable advice. The project was completed in September 2024.

Project LAW4OSAI

The goal of the LAW4OSAI project was to enable license-aware crawling of content, particularly text and images, by automatically identifying and retrieving content licenses. The LAW4OSAI team investigated technical and legal aspects of the topic. Furthermore, they annotated a data set for the detection of content licenses and developed a resource-efficient Python library that is able to not only detect licenses on webpages but also map those licenses to assets on the webpage to distinguish, e.g., between different licenses for images and text on the same page. Last but not least, a framework to assess content licenses with regard to their compatibility with the training of Machine Learning models was developed and popular open licenses (like Creative Commons) were reviewed based on this framework. The project ended in November 2024 after an extension of two months. More: https://www.utwente.nl/en/bms/law4osai/.

 

The following projects received a prolongation until the end of March 2025

Project Open Console 

The goal of the Open Console project is to build an infrastructure (called Open Console) to share information about websites and thereby improve the availability and quality of knowledge about websites. The team developed a production-ready version of the Open Console, where website owners can create an account and log in to the console. Different types of ownership proofs were implemented as well as a demo service for the Open Web Crawler “OWLer” that was developed in the OpenWebSearch.EU project in order to create the Open Web Index. With this, Open Console enables the communication between the OpenWebSearch.EU project and owners of websites that were crawled by the OWLer. The project was extended for six months with additional funding in order to build another example service for Open Console, called “Lethe”, that will help to handle take-down requests.

 

Project LISA

The LISA project created a legal framework for the development and operation of an Open Web Index. First, the team defined what constitutes illegal content and established the legal duties for operators of an Open Web Index. Legal requirements for take-down requests, including those for criminal content, IP infringements, and data protection were set out. Additionally, the ownership of digital content and of the Open Web Index was clarified and the legal framework necessary for sharing the index was created. Furthermore, the team drafted an End User License Agreement (EULA) anda comprehensive legal framework for the Open Web Index, including governance structures and guidelines as well as best practices for its operation. The project was extended until March 2025 with additional funding to explore and finalise different licensing models for the Open Web Index.

 

Project ALMASTIC

The ALMASTIC project conducted a comprehensive review of legislative norms relevant to the development and implementation of the Open Web Index. The analysis encompassed a total of 50 regulatory instruments at the European Union level. These regulations were assessed to determine their potential impact on the OpenWebSearch.EU project, with findings categorized as low, medium, or high impact. The medium and high impact regulations were prioritized for more in-depth evaluation, focusing on key areas like copyright, data protection or cybersecurity. Furthermore, a final review regarding the Open Web Index Licence developed within the ALMASTIC project was provided. The project was extended for six months with additional funding to conduct a more focused legal framework assessment and broaden the Open Web Index License.

 

The members of the OpenWebSearch.EU project are very satisfied with the outcome of the first call from the Community Programme. Ursula Gmelch from the Open Search Foundation manages the OpenWebSearch.EU community. She explains:

The OpenWebSearch.eu project lives from its community. Our third-party partners in particular contribute tremendously on many of the aspects that are most important to the OpenWebSearch.eu project. They deliver, in a short time, really insightful and valuable results that we use as building blocks for the basis of European Open Web Search. From economic to legal and technical aspects – the results will help us make well-informed decisions and make great strides forward. We are happy to continue this approach with the extension of three of our third-party projects. This will allow them to deepen their research and help us push Open Web Search substantially forward.

 

 

EU research power against Google’s dominance | ZDF heute

OpenWebSearch.eu was mentioned on the website of German newscast ZDF heute

Book author and journalist Stefan Mey wrote up a portrait on the OpenWebSearch.eu project, which has just entered its third year. The Horizon Europe funded project forms part of the Open Search Initiative initiated by Starnberg based Open Search Foundation. The organization has been alligning forces with 13 European research entities since September 2022 to attempt build a European Open Web Index.

In the article OSF chairman Dr. Stefan Voigt who also works as a researcher with consortium partner German Aerospace Center  is quoted as follows: “Our aim is to provide a directory of all content on the World Wide Web as a public service.”

The idea of a free and open web index should promote more diversity in the narrow search engine market. Currently roundabout 3.10 Billion URLs have been crawled, with the status being regularly updated here: https://openwebsearch.eu/the-project/status/

Read the ZDF heute article: https://www.zdf.de/nachrichten/wissen/open-web-search-google-alternative-konkurrenz-monopol-100.html