eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

197
active users

#openwebsearch

0 posts0 participants0 posts today

🚀 NGI Forum Day 2 is underway
With the same !
Highlights from the opening talks:

🗣️ @StefanVoigt
Let’s build a public, transparent web search infrastructure like Galileo or Copernicus.

🗣️ @MichaelGranitzer
Search influences behaviour is crucial 20–80%. It's core to LLMs and AI.

Let’s treat as a base public service, built on EU HPC centres.

@NGICommons is part of this adventure

@ngi

🔍 Neues EU-Projekt.. will Websuche offener & ethischer machen: Weniger Abhängigkeit von großen Plattformen, mehr Transparenz & Datenschutz. #OpenWebSearch #EthicalAI #CERN #WWW
👉 home.cern/news/news/computing/

CERNEuropean project to make web search more open and ethicalOn 6 June, the OpenWebSearch.eu consortium released a pilot of a new infrastructure that aims to make European web search fairer, more transparent and commercially unbiased. With strong participation by CERN, the European Open Web Index (OWI) is now open for use by academic, commercial and independent teams under a general research licence, with commercial options in development on a case-by-case basis. The OpenWebSearch.eu initiative was launched in 2022, with a consortium made up of 14 leading research institutions from across Europe, including CERN. The project aims to build a public web index that offers an alternative to existing indexes held by companies like Google (USA), Microsoft (USA), Baidu (China) and Yandex (Russia). Web indexes provide the back-end data infrastructure behind search engines, and today the companies that manage them determine what content is searchable and how it is ranked. Currently, Europe does not have a search index of its own, making it vulnerable to digital dependence.  The OWI offers a clear alternative based on European values. The project’s cross-disciplinary nature, ensuring continuous dialogue between technical teams and legal, ethical and social experts, ensures that fairness and privacy are built into the OWI from the start. “Over thirty years since the World Wide Web was created at CERN and released to the public, our commitment to openness continues,” says Noor Afshan Fathima, IT research fellow at CERN. “Search is the next logical step in democratising digital access, especially as we enter the AI era.” The OWI facilitates AI capabilities, allowing web search data to be used for training large language models (LLMs), generating embeddings and powering chatbots. The CERN team has built key parts of the infrastructure that power the OWI’s crawling and indexing capabilities. This means that it tracks which webpages should be scanned. The system handles about 9 million URLs per hour, which equates to roughly 3 terabytes of public web data a day, with the aim of indexing 30–50% of the text-based web by the end of 2025. “We have already hit our target of indexing one petabyte of openly licensed web data, and our public dashboard helps users monitor that progress,” says Noor. CERN is also contributing to other parts of the project. For example, it is scanning its own public physics content to enhance the OWI, as well as developing an internal index and its own search tools and services. Currently, a prototype of a use case for the OWI is in development: known as “Nooon”, this research-driven search engine is dedicated to people with disabilities who require search engines that surface structured, accessible and representative information while ensuring privacy in both access and contribution. The release of the OWI, which has received funding from the European Union’s Horizon research and innovation programme, comes at a pivotal time. The European Commission’s Invest AI initiative is set to mobilise 200 billion euros for artificial intelligence, and the OWI offers a powerful foundation of open data for innovation. Furthermore, as Microsoft plans to retire access to the Bing index, the OWI will be able to offer an alternative index for European search engines. After two and a half years of intensive research and development, anybody can now request access to the OWI by signing up at openwebindex.eu/auth/login. Note that the project provides a web index, and not a search engine or API, and users wishing to build their own search engines or chatbots will need a working knowledge of how to apply web index data.  Read more: OpenWebSearch website Ethical, open and non-commercial: the the Open Web Search project is designed to provide Europe with the right alternative to existing search engines (home.cern) Towards an unbiased digital world (CERN Courier) Empowering data sovereignty through OpenWebSearch.eu (CERN Computing blog, behind the CERN SSO)  

🚅 Time to take the #OWI to Brussels!
We are excited to co-organize the morning sessions at this year's #NGIForum on 20 June.

Our project leader @grani will speak on the freshly launched #OWI, both looking back and forward.

Following his keynote, there will be exciting policy statements from MEPs Alexandra Geese & Lina Gálvez plus two panel discussions surrounding #WebSovereignty and the future of #OpenWebSearch.

Join NGI Forum in Brussels or online.
Register here: ti.to/ngi/ngi-forum-2025

Starting to see (and getting a bit excited about) some components of openwebsearch.eu, and I was wondering if the EU will finally get its own Common Crawl, like dataset (commoncrawl.org).

It seems the crawling results aren't publicly accessible yet, and there's already some discussion about GDPR implications.

At this pace, we're still far from being able to compete with US-scale open data efforts 🤦‍♂️

#europe #commoncrawl #openwebsearch

🔗 pipeline.shared-search.eu/
🔗 pipeline.shared-search.eu/expl

pipeline.shared-search.euCrawl PipelineShared effort to extract useful data from search engine crawls.

🤩 The time has come - we are kicking off our new #webinar series next week!

As part of our December #CommunityUpdate we invite you to join the webinar "Hands-on utilization of the Open Web Crawler – OWI onboarding" with Michael Dinzinger and Saber Zerhoudi – both from University of Passau.

When: 2nd December, 15:00-16:15 CET
Where: Online via Big Blue Button
Register here: openwebsearch.eu/community/ows

Open Web Search – Promoting Europe‘s Independence in Web SearchCommunity Updates - Open Web Search – Promoting Europe‘s Independence in Web Search

22 papers from 68 authors – #ossym24 offered a broad spectrum of scientific work on on the topics of #internetsearch and #openwebsearch.

Vol. 6 of the International Open Search Symposium proceedings summarises the research results that were presented at the @LRZ_DE in Munich.

👉 Download the issue for free: e-publishing.cern.ch/index.php

🤩 We already look forward to the next year's conference – the #ossym25, taking place from 8 - 10 October 2025 at CSC - IT Center for Science.

@osf #opensearch

e-publishing.cern.ch Proceedings of the International Open Search Symposium

▶ OWS.EU Community Meet-Up @ #ossym24 📢

From today on #OpenSearch enthusiasts meet at the Open Search Symposium at the @LRZ_DE in Garching, close to #Munich.

We are proud to be part of #ossym24 and to have this year the opportunity to invite participants to the first OWS.EU Community Meet-Up @ ossym tonight at 7 PM.

Read more about the #ossym and the Meet-Up on our website: openwebsearch.eu/ows-ossym24/

#OpenWebSearch #NextGenerationInternet #openweb @osf