eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

211
active users

#DataSets

0 posts0 participants0 posts today

Data Rescue Project: Data Rescue Project Launches New Portal. “The Data Rescue Project (DRP) is excited to announce the launch of the DRP Portal—a milestone in our collective effort to protect and preserve at-risk public information. … The Portal makes it easy to discover rescued datasets by government offices sharing the data, topic, and more.”

https://rbfirehose.com/2025/06/25/data-rescue-project-data-rescue-project-launches-new-portal/

My prediction is that we won’t ever get public release of early OpenAI, Google, or even Anthropic #training #datasets.

Why? There are too many rich hard-right conservative backers who need all the misogyny, racism & hate speech to stay there.

We could have just & equal #AI, but we won’t. There’s too much money & power to be made of injustice.

📣 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗦𝘂𝗽𝗽𝗼𝗿𝘁 𝗛𝘂𝗯 𝟱 𝗝𝘂𝗻𝗲: 𝗪𝗼𝗿𝗸𝘀𝗵𝗼𝗽 𝗪𝗲𝗯 𝗦𝗰𝗿𝗮𝗽𝗶𝗻𝗴 𝘂𝘀𝗶𝗻𝗴 𝗣𝘆𝘁𝗵𝗼𝗻 🐍
Is much of the information you need for your #research available on websites, but not as downloadable #datasets or #files? This workshop will introduce you to the basics of #webscraping in a clear, practical way!

Also drop by at the 𝗦𝘂𝗽𝗽𝗼𝗿𝘁 𝗖𝗮𝗳é where experts will be present for your quick (or big;-) questions about #R, #Python, #Statistics, #MachineLearning, #HPC and #Geo!

ℹ️ More information 👉🏼edu.nl/rw7vd

Call For Manuscript Submissions - Real-Time GIS For Disaster Management
--
nature.com/collections/bjdhbfi <-- shared link to submission details
--
[note that I have NO affiliation with this journal, the guest editors, etc]
[I wonder if anybody from FEMA has compiled use case / effectiveness / robustness on/of the #WaffleHouseIndex in the southern USA, especially related to hurricanes?]
#GIS #paper #mapping #spatial #manuscripts #callforpapers #callformanuscripts #submissions #callforsubmissions #realtime #disaster #management #mitigation #prevention #preparedness #response #recovery #risk #hazard #naturalhazard #naturalhazard #emergency #remotesensing #earthobservation #satellite #drone #sensor #socialmedia #WaffleHouseIndex #datasets #AI #InternetOfThings #research #monitoring #evacuation #planning #resourceallocation #hazardmapping #realworld #global

Ready to supercharge your #OpenScience profile?

With #OpenAIREEXPLORE + @ORCID_Org you can seamlessly complete your #ORCID record with all your research outputs, from papers & #datasets to #software tools.

Backed by the @OpenAIREGraph EXPLORE identifies and matches your work, including:

Journal articles
Research data
Software & more

Read the article to learn more openaire.eu/openaire-explore-a

Visit explore.openaire.eu to make your contributions count publicly and properly.

Ready to supercharge your #OpenScience profile?

With #OpenAIREEXPLORE + @ORCID_Org , you can seamlessly complete your #ORCID record with all your research outputs, from papers & #datasets to #software tools.

Backed by the @OpenAIREGraph, EXPLORE identifies and matches your work, including:

-Journal articles
-Research data
-Software & more

Log in with your ORCID → check what’s missing → sync it to your profile in just a few clicks.

Read the article: explore.openaire.eu

"Almost two dozen repositories of research and public health data supported by the National Institutes of Health are marked for “review” under the Trump administration’s direction, and researchers and archivists say the data is at risk of being lost forever if the repositories go down.

“The problem with archiving this data is that we can’t,” Lisa Chinn, Head of Research Data Services at the University of Chicago, told 404 Media. Unlike other government datasets or web pages, downloading or otherwise archiving NIH data often requires a Data Use Agreement between a researcher institution and the agency, and those agreements are carefully administered through a disclosure risk review process.

A message appeared at the top of multiple NIH websites last week that says: “This repository is under review for potential modification in compliance with Administration directives.”
Repositories with the message include archives of cancer imagery, Alzheimer’s disease research, sleep studies, HIV databases, and COVID-19 vaccination and mortality data."

404media.co/nih-archives-repos

404 Media · Massive, Unarchivable Datasets of Cancer, Covid, and Alzheimer's Research Could Be Lost ForeverDays before Robert F. Kennedy Jr. announced that 10,000 HHS staffers would lose their jobs, a message appeared on NIH research repository sites saying they were "under review."
#USA#Trump#Datasets

#ListenBrainz / #MetaBrainz I'm confused. Aren't sponsors the true customer? Why use this? 🤔

On one hand #Music: "Listen together", "Ethical forever"

On the other: #DATASETS

"Some of the world’s biggest platforms such as Google and Amazon, use our data"

"We ask commercial supporters to support us in order to help fund the creation and maintenance of these datasets."

"The following organizations make use of the data-sets published by MetaBrainz"

"Unicorn tier: #Google, #Amazon, #Spotify"

The Physical Sciences Data Infrastructure (PSDI) aims to simplify data management for researchers by integrating and enhancing existing #infrastructures.

It will enable seamless access to high-quality #data from both commercial and open sources, allowing researchers to combine #datasets, share software, models, and experimental or simulation data.

#Archivists Work to Identify and Save the Thousands of #Datasets Disappearing From Data.gov

Datasets aggregated on data.gov, the largest repository of U.S. government open data on the internet, are being deleted, according to the website’s own information. Since Donald #Trump was inaugurated as president, more than 2,000 datasets have disappeared from the database.
#archive

404media.co/archivists-work-to

404 Media · Archivists Work to Identify and Save the Thousands of Datasets Disappearing From Data.govMore than 2,000 datasets have disappeared from data.gov since Trump was inaugurated. But analyzing exactly what happened and where it went is going to take some time.

This is such a cool dataset: 22 different robots demonstrating 527 skills through a collaboration between 21 research institutions.

And the GIFs of all these different robots applying basic motor skills are adorable.

robotics-transformer-x.github.

robotics-transformer-x.github.ioOpen X-Embodiment: Robotic Learning Datasets and RT-X ModelsProject page for Open X-Embodiment: Robotic Learning Datasets and RT-X Models.

The @osi released the #OpenSource #AI definition this week, and stopped short of requiring that #datasets used for training AI models also be openly available.

Side-stepping that debate, I dig into the onus the OSI's decision now places on having better #DatasetDocumentation approaches - without which the data used for training cannot be adequately described.

blog.kathyreid.id.au/2024/11/0