DataSet/DataTable - Search News

Chemists ran 50,688 reactions to make a huge open dataset

The dataset, which the researchers have made available on the Open Reaction Database, is nearly five times as large as the ...

Wired

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft

Harvard University announced Thursday it’s releasing a high-quality dataset of nearly 1 million public-domain books that could be used by anyone to train large language models and other AI tools. The ...

MIT Technology Review

A major AI training data set contains millions of examples of personal data

Personally identifiable information has been found in DataComp CommonPool, one of the largest open-source data sets used to train image generation models. Millions of images of passports, credit cards ...

Frontiers

Showcasing FAIR² Data Articles: Unlocking Trustworthy, AI-Ready Scientific Data for Reuse and Impact in Space Technologies

Scientific knowledge is fundamentally built on data; yet, for too long, research datasets have remained siloed, poorly documented, and inconsistently ...

VentureBeat

Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL

AI has transformed the way companies work and interact with data. A few years ago, teams had to write SQL queries and code to extract useful information from large swathes of data. Today, all they ...
