Skip to content

Latest commit

 

History

History
14 lines (9 loc) · 513 Bytes

File metadata and controls

14 lines (9 loc) · 513 Bytes

software-mentions-dataset-analysis

Analyses of software mentions and dependencies

What this dataset is

The software-mentions dataset is a collection of ML-identified mentions of software detected in about 24,000,000 academic papers.

Getting Started

Getting the Parquet files

If you want to extract the .parquet tables yourself, or work with the original dataset, see Extracting Tables. Otherwise, you can download the tables in a friendlier format from (INSERT LOCATION).