Posted On: Apr 14, 2022
Read below for the 16 new or updated datasets from Space Telescope Science Institute, DNAStack, National Archives and Records Administration, and others available on the Registry of Open Data in the following categories.
Astronomy:
- Galaxy Evolution Explorer Satellite managed by Space Telescope Science Institute
Climate and weather:
- CMIP5 Probabilistic Downscaling Dataset from NOAA
- NOAA Real-Time Mesoscale Analysis from NOAA
- Servicio Meteorológico Nacional (SMN) Hi-Res Weather Forecast over Argentina from SMN, the National Weather Service of Argentina
- NOAA Unified Forecast System Weather Model (UFS-WM) Regression Tests from NOAA
Geospatial:
- Cloud to Street - Microsoft Flood and Clouds Dataset managed by Radiant Earth Foundation
Life sciences:
- TIGER Training from Radboud University Medical Center
- DNAStack COVID19 SRA Data from DNAstack
- Open Bioinformatics Reference Data for Galaxy from Galaxy and Bioconductor Projects
- GATK Structural Variation (SV) Data from Loka Inc.
Machine learning:
- YouTube 8 Million - Data Lakehouse Ready managed by Amazon Web Services
- Consented Activities of People from Visym Labs
- MultiCoNER Dataset from Amazon
- TSBench managed by AWS
Statistical and regulatory:
- 1950 Census from National Archives and Records Administration
Cultural
- Ukrainian Cultural Heritage Web Archive from Saving Ukrainian Cultural Heritage Online (SUCHO)
Looking to make your data available? The AWS Open Data Sponsorship Program covers the cost of storage for publicly available, high-value, cloud-optimized datasets. We work with data providers who seek to:
- Democratize access to data by making it available for analysis on AWS
- Develop new cloud-native techniques, formats, and tools that lower the cost of working with data
- Encourage the development of communities that benefit from access to shared datasets