February Dataset Spotlight

2 min read

It's that time. Our February dataset spotlight here at DoltHub. For new folks, Dolt is a SQL database with git-like versioning and DoltHub is a place on the internet to share Dolt databases. This monthly feature keeps you updated on Data Bounties and popular Dolt databases.

Bounties

We are excited to continue updating you about our progress on Data Bounties. We have two active data bounties with one finishing imminently. We will be launching another in the next week or so. We completed one bounty in February.

Hospital Price Transparency

Link: dolthub/hospital-price-transparency
Bounty: $10,000
End Date: March 1, 2021

On January 1, 2021, a US law was passed requiring hospitals to publish their prices in human and machine readable format. We are assembling the best open dataset of hospital prices in the US to aid researchers. The bounty started January 14 and ends March 1. So far, we have 1375 hospitals and over 65M prices covered. We have 7 open PRs to review.

US College Course Catalogs

Link: dolthub/national-course-catalog-us
Bounty: $10,000
End Date: March 18, 2021

We want to build a database of US College Course catalogs. Unlike previous bounties, the data generated by this bounty will become private once the bounty ends. This is a new one for us. People with which we have a previous business relationship came to us wanting to use bounties to collect a private dataset. They are fronting the prize money for the bounty. We mulled it over, discussed a few options, and decided to give it a try in this form. Lots more work to do. We have 3 accepted PRs and 5 outstanding PRs, each representing a school.

US Presidential Election Precinct Results

Link: dolthub/us-president-precinct-results
Bounty: $25,000
End Date: February 14, 2021

We built a great database of US Presidential Precinct results. We discussed the results in this blog post. The dataset generated a bit of controversy with OpenElection but we worked it out. Feel free to use the database for any purpose. We'll be working to add more data with OpenElections.

Popular Datasets

The five most viewed DoltHub datasets for the month of December:

  1. dolthub/us-president-precinct-results
  2. dolthub/hospital-price-transparency
  3. orioncri/PerfTest
  4. dolthub/nfl-play-by-play
  5. dolthub/national-course-catalog-us

Conclusion

That's it for this month. Interested in participating in data bounties? Come say hello on our Discord and be a part of our data community.

SHARE

JOIN THE DATA EVOLUTION

Get started with Dolt