1. REFERENCE
    11 min read

    So you want Database Versioning?

    Tim Sehn|

    Here at DoltHub, we've had a lot of success with our "So you want..." series of blog posts helping people find Dolt when they are looking for it. Dolt is a lot of things. Dolt is a version controlled database, a Git database, Git for data, data versi...

    Read More
0 matching articles.
  1. REFERENCEWEB
    8 min read

    How GraphQL Saved Us from the gRPC Dumpster Fire We Created

    DoltHub is the online data community powered by Dolt, the version-controlled SQL database. In the previous article in this series, we took an overview of DoltHub's front-end architecture. In this article, we'll take a look at the pit of sadness our d...

    Read More
  2. REFERENCE
    6 min read

    The Dolt Commit Graph and Structural Sharing

    Dolt is a SQL database that provides Git-like functionality, including clone, push, pull, branch, and merge. This post is part of a series exploring how Dolt stores table data. In our previous post, How Dolt Stores Table Data, we explored a unique so...

    Read More
  3. 10 min read

    Using Dolt to Manage Train/Test Splits

    Twitter is wonderful sometimes. Train/Test Tweet We don't know Aaron. He finds us on Twitter, asks a great question, makes us think, and prompts a blog post. How can you use Dolt to manage train/test splits for your Machine Learning models? Dolt i...

    Read More
  4. INTEGRATION
    7 min read

    Using Dolt with the JetBrains DataGrip SQL Workbench

    Dolt has been rapidly expanding its capabilities as a SQL server recently. We've done a lot of work to get the dolt sql-server command to be a stable peer to the built-in SQL shell, with all the same capabilities. In the last month we've expanded the...

    Read More
  5. SQL
    6 min read

    Joining Multiple Repositories with SQL Queries

    In our blogs we have shown over and over again how easy it is to clone data from DoltHub and immediately start querying it with SQL. We are constantly working on improving our data catalog. As we do, there emerge more occasions where you can derive i...

    Read More
  6. 3 min read

    Adopting go-mysql-server

    go-mysql-server is the SQL query execution engine that powers Dolt and DoltHub. Today we are excited to announce that we are adopting the project after its founding company ceased operations. Our fork of the project has over 400 additional commits th...

    Read More
  7. DATASET
    5 min read

    April Dataset Spotlight

    This blog entry is the first in a new series. Every month we will highlight some interesting datasets on DoltHub. The focus will be on new or updated datasets but sometimes we'll shed fresh light on a classic. For those new to Dolt and DoltHub, Dolt...

    Read More
  8. 10 min read

    Dolt and DoltHub: Publish Using CSVs

    Dolt is a SQL database with Git-style versioning. DoltHub is a place to share Dolt repositories. Dolt is Git for data. DoltHub is GitHub for Dolt. We want to host your public data on DoltHub. We think Dolt and DoltHub provide the best sharing model a...

    Read More
  9. 4 min read

    Introducing Dolt to SQL sync

    Background While building Dolt and DoltHub, we have had many conversations with our users. They all share an interest in finding better ways to manage data. They recognize that writing code to massage CSV, JSON, and other less well known formats, in...

    Read More
  10. 5 min read

    Using Dolt to Find Test Regressions

    Dolt is Git for data. It's a database that lets you clone, fork, branch, merge and diff. This is a really cool technology that has a lot of uses, but today we're going to focus on just one: using Dolt SQL to find regressions in test results. Dolt SQ...

    Read More
  11. DATASET
    5 min read

    Common Vulnerabilities and Exposures in Dolt

    TLDR: The NVD is a lot more useful when you can simply clone it and query it. The National Vulnerability Database (NVD) is the authoritative source for the publication of Common Vulnerabilities and Exposures (CVE). The vulnerabilities cataloged in t...

    Read More
  12. DATASET
    13 min read

    28 grams of Cannabis Data Sets

    Happy 4/20! Today is April 20th, the unofficial holiday of marijuana afficionados the world over. Happy 4/20! Or, as we in the data business like to say, Happy 20%! 4/20 is 1/5 is 20% Recreational marijuana has been legalized in a dozen US states,...

    Read More
  13. DATASET
    4 min read

    F*#%! you (in 4 languages)

    Dolt is to DoltHub as Git is to GitHub - except with Dolt, the unit of versioning is SQL tables. Dolt also has Git-like semantics such as pull, branch and merge. By running dolt pull in a Dolt repository, you know you are getting the most up-to-date ...

    Read More
  14. 10 min read

    How Dolt Types Work

    UPDATED FEBRUARY 10, 2021: Updated the final table with the types that have been added to Dolt since the article was first written. When we started on Dolt, our goal was to apply Git's idea of versioning to data. Whereas Git versions files, Dolt ver...

    Read More
  15. DATASETWEB
    11 min read

    Coronavirus State Actions Dataset: A Use Case for Pull Requests

    As COVID-19 continues to affect the lives of millions of people around the world, having the most recent and accurate information is an increasingly important tool to help combat the disease. We've been tracking COVID-19 cases for a few months in ou...

    Read More
  16. 8 min read

    Dolt and DoltHub: Become a Publisher

    Dolt is a SQL database with Git-style versioning. In Git the unit of versioning is files. In Dolt, the unit of versioning is SQL tables. Dolt will eventually support 100% of the Git command line and 100% of MySQL SQL. Moreover, anything you can do on...

    Read More
  17. FEATURE RELEASEWEB
    8 min read

    Data CI with DoltHub Webhooks

    Dolt and DoltHub are Git and GitHub for data. The same way that GitHub enables collaboration on source code repositories in Git format, DoltHub enables collaboration on data repositories in Dolt format. A very common workflow on GitHub involves usin...

    Read More
  18. 6 min read

    Tracking SQL Correctness and Performance Regressions in Dolt

    Tracking Dolt's SQL regressions As part of our journey to make Dolt a great SQL database, we set out to track the correctness of Dolt’s SQL engine against a suite of SQL tests called the sqllogictests. These tests are what we use to measure how clos...

    Read More
  19. REFERENCE
    14 min read

    Dolt for Git Noobs

    TL;DR Dolt is a SQL database with built-in Git versioning, branching, and distribution semantics that makes collaborating on and distributing data effortless. What Git does for files, Dolt does for data. Where Git versions files, allowing for fine-g...

    Read More
  20. REFERENCE
    8 min read

    How Dolt Stores Table Data

    Dolt is Git for data. It's a SQL database that lets you clone, branch, diff, merge, and fork your data just like you can with a filesystem tree in Git. This blog post explores one of the fundamental datastructures that underlies Dolt's implementation...

    Read More
JOIN THE DATA EVOLUTION

Get started with Dolt

Or join our mailing list to get product updates.