1. REFERENCE
    11 min read

    So you want Database Versioning?

    Tim Sehn

    Here at DoltHub, we've had a lot of success with our "So you want..." series of blog posts helping people find Dolt when they are looking for it. Dolt is a lot of things. Dolt is a version controlled database, a Git database, Git for data, data versi...

    Read More
  1. REFERENCE
    7 min read

    Dolt Implementation Notes — Push And Pull On a Merkle DAG

    Dolt is a SQL database with Git-like functionality, including branch, merge and diff and push and pull to remotes. This is a post in a series of posts about the internal workings of some of the core algorithms that underlie Dolt's implementation. The ...

    Read More
  2. INTEGRATION
    5 min read

    Dolt SQL Server MySQL Client Support

    Dolt is a SQL database with Git-style versioning. Dolt ships with a MySQL compatible server that you can start on a repository using dolt sql-server. Once started, you can then connect to the running server using standard MySQL clients. We now suppo...

    Read More
  3. USE CASE
    3 min read

    Dolt as a Data Management Service

    Dolt is a version controlled SQL database. What that looks like in practice is a SQL engine sitting on top of a commit-graph-like storage format. Dolt SQL is a superset of MySQL that provides access to the database at every point in the commit graph....

    Read More
  4. DATASET
    4 min read

    August Dataset Spotlight

    Every month we highlight some interesting datasets on DoltHub. The focus is on new or updated datasets but sometimes we shed fresh light on a classic. For those new to Dolt and DoltHub, Dolt is Git for data. Git versions files. Dolt versions SQL tab...

    Read More
  5. 6 min read

    Dr. Discord, or: How we Learned to Stop Worrying and Love Public Chat

    Executive summary: We are a small startup team building a new database tool called Dolt, which is Git for Data. This is the story of how we chose to use Discord for our open source project. You can join our server now! The Fermi Paradox of open sour...

    Read More
  6. 5 min read

    SQL Sync for Schema with SQL Alchemy

    Dolt is a version controlled SQL database. It behaves like a traditional relational database in that it offers a SQL interface for data and schema management, but the underlying data structure is a commit graph inspired by Git. One natural use-case i...

    Read More
  7. FEATURE RELEASE
    5 min read

    Announcing DoltHub SQL API

    Dolt is Git for data, a relational database built to create, publish and consume datasets. DoltHub hosts a growing collection of public open datasets stored as Dolt databases. DoltHub allows you to explore data through its SQL query interface. We're exc...

    Read More
  8. DATASET
    11 min read

    FBI Crime Data and the Future of Data Distribution

    Dolt is Git for data and DoltHub hosts a growing collection of public open datasets. Recently, we created dolthub/fbi-nibrs reflecting the FBI's National Incident Based Reporting System (NIBRS) crime data. Law enforcement agencies from around the Uni...

    Read More
  9. WEB
    4 min read

    Open Source Cypress Testing Suite

    Dolt is Git for data and DoltHub is our web application that hosts Dolt repositories. At the beginning of the year we redesigned DoltHub and decided to try out Cypress as our end-to-end testing solution (similar to how we use Bats tests for Dolt). L...

    Read More
  10. DATASET
    6 min read

    Collaborative GPT-3 Dataset

    Dolt is Git for data. Recently, we've been thinking a lot about what could be Dolt's Linux. A reader of that blog had a suggestion, an open GPT-3 dataset. Dolt really shines as a collaborative database where many users are making distributed edits. ...

    Read More
  11. WEB
    7 min read

    Testing DoltHub Using Cypress

    We use Cypress to test DoltHub in production. Learn how we set up Cypress and write tests for our pages against different screen sizes.

    Read More
  12. 8 min read

    Data Integrity for Open Data

    Open Data Validation: Recently an article made the rounds at our company about "data integrity" checks. The article advocates that in the absence of perfect code that never corrupts data, it's wise to have "data integrity checks" that ensure data qua...

    Read More
  13. FEATURE RELEASE, SQL
    8 min read

    Implementing subqueries in go-mysql-server

    Dolt is Git for data. Git versions files. Dolt versions SQL tables. Dolt's SQL engine is go-mysql-server, which is an open source project that we adopted a few months ago. Today we're excited to announce better support for subqueries in the engine, a...

    Read More
  14. DATASET
    3 min read

    July Dataset Spotlight

    Every month we highlight some interesting datasets on DoltHub. The focus is on new or updated datasets but sometimes we shed fresh light on a classic. For those new to Dolt and DoltHub, Dolt is Git for data. Git versions files. Dolt versions SQL tab...

    Read More
  15. 6 min read

    The Anatomy of Open Data Projects

    A core motivation for building DoltHub was to empower organizations to collaboratively create and maintain high quality data assets that they could collectively depend on. This is very much analogous to GitHub. Analogies are powerful ways to articula...

    Read More
  16. DATASET
    5 min read

    Scraping LinkedIn

    On June 13th, 2016 Microsoft acquired LinkedIn for $26.2 billion due to its ability to successfully monetize the resumes of its users. They have proven the value of a resume database and sell premium services that let recruiters search this database ...

    Read More
  17. 9 min read

    Data Dependencies Using DoltHub, an Example

    Introduction: In the past we have blogged about the IRS Sources of Income (SOI) data that we harvested and published as a Dolt database. We presented a compelling visualization that was relatively straightforward to create using that database. It was...

    Read More
  18. 7 min read

    Being a Startup in COVID-19 Times

    Today, we're taking a break from our regularly scheduled Dolt and DoltHub content to talk about our experience as a ten person startup in Los Angeles over the past few months as we've all dealt with this pandemic. In the beginning... I can't say w...

    Read More
  19. 8 min read

    In Search of Dolt's Linux...

    Dolt is a SQL database with Git-style versioning. DoltHub is a place on the internet to share Dolt databases. In this blog post we discuss our search for Dolt's Linux. Git Git was built to manage the Linux open source project. Lore has it that Linu...

    Read More
  20. FEATURE RELEASE, WEB
    2 min read

    Announcing Username and Password Login

    DoltHub is a web application for hosting and collaborating on Dolt repositories. Until now, DoltHub has only supported creating accounts and signing in with third-party providers - currently Google and GitHub. We're excited to announce that DoltHub n...

    Read More