- 11 min read
So you want an AI Database?
Here at DoltHub, we built the world's first version-controlled SQL database: Dolt. What do version control and databases have to do with Artificial Intelligence (AI)? It turns out, a lot. At first, we were skeptical about the AI revolution, but then...
Read More
- 7 min read
Diffing Queries in Dolt
Dolt is a SQL database built to wrangle datasets. Its tables are versioned, queryable, and shareable. We've recreated Git's functionality in a relational database so you can collaborate on data in the same ways you collaborate on code. One of Dolt's ...
Read More - 4 min read
June Dataset Spotlight
Every month we highlight some interesting datasets on DoltHub. The focus is on new or updated datasets but sometimes we shed fresh light on a classic. For those new to Dolt and DoltHub, Dolt is Git for data. Git versions files. Dolt versions SQL tab...
Read More - 6 min read
How DoltHub Integrates Metered Billing with Stripe
Dolt is a SQL database with Git-style versioning. DoltHub is a place on the internet to share Dolt databases. Dolt is and always will be an open source tool and DoltHub hosts all public repositories for free. Users interested in hosting private repo...
Read More - 2 min read
Announcing GitHub Login
Dolt is a SQL database with Git-style versioning. DoltHub is a place on the internet to share Dolt databases. As you can tell from our product names and descriptions, we are inspired by Git and GitHub. We want to bring the same open collaboration mod...
Read More - 8 min read
Efficient Diff on Prolly-Trees
Dolt is a SQL database with Git-like functionality, including branch, merge, diff, clone, push and pull. This is the third post in a series of blog posts that explore the underlying datastructures that are used to table storage and core algorithms in...
Read More - 10 min read
Harnessing our SQL engine tests to run on Dolt
Introduction Dolt is Git for Data, and its built-in SQL engine is an open source project we recently adopted, go-mysql-server. The engine is a general-purpose SQL execution engine that lets integrators read or write to their custom data source with ...
Read More - 8 min read
Introducing Cell History Inspection on DoltHub
Dolt and DoltHub are Git and GitHub for data. Having a versioned database makes collaborating on data more fluid and reliable in the same way that Git improves source code collaboration for software engineers. Using both Git and GitHub, engineers are...
Read More - 5 min read
Doltpy 1.0.0
Background Dolt is a SQL database that stores data in a commit graph, and offers a Git-like interface for management. It offers a command-line-interface (CLI) that provides managing database level considerations such as how and where to start a serv...
Read More - 5 min read
Learn SQL with Real Data using Dolt
Dolt is a SQL database with Git-style versioning. DoltHub is a place on the internet to share Dolt databases. We think these tools can help people learn and perfect their SQL skills like no other database. This blog explains how. Get started quickly...
Read More - 3 min read
May Dataset Spotlight
Every month we highlight some interesting datasets on DoltHub. The focus is on new or updated datasets but sometimes we shed fresh light on a classic. For those new to Dolt and DoltHub, Dolt is Git for data. Git versions files. Dolt versions SQL tab...
Read More - 5 min read
Delivering Declarative Data to DoltHub with GraphQL
DoltHub is GitHub for data. As you might imagine, the data-fetching needs on the front end of such an application are intense. In the previous article in this series, we saw how working directly with our gRPC API was making our front-end team rather ...
Read More - 7 min read
Extending SQL Sync to Postgres
Background Dolt is Git for data. It is a relational database that implements a storage layout similar to a commit graph, allowing users to clone, branch, and merge structured data. We believe the ability to clone and pull a remote dataset, and insta...
Read More - 6 min read
Introducing Secondary Indexes
Dolt is a SQL database with Git-style versioning. We're constantly adding new and exciting SQL features, and secondary indexes are one of them! This blog goes over what they are, why they're useful, and how they're implemented in Dolt. What are inde...
Read More - 6 min read
Dolt as an Application Server
A question we have been asked numerous times is, "Can Dolt be used as an application server"? This has driven a lot of conversations internally about the use cases of a versioned database server, and led to some very technical discussions about concu...
Read More - 6 min read
Distribute Data with Dolt, not APIs
Application Programming Interfaces (APIs) are the dominant mode of distributing data on the internet. Twitter debates in the data science community about Comma Separated Value (CSV) files vs APIs have flared up lately. We think both of these options ...
Read More - 8 min read
How GraphQL Saved Us from the gRPC Dumpster Fire We Created
DoltHub is the online data community powered by Dolt, the version-controlled SQL database. In the previous article in this series, we took an overview of DoltHub's front-end architecture. In this article, we'll take a look at the pit of sadness our d...
Read More - 6 min read
The Dolt Commit Graph and Structural Sharing
Dolt is a SQL database that provides Git-like functionality, including clone, push, pull, branch, and merge. This post is part of a series exploring how Dolt stores table data. In our previous post, How Dolt Stores Table Data, we explored a unique so...
Read More - 10 min read
Using Dolt to Manage Train/Test Splits
Twitter is wonderful sometimes. Train/Test Tweet We don't know Aaron. He finds us on Twitter, asks a great question, makes us think, and prompts a blog post. How can you use Dolt to manage train/test splits for your Machine Learning models? Dolt i...
Read More - 7 min read
Using Dolt with the JetBrains DataGrip SQL Workbench
Dolt has been rapidly expanding its capabilities as a SQL server recently. We've done a lot of work to get the dolt sql-server command to be a stable peer to the built-in SQL shell, with all the same capabilities. In the last month we've expanded the...
Read More - 6 min read
Joining Multiple Repositories with SQL Queries
In our blogs we have shown over and over again how easy it is to clone data from DoltHub and immediately start querying it with SQL. We are constantly working on improving our data catalog. As we do, there emerge more occasions where you can derive i...
Read More