- 11 min read
So you want an AI Database?
Here at DoltHub, we built the world's first version-controlled SQL database: Dolt. What do version control and databases have to do with Artificial Intelligence (AI)? It turns out, a lot. At first, we were skeptical about the AI revolution, but then...
- 5 min read
Prefix Indexes
If you haven't heard, Dolt is a SQL database with Git versioning. A couple of months ago, a customer asked for prefix indexes, so we implemented them. In this blog, we'll discuss how to use prefix indexes, their benefits, as well as their limitations...
- 10 min read
(Do Not) Let Them Build: Mining Open Data to find NIMBY and YIMBY counties
This is a guest post by Rimantas Lukosevicius. He is a regular bounty contributor and this is his first data analysis blog for us. Who and Where are the NIMBYs? During the second iteration of DoltHub's USA housing price data bounty a large amount...
- 7 min read
Improving Stored Procedure Support
Here at DoltHub, our centerpiece is Dolt, which fuses a MySQL-compatible database with Git-style versioning capabilities. After you install Dolt, all it takes are a few commands to have a running server: mkdir demo cd demo dolt init dolt sql-server ...
- 7 min read
Cooperating with Golang's GC & Fast Blob Writes
Explains how we improved blob write performance.
- 5 min read
Adding Google Analytics 4 to an existing Gatsby and Next.js application
Google Universal Analytics will stop collecting data on July 1, 2023. Learn how to add the new Google Analytics 4 (GA4) property to start collecting data from your Gatsby and Next.js applications.
- 6 min read
Some Useful Patterns for Go's os/exec
A collection of useful patterns for interacting with spawned processes using os/exec.
- 18 min read
So you want Data Quality Control?
A survey of data quality control processes and tools. The article describes the modern data stack and how it evolves, a model for thinking about data quality, and finally a survey of modern, open source data quality tools.
- 8 min read
Three Ways to Import Data into Dolt
Dolt is the first database that versions data like Git versions code. We focused on a few key areas for improving Dolt this year: 1) sharpening MySQL feature parity, 2) smoothing the first hour experience, and 3) chasing MySQL's performance. Today we...
- 3 min read
Hosted Dolt now has Organization Teams
Hosted Dolt now has Organization Teams. Try making an A-Team now.
- 10 min read
Introducing Branch Permissions
Here at DoltHub, our centerpiece is Dolt, which fuses a MySQL-compatible database with Git-style versioning capabilities. After you install Dolt, all it takes are a few commands to have a running server: mkdir demo cd demo dolt init dolt sql-server ...
- 10 min read
What Do Two Dot and Three Dot Mean for Logs and Diffs?
TL;DR Git versions files and Dolt versions data. The diff and log commands in Git and Dolt are useful tools to view what has changed between different revisions. You can control what changes you want to include when listing logs or viewing diffs by ...
- 3 min read
Dolt Supports Every Type
If you haven't heard, Dolt is a version controlled database, kinda like if Git and MySQL had a baby. Not too long ago, we announced partial support for Spatial Data Types. Since then, we've received requests for the rest of the Spatial Types, so we d...
- 16 min read
Pruning test dependencies from Go binaries
We're building Dolt, the world's first SQL database with Git-like version control. Recently, a customer contacted us to let us know that test symbols were making it into their binary when they took a dependency on our go-mysql-server library, which p...
- 9 min read
- 7 min read
So you want Soft Deletes?
Explains soft deletes in databases: what they are, why and how to use them. Introduces a new version controlled database concept where every delete is a soft delete.
- 9 min read
I Am Healthcare Transparency and So Can You
If you're like me you've spent a lot of the last two months thinking about how to parse huge JSON files. That's because some of the most valuable data in the world of healthcare is buried in them. These files are big (sometimes 100GB+) and annoying ...
- 4 min read
Migrate your Dolt database to the new format on DoltHub
DoltHub is the central repository for Dolt's version-controlled databases. We like to call it the GitHub of databases. It lets you query, share, and collaborate on Dolt databases. Last month, we brought Dolt's new storage format to DoltHub. It's al...
- 5 min read
Dockerception: Leonardo DiCaprio and DoltLab v0.7.0
In early October we released DoltHub Jobs, our latest backend change to DoltHub that lets DoltHub handle large, long-running write operations like file import and pull-request merge. We also recently released DoltLab v0.7.0 which includes support for...
- 2 min read
Announcing the $10,000 chargemaster URLs bounty
DoltHub is building a database of hospitals and their chargemasters for Payless.Health. A complete list of URLs will help them build out their comprehensive search engine of hospital prices, spanning all 7,000+ hospitals in the US. Background In 20...