Clustrix
Private | |
Industry | Computer database |
Founded |
San Francisco, California, U.S. (December 15, 2006 ) |
Founder | Paul Mikesell, Sergei Tsarev, Eric Hoffman |
Headquarters |
201 Mission Street San Francisco, CA, United States |
Area served | Worldwide |
Key people |
Mike Azevedo (CEO) Sergei Tsarev (Founder) Paul Mikesell (Founder) |
Products | Clustrix Database Server |
Number of employees | 40–50 |
Website |
clustrix |
Clustrix, Inc. is a San Francisco-based company that develops a NewSQL[1][2] Database. The company was founded by Paul Mikesell (formerly of EMC Isilon) and Sergei Tsarev (developer of Simple Time-series Database) and is headed by Mike Azevedo.[3] The company is privately held, and is backed by HighBAR Ventures,[4] Sequoia Capital, U.S. Venture Partners (USVP), and ATA Ventures.[5] Clustrix is a distributed primary SQL database.
Market
Clustrix is a scale-out SQL database and part of what are often called the NewSQL databases (modern relational database management systems), which have started to gain mind share[6] closely following the NoSQL movement.
Clustrix is a mature product, based on support for stored procedures and that it was designed and built before competitive NewSQL databases. The product launched in 2006,[7] and has served customers since 2008.[8] The primary databases like Microsoft SQL Server and MySQL supported online transaction processing and online analytical processing but were not distributed. Clustrix occupies this space with a distributed SQL, ACID database that scales transactions[9] and support real-time analytics. Other successful distributed SQL databases are columnar (they don't support primary transaction workload) and focus on offline analytics and this includes EMC Greenplum, HP Vertica, Infobright, and Amazon Redshift. Notable players in the primary SQL database space are in-memory. This includes VoltDB and MemSQL, which excel at low-latency transactions, but do not target real-time analytics. NoSQL competitors, like MongoDB are good at handling unstructured data and read heavy workloads, but do not compete in the space for write heavy workloads (no transactions, coarse grained (DB-level) locking, and no SQL features (like joins), so the NewSQL and NoSQL databases are complementary.
Products
Clustrix is a primary scale-out SQL database. It supports workloads that involve scaling transactions and real-time analytics. The system is a drop-in replacement for MySQL, and is designed to overcome MySQL scalability issues with a minimum of disruption to an enterprise's production activities.[10] It also has built in fault-tolerance features for high availability within a cluster. It has parallel backup and parallel replication among clusters for disaster recovery.
Clustrix's database is available:
- as downloadable software[11]
- in the Amazon Web Services Marketplace[12]
Technology
Query evaluation
The Clustrix database operates on a distributed cluster of shared-nothing nodes using a query to data approach.[13] Here nodes typically own a subset of the data. SQL queries are split into query fragments and sent to the nodes that own the data. This enables Clustrix to scale horizontally (scale out) as additional nodes are added.[14]
Data distribution
The Clustrix database automatically splits and distributes data evenly across nodes with each slice having copies on other nodes.[15] Uniform data distribution is maintained as nodes are added, removed or if data is inserted unevenly. This automatic data distribution approach removes the need to shard and enables Clustrix to maintain database availability in the face of node loss.[16]
Performance
In a performance test completed by Percona,[17] a three-node cluster saw about a 73% increase in speed over a similarly equipped single MySQL server running tests with 1024 simultaneous threads. Additional nodes added to the Clustrix cluster provided roughly linear increases in speed.[18]
References
- ↑ What we talk about when we talk about NewSQL
- ↑ The NewSQL Movement
- ↑ Clustrix Management Webpage
- ↑
- ↑ Marketwatch Article on Clustrix
- ↑ / Google Spanner's most surprising revelation NoSQL is Out and NewSQL is in
- ↑ http://venturebeat.com/2010/05/03/y-combinators-clustrix-rolls-out-databases-that-scale/
- ↑ http://www.dbms2.com/2010/05/12/the-clustrix-story/
- ↑ http://www.insideanalysis.com/2013/01/companies-technologies-to-watch-in-2013/
- ↑ http://www.nytimes.com/external/gigaom/2011/01/17/17gigaom-clustrix-lifts-the-curtain-on-early-database-cust-93427.html?adxnnl=1&partner=&adxnnlx=1295460060-+Q1NmQDKXNGYjDWtdfSQIA
- ↑
- ↑ http://techcrunch.com/2013/01/19/your-database-is-probably-terrible/
- ↑ http://www.clustrix.com/Portals/146389/docs/clustrix_a_new_approach.pdf
- ↑ http://techcrunch.com/2013/01/19/your-database-is-probably-terrible/
- ↑ http://cs.brown.edu/courses/cs227/slides/checkpointing/clustrix.pdf
- ↑ http://cattell.net/datastores/Datastores.pdf
- ↑ Percona-Clustrix TPCC Evaluation
- ↑ Clustrix Delivers Software-Only Kit to Demo Shard-less MySQL Scaling
External links
- Official website
- Clustrix on Twitter
- Sergei Tsarev. "Sergei Tsarev's Blog".
- "Clustrix Keynote at Percona Live London 2011 Convention".