GraphLab

GraphLab
Developer(s) Carnegie Mellon University
Stable release v2.2 / July 1, 2013 (2013-07-01)
Development status Active
Written in C++
Operating system Linux, MAC OS
Type graph processing framework
License proprietary
Website http://graphlab.org/

GraphLab is a graph-based, high performance, distributed computation framework written in C++. The GraphLab project was started by Prof. Carlos Guestrin of Carnegie Mellon University in 2009. It is an open source project using an Apache License. While GraphLab was originally developed for Machine Learning tasks, it has found great success at a broad range of other data-mining tasks; out-performing other abstractions by orders of magnitude.[1][2]

Motivation

As the amounts of collected data and computing power grows (multicore, GPUs, clusters, clouds), modern datasets no longer fit into one computing node. Efficient distributed/parallel algorithms for handling large scale data are required. The GraphLab framework is a parallel programming abstraction targeted for sparse iterative graph algorithms. GraphLab provides a high level programming interface, allowing a rapid deployment of distributed machine learning algorithms.[3] The main design considerations behind the design of GraphLab are:

Main features of GraphLab are:

GraphLab Toolkits

On top of GraphLab, several implemented libraries of algorithms:

Award Winning Software

A solution based on Graphlab collaborative filtering library won the 5th place in ACM Yahoo! KDD CUP challenge, track1, out of more than 1000 participants. LeBuShiShu team used a mixture of 12 different algorithms and deployed 10,000 CPU hours on BlackLight supercomputer.[4] Most of the utilized algorithms and techniques are now part of the GraphLab Collaborative FIltering Toolkit.

Dato Inc.

Dato Inc. company (formerly GraphLab inc.) was founded by Prof. Carlos Guestrin from University of Washington on May 2013 to continue development support of the GraphLab open source project. Dato Inc. raised 6.75M$ from Madrona and New Enterprise Associates in A round, and 18.5M$ in B round from Vulcan Capital and Opus Capital, as well as Madrona and New Enterprise Associates.[5]

References

  1. Joseph Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, Carlos Guestrin (2012). "PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs." Proceedings of Operating Systems Design and Implementation (OSDI).
  2. Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin and Joseph M. Hellerstein (2012). "Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud." Proceedings of Very Large Data Bases (PVLDB).
  3. Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin and J. Hellerstein. GraphLab: A New Framework for Parallel Machine Learning. In the 26th Conference on Uncertainty in Artificial Intelligence (UAI), Catalina Island, USA, 2010
  4. Yao Wu, Qiang Yan, Danny Bickson, Yucheng Low, Qing Yang. Efficient Multicore Collaborative Filtering. In ACM KDD CUP workshop 2011.
  5. GraphLab CrunchBase Profile http://www.crunchbase.com/company/graphlab

External links

This article is issued from Wikipedia - version of the Tuesday, October 13, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.