Matei Zaharia
Matei Zaharia | |
---|---|
Citizenship | Canada |
Nationality | Romania |
Fields | Computer Science |
Institutions | Massachusetts Institute of Technology |
Alma mater | UC Berkeley (Ph.D.), University of Waterloo (B.S.) |
Thesis | An Architecture for Fast and General Data Processing on Large Clusters (2013) |
Doctoral advisor | Ion Stoica, Scott Shenker |
Known for | Apache Spark, Apache Mesos |
Website people |
Matei Zaharia is a Romanian-Canadian computer scientist specializing in big data, distributed systems, and cloud computing. He is a co-founder and CTO of Databricks, and an assistant professor of computer science at the Massachusetts Institute of Technology.[1] He created the Apache Spark project and co-created the Apache Mesos project during his PhD at UC Berkeley, and also designed the core scheduling algorithms used in Apache Hadoop, including the most widely used fair scheduler.[2]
Biography
Matei Zaharia was born in Romania. His family later moved to Canada, where he attended Jarvis Collegiate Institute in Toronto for high school and then studied computer science at the University of Waterloo. Upon graduating from Waterloo, he received the Governor General’s Academic Silver Medal for highest academic standing. He went on to UC Berkeley, where he earned a Ph.D. in Computer Science in 2013 under the supervision of Ion Stoica and Scott Shenker.[3]
He participated in programming contests, winning two IOI silver medals in high school. He was on the University of Waterloo team that competed in the ACM ICPC programming competition in 2004 and 2005. The team placed 15th in 2004 and won a gold medal in 2005 (3rd place worldwide).[4] In both years his team also earned the title of North American champions.
During his Ph.D. studies, he created the Apache Spark project and co-created the Apache Mesos project. He also designed and implemented the core scheduling algorithms used in Apache Hadoop.[5]
He received Best Paper awards at NSDI 2012 and SIGCOMM 2012, an Honorable Mention for the Community Award at NSDI 2012, and a Best Demo Award at SIGMOD 2012. Together with Reynold Xin, Parviz Deyhim, Xiangrui Meng, and Ali Ghodsi, he holds the 2014 world record in the Daytona GraySort benchmark, set using Apache Spark.[6] In 2015 he received the ACM Doctoral Dissertation Award.[7]
References
- ↑ "How Companies are Using Spark, and Where the Edge in Big Data Will Be". Strata Conference. Retrieved 26 August 2014.
- ↑ "Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling" (PDF).
- ↑ Zaharia, Matei. "An Architecture for Fast and General Data Processing on Large Clusters" (PDF). http://www.eecs.berkeley.edu. Retrieved 29 June 2015.
- ↑ "Programming Contest Resources".
- ↑ "Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling" (PDF).
- ↑ "Sort Benchmark".
- ↑ "ACM Doctoral Dissertation Award 2015".
External links
- Website at MIT
- Chinese translation of his PhD Dissertation by CSDN community, January 2015