## Alice Zhengalicez at dato dot com |

I currently work at Dato (formerly known as GraphLab). At Dato, we are developing a fast and scalable machine learning platform for Big Data analytics. Prior to Dato/GraphLab, I was a researcher in the Machine Learning Group at Microsoft Research, Redmond. Before joining Microsoft, I was a postdoc at Carnegie Mellon University's Auton Lab and the Parallel Data Lab. I received my B.A. and Ph.D. degrees from U. C. Berkeley.

I am interested in making machine learning easy to use. Up to now, applying machine learning to data analysis has required close collaborations between domain experts who understand the data and machine learning experts who understand the algorithms. The problem is that it is easy to scale up the size of the data, but much harder to scale up the number of experts. My research focuses on easing the dependence on expertise by making learning algorithms more automated, their outputs more interpretable, and the labeling tasks simpler. In the past, I have worked on using machine learning to diagnose ailing computer systems and software. Some of the lessons learned from that domain continues to drive my research today.

**Evaluating Machine Learning Models—A Beginner's Guide to Key Concepts and Pitfalls**. Alice Zheng. O'Reilly. Sep 2015. [website]

**Gradient Boosted Feature Selection**. Zhixiang (Eddie) Xu, Gao Huang, Kilian Q.Weinberger, Alice X. Zheng. In*Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)*, New York, 2014. [pdf]

**Lazy Paired Hyper-Parameter Tuning**. Alice X. Zheng and Mikhail Bilenko. In*Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013)*, Beijing, China, 2013. [webpage] [slides]

**Fast Image Tagging**. Minmin Chen, Alice Zheng, Kilian Q. Weinberger. In*Proceedings of the 30th International Conference on Machine Learning (ICML 2013)*, Atlanta, GA, USA, 2013. [webpage]

**Fast Top-K Similarity Queries Via Matrix Compression**. Yucheng Low and Alice X. Zheng. In*Proceedings of the Twenty-First ACM International Conference on Information and Knowledge Management (CIKM 2012)*, 2012. [pdf]

**Fast Top-K Similarity Queries Via Matrix Compression**. Yucheng Low and Alice X. Zheng.*Microsoft Research Technical Report*, MSR-TR-2012-81, 7 August 2012. [pdf]

**Active Graph Reachability Reduction for Network Security and Software Engineering**. Alice X. Zheng, John Dunagan, Ashish Kapoor. In*Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI)*, 2011. [pdf]

**Diagnosing performance changes by comparing request flows**. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. In*8th USENIX Symposium on Networked Systems Design and Implementation (NSDI)*, 2011. [pdf]

**There's an app for that, but it doesn't work. Diagnosing Mobile Applications in the Wild**. Sharad Agarwal, Ratul Mahajan, Alice Zheng, Victor Bahl. In*ACM HotNets IX*, October 2010. [pdf]

**Practical Performance Models for Complex, Popular Applications: A Feasibility Study**. Eno Thereska, Bjoern Doebel, Alice X. Zheng, Peter Nobel. In*International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS)*, 2010. [pdf]

**A Survey of Statistical Network Models**. Anna Goldenberg, Alice X. Zheng, Stephen E. Fienberg, Edoardo M. Airoldi.*Foundations and Trends in Machine Learning*, 2, 2, pp 129-233, 2009. [access]

**Heat-ray: Combating Identity Snowball Attacks Using Machine Learning, Combinatorial Optimization and Attack Graphs**. John Dunagan, Alice X. Zheng, Daniel R. Simon.*ACM Symposium on Operating Systems Principles (SOSP)*, 2009. [pdf]

**Categorizing and Differencing System Behaviours**. Raja R. Sambasivan, Alice X. Zheng, Eno Thereska, Gregory R. Ganger.*Second Workshop on Hot Topics in Autonomic Computing*. June 15, 2007. Jacksonville, FL. [pdf]

**Modeling the Relative Fitness of Storage**, Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice Zheng, Gregory R. Ganger.*International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS)*. San Diego, CA. June 12-14, 2007. ACM.**Awarded Best Paper.**[pdf]

**A Generative Model for Dynamic Contextual Friendship Networks**, Alice X. Zheng and Anna Goldenberg.*CMU tech report, 2006.*[pdf]

**Exploratory Study of a New Model for Evolving Networks**, Anna Goldenberg and Alice X. Zheng.*Proceedings of the Workshop on Statistical Network Analysis: Models, Issues, and New Directions at ICML-06.*[pdf]

**Statistical Debugging: Simultaneous Isolation of Multiple Bugs**, Alice X. Zheng, Michael I. Jordan, Ben Liblit, Mayur Naik, Alex Aiken.*Proceedings of ICML-06.*[pdf]

**Statistical Software Debugging**, Alice X. Zheng.*Doctoral dissertation*, U.C. Berkeley, 2006. [pdf]

**Efficient Test Selection in Active Diagnosis via Entropy Approximation**, Alice X. Zheng, Irina Rish, Alina Beygelzimer.*Proceedings of UAI-05.*[pdf] [ps]

**Scalable Statistical Bug Isolation**, Ben Liblit, Mayur Naik, Alice X. Zheng, Alex Aiken, Michael I. Jordan.*ACM SIGPLAN 2005 Conference on Programming Language Design and Implementation (PLDI 2005)*. [pdf] [ps]

**Public Deployment of Cooperative Bug Isolation**, Ben Liblit, Mayur Naik, Alice X. Zheng, Alex Aiken, Michael I. Jordan.*Workshop on Remote Analysis and Measurement of Software Systems (RAMSS)*, 2004. [pdf]

**Failure Diagnosis Using Decision Trees**, Mike Chen, Alice X. Zheng, Jim Lloyd, Michael I. Jordan, Eric Brewer.*International Conference on Autonomic Computing (ICAC-04)*, 2004. [pdf] [ps]

**Statistical Debugging of Sampled Programs**, Alice X. Zheng, Michael I. Jordan, Ben Liblit, Alex Aiken.*Advances in Neural Information Processing Systems 16*, 2003. [pdf]

**Sampling User Executions for Bug Isolation**, Ben Liblit, Alex Aiken, Alice X. Zheng, Michael I. Jordan.*Workshop on Remote Analysis and Measurement of Software Systems (RAMSS)*, May 9, 2003. [pdf]

**Bug Isolation via Remote Program Sampling**, Ben Liblit, Alex Aiken, Alice X. Zheng, Michael I. Jordan.*ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation (PLDI 2003)*. [pdf] [ps]

**Learning a Gaussian Process Prior for Automatically Generating Music Playlists**, John C. Platt, Christopher J.C. Burges, Steven Swenson, Christopher Weare, Alice Zheng.*Advances in Neural Information Processing Systems 14*, 2001. [ps] [pdf]

**Link analysis, eigenvectors, and stability**, Andrew Y. Ng, Alice X. Zheng, Michael I. Jordan.*Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01)*, 2001. [ps]

**Stable methods for link analysis**, Andrew Y. Ng, Alice X. Zheng, Michael I. Jordan.*Proceedings of the Twenty-Fourth Annual Internation ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001)*, 2001. [ps]

**Fast Multiple Antenna Differential Decoding**, Kenneth L. Clarkson, Wim Sweldens, Alice Zheng.*IEEE Transactions on Communications*, Vol. 49, Nr. 2, pp. 253-261, 2001. [compressed postscript]

[Bio] -- [Interests] -- [Publications]

Last modified: Wed Jan 21 23:22:48 PST 2015 I had a feeling once about mathematics -- that I saw it all. Depth beyond depth was revealed to me -- the Byss and the Abyss. I saw -- as one might see the transit of Venus or even the Lord Mayor's Show -- a quantity passing through infinity and changing its sign from plus to minus. I saw exactly why it happened and why tergiversation was inevitable -- but it was after dinner and I let it go. -- Winston Churchill