## Alice Zhengalicez at gmail dot com |

I specialize in machine learning. Right now, I'm working on a book on feature engineering. Before ~~spending my days staring into the distance and twiddling my thumbs~~ my sabbatical, I worked at Dato (formerly known as GraphLab), where I was head of the Toolkits team developing machine learning algorithms. I also dabbled in blogs, talks, tutorials. Prior to Dato/GraphLab, I was a researcher in the Machine Learning Group at Microsoft Research, Redmond. Before joining Microsoft, I was a postdoc at Carnegie Mellon University's Auton Lab and the Parallel Data Lab. I received B.A.s in Mathematics and Computer Science and a Ph.D. in Electrical Engineering from U. C. Berkeley.

When not working, I meditate daily and practice yoga occasionally. I'm a co-organizer of the Seattle Data/Analytics/Machine Learning MeetUp.

I am interested in making machine learning easy to use. Machine learning applications always require close collaborations between domain experts who understand the data and machine learning experts who understand the algorithms. The problem with this setup is that it is easy to scale up the size of the data, but much harder to scale up the number of experts. My research focuses on easing the dependence on expertise by making learning algorithms more automated, their outputs more interpretable, and the labeling tasks simpler. In the past, I have worked on using machine learning to diagnose ailing computer systems and software. Some of the lessons learned from that domain continues to drive my research today.

**Mastering Feature Engineering**. O'Reilly Media. Early release 2016 [website].

**Evaluating Machine Learning Models—A Beginner's Guide to Key Concepts and Pitfalls**. O'Reilly Media. Sep 2015. [website]

**The How and Why of Feature Engineering**. Strata + Hadoop World, San Jose, CA. March 2016. [slides].

**Evaluating Machine Learning Models—A Beginner's Guide**. Seattle Data Science MeetUp. Sep, 2015. [slides].

**Understanding Feature Space in Machine Learning**. [slides] [video] (Rich Data Summit, SF, CA. 2015).

**The Challenges of Bringing Machine Learning to the Masses**. Alice Zheng and Sethu Raman. NIPS Workshop on Software Engineering for Machine Learning. Montreal, Quebec, Canada. 2014. [slides].

**What the #*($! is Big Data?—A Holistic View of Data and Algorithms**. Strata Conference, Santa Clara, CA. Feb 2014. [slides] [video].

**Collaborative Denoising Auto-Encoders for Top-N Recommender Systems**. Yao Wu, Christopher DuBois, Alice X. Zheng, Martin Ester. In*Proceedings of the 9th ACM International Conference on Web Search and Data Mining (WSDM 2016)*, San Francisco, 2016. [pdf]

**Gradient Boosted Feature Selection**. Zhixiang (Eddie) Xu, Gao Huang, Kilian Q.Weinberger, Alice X. Zheng. In*Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)*, New York, 2014. [pdf]

**Lazy Paired Hyper-Parameter Tuning**. Alice X. Zheng and Mikhail Bilenko. In*Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013)*, Beijing, China, 2013. [webpage] [slides]

**Fast Image Tagging**. Minmin Chen, Alice Zheng, Kilian Q. Weinberger. In*Proceedings of the 30th International Conference on Machine Learning (ICML 2013)*, Atlanta, GA, USA, 2013. [webpage]

**Fast Top-K Similarity Queries Via Matrix Compression**. Yucheng Low and Alice X. Zheng. In*Proceedings of the Twenty-First ACM International Conference on Information and Knowledge Management (CIKM 2012)*, 2012. [CIKM pdf] (short), [Microsoft TR pdf] (long).

**Active Graph Reachability Reduction for Network Security and Software Engineering**. Alice X. Zheng, John Dunagan, Ashish Kapoor. In*Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI)*, 2011. [pdf]

**Diagnosing performance changes by comparing request flows**. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. In*8th USENIX Symposium on Networked Systems Design and Implementation (NSDI)*, 2011. [pdf]

**There's an app for that, but it doesn't work. Diagnosing Mobile Applications in the Wild**. Sharad Agarwal, Ratul Mahajan, Alice Zheng, Victor Bahl. In*ACM HotNets IX*, October 2010. [pdf]

**Practical Performance Models for Complex, Popular Applications: A Feasibility Study**. Eno Thereska, Bjoern Doebel, Alice X. Zheng, Peter Nobel. In*International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS)*, 2010. [pdf]

**A Survey of Statistical Network Models**. Anna Goldenberg, Alice X. Zheng, Stephen E. Fienberg, Edoardo M. Airoldi.*Foundations and Trends in Machine Learning*, 2, 2, pp 129-233, 2009. [webpage]

**Heat-ray: Combating Identity Snowball Attacks Using Machine Learning, Combinatorial Optimization and Attack Graphs**. John Dunagan, Alice X. Zheng, Daniel R. Simon.*ACM Symposium on Operating Systems Principles (SOSP)*, 2009. [pdf]

**Categorizing and Differencing System Behaviours**. Raja R. Sambasivan, Alice X. Zheng, Eno Thereska, Gregory R. Ganger.*Second Workshop on Hot Topics in Autonomic Computing*. June 15, 2007. Jacksonville, FL. [pdf]

**Modeling the Relative Fitness of Storage**, Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice Zheng, Gregory R. Ganger.*International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS)*. San Diego, CA. June 12-14, 2007. ACM.**Awarded Best Paper.**[pdf]

**A Generative Model for Dynamic Contextual Friendship Networks**, Alice X. Zheng and Anna Goldenberg.*CMU tech report, 2006.*[pdf]

**Exploratory Study of a New Model for Evolving Networks**, Anna Goldenberg and Alice X. Zheng.*Proceedings of the Workshop on Statistical Network Analysis: Models, Issues, and New Directions at ICML-06.*[pdf]

**Statistical Debugging: Simultaneous Isolation of Multiple Bugs**, Alice X. Zheng, Michael I. Jordan, Ben Liblit, Mayur Naik, Alex Aiken.*Proceedings of ICML-06.*[pdf]

**Statistical Software Debugging**, Alice X. Zheng.*Doctoral dissertation*, U.C. Berkeley, 2006. [pdf]

**Efficient Test Selection in Active Diagnosis via Entropy Approximation**, Alice X. Zheng, Irina Rish, Alina Beygelzimer.*Proceedings of UAI-05.*[pdf] [ps]

**Scalable Statistical Bug Isolation**, Ben Liblit, Mayur Naik, Alice X. Zheng, Alex Aiken, Michael I. Jordan.*ACM SIGPLAN 2005 Conference on Programming Language Design and Implementation (PLDI 2005)*. [pdf] [ps]

**Public Deployment of Cooperative Bug Isolation**, Ben Liblit, Mayur Naik, Alice X. Zheng, Alex Aiken, Michael I. Jordan.*Workshop on Remote Analysis and Measurement of Software Systems (RAMSS)*, 2004. [pdf]

**Failure Diagnosis Using Decision Trees**, Mike Chen, Alice X. Zheng, Jim Lloyd, Michael I. Jordan, Eric Brewer.*International Conference on Autonomic Computing (ICAC-04)*, 2004. [pdf] [ps]

**Statistical Debugging of Sampled Programs**, Alice X. Zheng, Michael I. Jordan, Ben Liblit, Alex Aiken.*Advances in Neural Information Processing Systems 16*, 2003. [pdf]

**Sampling User Executions for Bug Isolation**, Ben Liblit, Alex Aiken, Alice X. Zheng, Michael I. Jordan.*Workshop on Remote Analysis and Measurement of Software Systems (RAMSS)*, May 9, 2003. [pdf]

**Bug Isolation via Remote Program Sampling**, Ben Liblit, Alex Aiken, Alice X. Zheng, Michael I. Jordan.*ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation (PLDI 2003)*. [pdf] [ps]

**Learning a Gaussian Process Prior for Automatically Generating Music Playlists**, John C. Platt, Christopher J.C. Burges, Steven Swenson, Christopher Weare, Alice Zheng.*Advances in Neural Information Processing Systems 14*, 2001. [ps] [pdf]

**Link analysis, eigenvectors, and stability**, Andrew Y. Ng, Alice X. Zheng, Michael I. Jordan.*Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01)*, 2001. [ps]

**Stable methods for link analysis**, Andrew Y. Ng, Alice X. Zheng, Michael I. Jordan.*Proceedings of the Twenty-Fourth Annual Internation ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001)*, 2001. [ps]

**Fast Multiple Antenna Differential Decoding**, Kenneth L. Clarkson, Wim Sweldens, Alice Zheng.*IEEE Transactions on Communications*, Vol. 49, Nr. 2, pp. 253-261, 2001. [compressed postscript]

[Bio] -- [Interests] -- [Books] -- [Talks] -- [Papers]

Tue, Jul 12, 2016 11:55:46 PM I had a feeling once about mathematics -- that I saw it all. Depth beyond depth was revealed to me -- the Byss and the Abyss. I saw -- as one might see the transit of Venus or even the Lord Mayor's Show -- a quantity passing through infinity and changing its sign from plus to minus. I saw exactly why it happened and why tergiversation was inevitable -- but it was after dinner and I let it go. -- Winston Churchill