Erkang (Eric) Zhu

[GitHub] [Email]

I am a PhD student in Computer Science at the University of Toronto. I am a member of the Data Curation Lab and my supervisor is Renée J. Miller.

My research focuses on algorithms and systems that enable efficient discovery of, and navigation over, massive numbers of decentralized datasets such as Open Data and data on the Web. I published a paper in VLDB 2016 on Internet-Scale Dataset Search, and the code is open source.

I am also interested in randomized algorithms and their applications. I developed a popular open-source Python library (datasketch) that implements several such algorithms, including MinHash and locality-sensitive hashing (LSH).

In the lab, I am collaborating with Ken Pu and Fatemeh Nargesian on several projects. In the summer of 2016, I was fortunate to collaborate with Surajit Chaudhuri and Yeye He of the DMX Group at Microsoft Research in Redmond, WA. In 2014, I interned with the DynamoDB team at Amazon Web Services in Seattle.