Periklis Andritsos







Refereed Conference and Journal Publications

  • Making Open Data Transparent: Data Discovery on Open Data. [link]
    In IEEE Data Engineering Bulletin, 41(2), pp. 59-70, June 2018.
    Renée J. Miller, Fatemeh Nargesian, Erkang Zhu, Christina Christodoulakis, Ken Q. Pu, Periklis Andritsos
  • CJM-ab: Abstracting Customer Journey Maps using Process Mining. [link]
    In International Conference on Advanced Information Systems Engineering (CAiSE), 2018.
    Gaël Bernard, Periklis Andritsos
  • The Corporate Control Network Before and After the Crash: Evolution and Sustainability Implications. [link]
    In International Conference on ICT for Sustainability (ICT4S), [poster], 2018.
    Maria-Angela Ferrario, Periklis Andritsos, Niel Chah, Thais Bittencourt
  • An Efficient Type-agnostic Approach for Finding Sub-sequences in Data. [link] BEST PAPER AWARD
    In IEEE International Conference on Data Science and Systems (DSS), 2017.
    Bertil Chapuis, Benoît Garbinato, Periklis Andritsos
  • Distributed clustering of categorical data using the information bottleneck framework. [link]
    In Information Systems, Volume 72, pp. 161-178, December 2017.
    Natasa Tagasovska, Periklis Andritsos
  • CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics. [demo] [link]
    In International Conference on Business Process Management (BPM), 2017.
    Gaël Bernard, Periklis Andritsos
  • A Process Mining Based Model for Customer Journey Mapping. [link]
    In International Conference on Advanced Information Systems Engineering (CAiSE), pp. 49-56, 2017.
    Gaël Bernard, Periklis Andritsos
  • Profiling Billions of Triples: The Case of Freebase Data Dumps. [Exposition]
    In International Conference on Computer Science and Software Engineering (CASCON), 2017.
    Niel Chah, Periklis Andritsos
  • Reconstructing databases: instance-based structure discovery using reconstructability analysis. [link]
    In International Conference on Computer Science and Software Engineering (CASCON), pp. 279-284, 2017.
    Periklis Andritsos
  • Discovering Customer Journey Maps using a Mixture of Markov Models. [link]
    In International Symposium on Data-driven Process Discovery and Analysis (SIMPDA), pp. 3-7, 2017.
    Matthieu Harbich, Gaël Bernard, Pietro Berkes, Benoît Garbinato, Periklis Andritsos
  • Categorical Data Clustering. [link]
    In Encyclopedia of Machine Learning and Data Mining, pp. 188-193, 2017.
    Periklis Andritsos, Panayiotis Tsaparas
  • Throughput: A key performance measure of Content-Defined Chunking Algorithms. [link]
    In International Conference on Distributed Computing Systems (ICDCS), 2016. 6th International Workshop on Big Data and Cloud Performance.
    Bertil Chapuis, Benoît Garbinato, Periklis Andritsos
  • Data Driven Discovery of Attribute Dictionaries. [link]
    In LNCS Transactions on Computational Collective Intelligence (TCCI), Vol. 21, pp. 69-96, 2016.
    Fei Chiang, Periklis Andritsos, Renée J. Miller
  • When Sales Meet Process Mining: A Scientific Approach to Sales Process and Performance Management. [link]
    In International Conference on Information Systems (ICIS), 2016.
    Gaël Bernard, Thomas Boillat, Christine Legner, Periklis Andritsos
  • INDREX: In-database relation extraction. [link]
    In Information Systems Journal, Volume 53, pp. 124-144, October-November 2015.
    Torsten Kilias, Alexander Löser, Periklis Andritsos
  • Efficient itinerary planning with category constraints. [link]
    In ACM/IEEE 22nd International Conference on Advances in Geographic Information Systems (SIGSPATIAL), pp. 203-212, 2014.
    Paolo Bolzoni, Sven Helmer, Kevin Wellenzohn, Johann Gamper, Periklis Andritsos
  • Constructing adaptive configuration dialogs using crowd data. [link]
    In ACM/IEEE International Conference on Automated Software Engineering (ASE), pp. 485-490, 2014.
    Saeideh Hamidi, Periklis Andritsos, Sotirios Liaskos
  • Detecting correlated columns in relational databases with mixed data types. [link]
    In ACM International Conference on Scientific and Statistical Database Management (SSDBM), pp. 30:1-30:12, 2014.
    Hoang Vu Nguyen, Emmanuel Müller, Periklis Andritsos, Klemens Böhm
  • Composite Key Generation on a Shared-Nothing Architecture. [link]
    In Performance Characterization and Benchmarking. Traditional to Big Data - 6th TPC Technology Conference (TPCTC), pp. 188-203, 2014.
    Marie Hoffmann, Alexander Alexandrov, Periklis Andritsos, Juan Soto, Volker Markl
  • Level-Wise Exploration of Linked and Big Data Guided by Controlled Vocabularies and Folksonomies. [link]
    In Advances in Classification Research Online, Volume 24, Number 1, January 2014.
    Periklis Andritsos, Patrick Keilty
  • Case-based reasoning For Bio-Medical Informatics and Medicine. [link]
    In Springer Handbook of Bio-/Neuro-Informatics, pp. 207-221, 2014.
    Periklis Andritsos, Igor Jurisica, Janice Glasgow
  • INDREX: In-Database Distributional Relation Extraction. [link]
    In 6th ACM International Workshop on Data Warehousing and OLAP (DOLAP), 2013 (part of CIKM 2013).
    Torsten Kilias, Alexander Löser, Periklis Andritsos.
  • Provenance for Data Mining. [link]
    In 5th USENIX Workshop on the Theory and Practice of Provenance (TaPP), 2013.
    Boris Glavic, Javed Siddique, Periklis Andritsos, Renée J. Miller.
  • Individualized Hiking Time Estimation. [link]
    In 23rd International Workshop on Database and Expert Systems Applications (DEXA), 2012.
    Arthur Pitman, Markus Zanker, Johann Gamper, Periklis Andritsos.
  • AutoDict: Automated Dictionary Discovery. [link]
    In 28th International Conference on Data Engineering (ICDE), 2012.
    Fei Chiang, Erkang Zhu, Periklis Andritsos, Renée J. Miller.
  • Categorical Data Clustering. [link]
    In Encyclopedia of Machine Learning, Part 4, pp. 154-159, 2010.
    Periklis Andritsos, Panayiotis Tsaparas.
  • Ranking of Evolving Stories Through Meta-Aggregation. [pdf-file]
    In 19th International Conference on Information and Knowledge Management (CIKM), 2010.
    Juozas Gordevicius, Francisco J. Estrada, Hyun Chul Lee, Periklis Andritsos, Johann Gamper.
  • Entity Data Management in OKKAM. [pdf-file]
    In 2nd International Workshop on Semantic Web Architectures For Enterprises (SWAE), 2008.
    Themis Palpanas, Junaid Chaudhry, Periklis Andritsos, Yannis Velegrakis.
  • Automating the Generation of Semantic Annotation Schema Using a Clustering Technique. [pdf-file]
    In 13th International Conference on Applications of Natural Language to Information Systems (NLDB), 2008.
    Vitor Souza, Nicola Zeni, Nadzeya Kiyavitskaya, Periklis Andritsos, Luisa Mich, John Mylopoulos.
  • Exploration of Discovered Process Views in Process Spaceship. [pdf-file]
    In Proceedings of the 6th International Conference on Service-Oriented Computing (ICSOC), 2008.
    Hamid R. Motahari Nezhad, Boualem Benatallah, Fabio Casati, Regis Saint-Paul, Periklis Andritsos.
  • Entity Lifecycle Management for OKKAM. [pdf-file]
    In 1st International Workshop on Identity and Reference on the Semantic Web (IRSW), 2008.
    Junaid Chaudhry, Themis Palpanas, Periklis Andritsos, Antonio Mana.
  • Overview and Semantic Issues of Text Mining. [pdf-file]
    In SIGMOD Record, 36(3), pp. 23-34, September 2007.
    Anna Stavrianou, Periklis Andritsos, Nicolas Nicoloyannis.
  • A Lightweight Tree Structure to Model User Preferences. [pdf-file]
    In 10th DELOS Thematic Workshop on Personalized Access, Profile Management, and Context Awareness in Digital Libraries (PersDL), 2007.
    Hamza H. Syed, Periklis Andritsos.
  • Evaluating Value Weighting Schemes in the Clustering of Categorical Data. [pdf-file]
    In 1st Workshop on Machine Learning and Intelligent Optimization (LION), 2007.
    Periklis Andritsos, Vassilios Tzerpos.
  • Clean Answers over Dirty Databases. [pdf-file]
    In 22nd International Conference on Data Engineering (ICDE), 2006.
    Periklis Andritsos, Ariel Fuxman, Renée J. Miller.
  • Reducing Build Time Through Precompilations for Evolving Large Software. [pdf-file]
    In 21st International Conference on Software Maintenance (ICSM), September 2005.
    Yijun Yu, Homayoun Dayani-Fard, John Mylopoulos, Periklis Andritsos.
  • Improving the Build Architecture of Legacy C/C++ Software Systems. [pdf-file]
    In International Conference on Fundamental Approaches to Software Engineering (FASE) [part of ETAPS], April 2005.
    Homy Dayani-Fard, Yijun Yu, John Mylopoulos, Periklis Andritsos.
  • Information-Theoretic Software Clustering. [pdf-file]
    In IEEE Transactions on Software Engineering (TSE), Volume 31, Number 2, February 2005.
    Periklis Andritsos, Vassilios Tzerpos.
  • Kanata: Adaptation and Evolution in Data Sharing Systems. [pdf-file]
    In SIGMOD Record, 33(4), pp. 32-37, December 2004.
    Periklis Andritsos, Ariel Fuxman, Anastasios Kementsietsidis, Renée J. Miller, Yannis Velegrakis.
  • Information-Theoretic Tools for Mining Database Structure from Large Data Sets. [pdf-file]
    In 23rd ACM SIGMOD International Conference on the Management of Data, June 2004.
    Periklis Andritsos, Renée J. Miller, Panayiotis Tsaparas.
  • LIMBO: Scalable Clustering of Categorical Data. [pdf-file]
    In 9th International Conference on Extending DataBase Technology (EDBT), March 2004.
    Periklis Andritsos, Panayiotis Tsaparas, Renée J. Miller, Kenneth C. Sevcik.
  • Software Clustering Based on Information Loss Minimization. [pdf-file]
    In 10th Working Conference on Reverse Engineering, Victoria, B.C., Canada, November 2003.
    Periklis Andritsos, Vassilios Tzerpos.
  • On Schema Discovery. [ps-file]
    In IEEE Data Engineering Bulletin, 26(3), pp. 41-47, September 2003.
    Renée J. Miller, Periklis Andritsos.
  • Clustering Categorical Data Based on Information Loss Minimization.
    In 2nd Hellenic Data Management Symposium (HDMS'03), Athens, Greece, September 2003.
    Periklis Andritsos, Panayiotis Tsaparas, Renée J. Miller, Kenneth C. Sevcik.
  • Using Categorical Clustering in Schema Discovery. [pdf-file]
    In IJCAI Workshop on Information Integration on the Web, pp. 211, Acapulco, Mexico, August 2003.
    Periklis Andritsos, Renée J. Miller.
  • Schema Management. [ps-file]
    In IEEE Data Engineering Bulletin, 25(3), pp. 32-38, September 2002.
    Periklis Andritsos, Ron Fagin, Ariel Fuxman, Laura M. Haas, Mauricio A. Hernandez, C. Ho, Anastasios Kementsietsidis, Renée J. Miller, Felix Naumann, Lucian Popa, Yannis Velegrakis, Charlotte Vilarem, L. Yan.

  • Reverse Engineering Meets Data Analysis. [pdf-file]
    In International Workshop on Program Comprehension (IWPC), pp. 157-166, Toronto, ON, June 2001.
    Periklis Andritsos, Renée J. Miller.


  • LIMBO: A Scalable Categorical Data Clustering Algorithm.
    In 13th Annual IBM Centers for Advanced Studies Conference (CASCON), October 2003.
    Periklis Andritsos, Panayiotis Tsaparas, Renée J. Miller, Kenneth C. Sevcik.

  • Clustering Categorical Data Based on Information Maximization.
    In 4th Annual MITACS Conference, May 2003.
    Periklis Andritsos, Panayiotis Tsaparas, Renée J. Miller, Kenneth C. Sevcik.

Technical Reports

  • LIMBO: A Scalable Algorithm to Cluster Categorical Data. [pdf-file]
    Tech. Report CSRG-467, U. of Toronto, Dep. of Computer Science, July 2003.
    Periklis Andritsos, Panayiotis Tsaparas, Renée J. Miller, Kenneth C. Sevcik.

  • Data Clustering Techniques. [pdf-file]
    Tech. Report CSRG-443, U. of Toronto, Dep. of Computer Science, March 2002 (Qualifying Oral Exam).
    Periklis Andritsos.


  • Scalable Clustering of Categorical Data And Applications. [pdf-file]
    PhD Thesis, U. of Toronto, Dep. of Computer Science, September 2004.
  • Program Reverse Engineering Through On-Line Analytical Processing. [pdf-file]
    Master's Thesis, U. of Toronto, Dep. of Computer Science, July 2000. Tech. Report CSRG-415.
  • Development Of OLAP Queries Tool. [pdf-file (abstract in Greek)]
    Diploma Thesis, Nat. U. of Athens, Dep. of Electrical and Computer Engineering, July 1998. Tech. Report DIPL-1998-06.