Mahdi Esmailoghli

CURRICULUM VITAE

  • About me

    I am a Ph.D. student in the Databases and Information Systems research group under the supervision of Prof. Ziawasch Abedjan. I received my M.Sc. degree from Amirkabir University of Technology (Tehran Polytechnic). For my master's thesis, I worked on distributed and unsupervised anomaly detection and explanation systems in the healthcare domain.

    My current research interest lies in data discovery in data lakes. In particular, I develop systems that efficiently explore large data lakes to enrich the data at hand so that more effective machine learning (ML) models can be trained. To this end, I have developed indexes and algorithms to efficiently discover relevant tables from a data lake corpus of hundreds of millions of tables.

    Contact:

    esmailoghi ( at ) dbs ( dot ) uni-hannover ( dot ) de

    Website

  • Education

    Leibniz Universität Hannover
    Research associate in DataBase Systems group (DBS) under the supervision of Prof. Dr. Ziawasch Abedjan (2020 - Present)

    Technische Universität Berlin
    Research associate in Big Data Management (BigDaMa) group under the supervision of Prof. Dr. Ziawasch Abedjan (2018 - 2020)

    Amirkabir University of Technology (Tehran Polytechnic)
    Master's Degree, Computer Software Engineering · (2015 - 2017)

    Urmia University
    Bachelor’s Degree, Computer Software Engineering · (2010 - 2014)

  • Awards & honors

    Distinguished Referee at CIKM, 2023
    Conference on Information and Knowledge Management (CIKM)
    2023, Birmingham, UK

    GI Data Science Challenges First Prize 2023
    Datenbanksysteme für Business, Technologie und Web (BTW) 2023,
    Dresden, Germany

    GI Data Science Challenges First Prize 2019
    Datenbanksysteme für Business, Technologie und Web (BTW) 2019,
    Rostock, Germany

    National university entrance exam exemption award (for the M.Sc. degree) for top ranks in the 2014 Computer Olympiads

    Top-ranked student by GPA in the B.Sc. program
    Urmia University, EECS department, 2014

    Ranked #4 in the 19th National Computer Olympiad of Iran
    September 2014

    Ranked #1 in the semifinal of the 19th National Computer Olympiad of Iran
    June 2014

    Ranked #10 in the 18th National Computer Olympiad of Iran
    August 2013

    Ranked #2 in the semifinal of the 18th National Computer Olympiad of Iran
    June 2013

  • Publications

    Blend: A Unified Data Discovery System
    Under Submission, link: https://arxiv.org/pdf/2310.02656.pdf

    Demonstrating MATE and COCOA for Data Discovery
    International Conference on Management of Data (SIGMOD) 2023, Seattle, USA

    Duplicate Table Discovery with Xash
    Datenbanksysteme für Business, Technologie und Web (BTW) 2023, Dresden, Germany

    MATE: multi-attribute table extraction
    Very Large Data Bases (VLDB) 2022, Sydney, Australia

    COCOA: COrrelation COefficient-Aware Data Augmentation
    International Conference on Extending Database Technology (EDBT) 2021, Nicosia, Cyprus, 331-336

    Combining Programming-by-Example with Transformation Discovery from large Databases
    Datenbanksysteme für Business, Technologie und Web (BTW) 2021

    Data Science für alle: Grundlagen der Datenprogrammierung: Ein Data-Science-Kurs für alle Studierenden der TU Berlin
    Informatik Spektrum 43, 2020, 129-136

    CAFE: Constraint-Aware Feature Extraction from Large Databases
    The Conference on Innovative Data Systems Research (CIDR) 2020, Amsterdam, Netherlands

    Particulate matter matters—the data science challenge@ BTW 2019
    Datenbank-Spektrum 19, 165-182

    Explanation of air pollution using external data sources
    BTW 2019–Workshopband

    Design of a Driver Assistant System Based on Vehicular Communications Using Fuzzy Logic
    Quarterly Journal of Transportation Engineering 7 (3), 385-404

  • Teaching

    Leibniz Universität Hannover:

    • Big-Data Technologies WS 2023
    • Data Science Foundation SS 2023
    • Big-Data Technologies WS 2022
    • Data Science Foundation SS 2022
    • Advanced Topics in Database Systems WS 2021
    • Advanced Topics in Database Systems SS 2021

    TU Berlin:

    • Data Science Application SS 2020
    • Data Science 1: Essentials of Data Programming WS 2019
    • Data Science 1: Essentials of Data Programming SS 2019
    • Data Science Application SS 2019
    • Data Science Application WS 2018

    Amirkabir University:

    • Data Intensive Computing, TA for Prof. Dr. Amir Payberah, associate professor at KTH Royal Institute of Technology, Stockholm, Sweden