Open Theses

1- Will pre-indexing improve deep code search?

2- Quality-driven union table search [MSc]

3- Meta-learning fair ML pipeline components [MSc]

4- A unified data representation for few-shot learning [MSc]

5- Predicting research trends in data science based on the arXiv repository [MSc]

6- Column Splitter with Record-Matching [BSc]

Ongoing Theses


  • Effectively Sampling Validation Sets (B.Sc.)
    • Mohamed Mahdi Kanoun
  • ML Validation Set Mining (B.Sc.)
    • Johannes Waldeck
  • Validation Set Selection in Machine Learning (B.Sc.)
    • Achraf Bahloul   
  • Extracting unbiased text from large text corpora (M.Sc.)
    • Christoph Becker   
  • Investigating and improving variable names in data science projects with data mining (M.Sc.)
    • Huu Kim Nguyen   
  • Leveraging Semi-Structured Web Pages for Semantic Information (M.Sc.)
    • Fabian Johannsen

Finished Theses


  • Deklarative sukzessive Halbierung [Declarative Successive Halving] (M.Sc.)
    • Ramzi Mezlini
  • Performance Benchmarking of Database Management Systems (B.Sc.)
    • Erik Schriefer

  • Improving Label Propagation in the Data Cleaning System Raha (B.Sc.)
    • Maximilian Siebenthaler
  • Declarative successive halving (M.Sc.)
    • Pinliang Li
  • Embedding Data Transformations in AutoML (B.Sc.)
    • Henrik Tipp
  • Opinion mining in social media data based on neural networks to predict bitcoin prices (B.Sc.)
    • Marc Speckmann
  • Analysing the Influence of social media Influencers on the Bitcoin Price (B.Sc.)
    • Omar Allouni
  • Analyzing the relation between social media and Bitcoin’s price variation (B.Sc.)
    • Ahmed Malek Ghanmi

  • From mining Naming Conventions in Data Science Projects to suggesting Variable Names (M.Sc.)
    • Philip Ossenkopp

  • Scalable Error Detection (B.Sc.)
    • Faical Aridal

  • Feature Analysis for Agglomerative Clustering (B.Sc.)
    • Malte Fabian Kuhlmann
  • Detecting table headers in heterogeneous tables (B.Sc.)
    • Daniel Ritter
  • Instrumentierung von Datenreinigung mit AutoML [Instrumenting Data Cleaning with AutoML] (B.Sc.)
    • Yazan Alkhatib
  • Mining social media to discover the factors that affect bitcoin price (B.Sc.)
    • Jonathan Friebe
  • Multi-attribute join search with map-reduce (B.Sc.)
    • Justin Zheng
  • Efficient join discovery from large data lakes (B.Sc.)
    • Akram Chorfi
  • Multi-attribute join search with map-reduce (B.Sc.)
    • Meike Liedtke
  • Interleaving Data Cleaning and AutoML (B.Sc.)
    • Jingwen Ye