Romain Mathonat

The internet has helped me greatly in my work, so I give back through various open-access publications. I'm interested in everything related to tech and data (engineering and science).

Projects

Market Technical Analysis Lab

Analysis of financial markets and cryptocurrencies, backtesting of numerous strategies

Accomplishments
  • Analysis and automation of trading strategies (technical analysis) for stocks and cryptocurrencies
  • Backtesting of passive strategies: 4 weathers and variations
  • Data retrieval via API
  • Semi-automation algorithms for active portfolio management



VirtualGuessr

Browser-based virtual world exploration game

Accomplishments
  • Experimenting with an LLM (Claude) for front-end development



VulgaireDev

Popular science and technology articles

Rocket League skill detection

Innovative machine learning algorithm for automatic in-game skill ("figure") detection

Accomplishments
  • Accuracy of about 80% for an original and difficult problem
  • Custom-built data collection tool (sniffer)
  • Various publications and communications



Mirror of Erised

Notion template for personal organization, inspired by the Eisenhower matrix

Accomplishments
  • Free
  • Harry Potter inspired design
  • Many small features, including ordering by importance and mental load relief



Experience

Lead Data Engineer

Project: Implementation and improvement of mobility data pipelines

Context: The mission of the Michelin Mobility Intelligence (MMI) entity is to make road infrastructure safer, higher quality, and more environmentally friendly. To achieve this, mobility data is collected and stored in the data warehouse, then transformed and used by various machine learning algorithms to provide predictions and analytics to clients.

Missions

  • Implementation of high-volume data pipelines (~100 billion/month)
  • Improvement of CI/CD (performance/ease of use)
  • Modernization of the Python dependency system, overhaul of wheel storage and build
  • Team training on best practices and tools (TDD, hexagonal architecture, PDM, Ruff, WSL, VS Code)
  • State-of-the-art review (data sharing)
  • Development of an internal performance monitoring library
  • Dashboarding and dataviz

Technical environment

Snowflake, Snowpark (equivalent to PySpark), SQL, Python, CI/CD, pytest, Microsoft Azure, Databricks, Grafana, Streamlit, GitLab

January 2024 - October 2024 | Lyon, Clermont-Ferrand

Data Engineer / Scientist

Project: Implementation and improvement of mobility data pipelines

Context: Infologic is an SME that provides an ERP for the agri-food sector. A lot of data is available in this setting, covering both user behavior and technical server operations. This data was underutilized, so an architecture was needed to store it and then use it for various needs: preventive maintenance, detection of ergonomic problems, user tracking, etc.

Missions

  • Data modeling and implementation of the data warehouse
  • Training data scientists in best practices
  • Securing and installing solutions on machines (DevOps)
  • Implementation of data pipelines
  • Development of an in-house monitoring solution
  • State-of-the-art reviews on scientific topics (clustering, LLMs)
  • Explaining and communicating results to the business

Technical environment

Python, SQL, ClickHouse, Elasticsearch, bare metal, notebooks, JupyterHub, Java

October 2020 - December 2023 | Lyon

Data Engineer

Project: Implementation and improvement of mobility data pipelines

Context: This work was carried out in collaboration with a startup, FUL. Its business model was to grow plants in controlled environments, which makes it possible to grow melons in Japan, for example. My mission was to collect operating data from this urban farm and consolidate it in order to train algorithms that predict plant growth.

Missions

  • Data collection (ETL)
  • Data visualization
  • Implementation of an electricity consumption prediction model
  • Participation in steering committees
  • Implementation of a human-in-the-loop model for improving plant growth recipes

Technical environment

Python, SQL, Linux, pytest, Git, SQLite, Grafana

September 2017 - August 2020 | Lyon

PhD in Data Mining

Project: Research on and invention of interpretable rule discovery algorithms for sequential data, applied to e-sports

Context: This thesis focused on creating new algorithms for mining sequential and temporal data. I applied this work to input sequences and 3D information from the game "Rocket League". This led to various publications and made it possible to detect user behaviors that were previously undetectable, which drew particular interest from several companies in the field (startups, Ubisoft).

Missions

  • Scientific state of the art
  • Self-training
  • Creation of new algorithms
  • Communication and popularization of my work
  • Teaching

Technical environment

Python, SQL, Linux, Git, pytest, notebooks

September 2017 - August 2020 | Lyon

Education

PhD in Data Mining

Algorithms for automatic discovery of predictive rules for sequential data, application to e-sports and analysis of video game data (Rocket League)
