Romain Mathonat

The internet has greatly helped me in my work, so I contribute my share through various open-access publications.

Projects

                
Nebula Station
A cosy space traveling experience, lofi music, genAI, and task organisation.

Accomplishments
Smart TODO list
Lofi radio station
Space travel simulation
Relaxing vibes

Market Technical Analysis Lab
Analysis of financial markets and cryptocurrencies, with backtesting of numerous strategies

Accomplishments
Analysis and automation of trading strategies (technical analysis) for stocks and cryptocurrencies
Backtesting of passive strategies: 4 weathers and variations
Data retrieval via API
Semi-automation algorithms for active portfolio management

VirtualGuessr
Browser-based virtual world exploration game

Accomplishments
Experimenting with an LLM (Claude) for front-end development

VulgaireDev
Publication of scientific and technical popularization articles

Rocket League skill detection
Innovative machine learning algorithm for automatic in-game skill ("figure") detection

Accomplishments
Accuracy of about 80% on an original and difficult problem
Homemade data collection (sniffer)
Various publications and communications

Mirror of Erised
Notion template for personal organization, inspired by the Eisenhower matrix

Accomplishments
Free
Harry Potter inspired design
Many small features, including ordering by importance and mental load relief

Experience

Michelin

Lead Data Engineer

Project Implementation and improvement of mobility data pipelines

Context The mission of the Michelin Mobility Intelligence (MMI) entity is to make road infrastructure safer, higher quality, and more environmentally friendly. To achieve this, mobility data is collected and stored in the data warehouse, then transformed and fed to various machine learning algorithms that provide predictions and analytics to clients.

Missions

Implementation of high-volume data pipelines (~100 billion/month)
Improvement of CI/CD (performance/ease of use)
Modernization of the Python dependency system, overhaul of wheel build and storage
Team training on best practices and tools (TDD, hexagonal architecture, PDM, Ruff, WSL, VS Code)
State of the art (data sharing)
Development of internal performance monitoring library
Dashboarding and dataviz

Technical environment

Snowflake, Snowpark (equivalent to PySpark), SQL, Python, CI/CD, pytest, Microsoft Azure, Databricks, Grafana, Streamlit, GitLab

January 2024 - October 2024 | Lyon, Clermont-Ferrand

Infologic

Data Engineer / Scientist

Project Design and implementation of a data architecture to store and exploit user behavior and server operation data

Context Infologic is an SME that provides an ERP in the agri-food sector. In this context, a lot of data is available, both user behavior data and technical server operation data. This data was underutilized, so it was necessary to create an architecture to store it and then use it for various needs: preventive maintenance, ergonomic problem detection, user tracking, etc.

Missions

Data modeling and implementation of the data warehouse
Training in best practices for data scientists
Securing and installing solutions on machines (devops)
Implementation of data pipelines
Development of an in-house monitoring solution
State of the art on scientific topics (clustering, LLM)
Popularization and communication to the business

Technical environment

Python, SQL, ClickHouse, Elasticsearch, bare metal, notebooks, JupyterHub, Java

October 2020 - December 2023 | Lyon

Atos

Data Engineer

Project Collection and consolidation of urban farm operating data to train plant growth prediction algorithms

Context In collaboration with a startup, FUL, whose business model was to grow plants in controlled environments, which makes it possible, for example, to grow melons in Japan. My mission was to collect operating data from this urban farm and consolidate it in order to train algorithms for predicting plant growth.

Missions

Data collection (ETL)
Data visualization
Implementation of an electricity consumption prediction model
Participation in steering committees
Implementation of a "human in the loop" model for improving plant growth recipes

Technical environment

Python, SQL, Linux, pytest, Git, SQLite, Grafana

September 2017 - August 2020 | Lyon

Liris

PhD in Data Mining

Project Research and invention of interpretable rule discovery algorithms for sequential data, applied to e-sports

Context This thesis focused on creating new algorithms to perform data mining on sequential and temporal data. I applied this work to input sequences and 3D information from the game "Rocket League". This led to various publications, as well as the ability to detect user behaviors that were previously undetectable, which particularly interested various players in the field (startups, Ubisoft).

Missions

Scientific state of the art
Self-training
Creation of new algorithms
Communication and popularization of my work
Teaching

Technical environment

Python, SQL, Linux, Git, pytest, notebooks

September 2017 - August 2020 | Lyon

Education

PhD in Data Mining

Algorithms for automatic discovery of predictive rules for sequential data, application to e-sports and analysis of video game data (Rocket League)

Computer Science Engineer

INSA Lyon

Contact

[email protected]

github.com/romathonat

linkedin.com/in/romain-mathonat-478910b5/