Thanks for taking the time to check in!

What's this website: I try to give here a personal sense of who I am, both as an individual and as an engineer.

I am an AI research engineer at the Yale-LiGHT research lab (Laboratory for Intelligent Global Health & Humanitarian Response Technologies) which is shared between Yale and EPFL, Switzerland, where I am based. Most of my professional experience is in machine learning research.
I graduated with an M.Sc. in Data Science, and before any technical knowledge, my studies taught me how to learn efficiently and imparted me a delight for learning.

I first think of myself as curious and interested. Accordingly, I love interdisciplinary projects - as you may notice in my experiences: medical diagnosis, digital education, financial fraud detection - for the opportunity to dive into a new discipline.
My peers have sometimes described me as thoughtful, dedicated and sweet.

Experience

AI Research Engineer - EPFL & Yale

2024

I am now working as a research engineer in a machine learning lab shared between EPFL and Yale. I work on designing and implementing the no-code platform DISCO for federated and decentralized machine learning in collaboration with ICRC and WHO. The platform allows training and using state-of-the-art models in low computational resource settings, via mobiles and personal computers. DISCO also enables multiple entities to leverage confidential data to train models collaboratively without sharing their data between each other.

Concurrently, I am working in collaboration with the privacy and cryptography compagny Tune Insight on privacy-preserving training of Large Language Models. The joint project aims at navigating the fine line between training the most helpful LLMs and preventing privacy leakage and sensitive data memorization during training.

EPFL - M.Sc. in Data Science

2020-2023

5.8/6

EPFL (Swiss Federal Institute of Technology, Lausanne) is a leading engineering school and is consistently ranked within the world top 10 universities in computer science.



University of California, Berkeley

- Spring 2023 | Master's Thesis


Designed a new computer vision method to analyze error patterns in clinician misdiagnoses of skin cancer.

Learn More

Oracle Labs, Zürich

- Fall 2022 | Internship


Implemented a novel financial fraud detection method through graph neural networks. The method flags suspicious networks of transaction and provides interpretations for its decisions.

Learn More

ML for Education Lab

- 2020 - 2021 | Research Scholar


Funded by a research grant, I worked alongside my studies at the ML4ED research lab, which leverages Machine Learning to improve digital learning. We utilized machine learning to stufy student's online learning methodology in flipped classroom settings to identify the most successful learning strategies.

Learn More

EPFL - Bachelor in Computer Science

2017-2020

5.2/6

The bachelor covered the fundamentals of computer science: algorithms and data structures, multiple programming paradigms - object-oriented, imperative, functional - computer architecture, computer networks, etc.

KTH, Stockholm

- 2019 - 2020 | Exchange year

Projects

Computer Vision | Python

Analyzing Idiosyncratic patterns of clinician errors in skin cancer diagnosis

Spring 2023

1

Master's Thesis

Maximal Grade

In the context of my Master's thesis, I worked at the University of California, Berkeley on a new method to analyze error patterns in clinician misdiagnoses of melanoma skin cancer. To uncover idiosyncratic and systematic diagnostic errors, we relied on a clustering of computer vision image embeddings to analyze intra- and inter-clinician variations. The thesis received the maximal grade.

Graph Neural Networks | Python

Financial fraud detection via graph neural networks on heterogeneous graphs

Fall 2022

1

Internship

During a 6-month internship at Oracle Labs, we designed a novel graph neural network (GNN) method tailored to detect fraud cases in a financial transaction network. GNN allows a flexibility matching the diversity of forms frauds cases can take. As main innovations, the method detects whole transaction networks rather than individual nodes or transactions, and supports different types of vertices and edges, i.e., heterogeneous graphs.

Computer Vision | Python

Deep Learning for Automated Detection of Tuberculosis from Lung Ultrasound

Spring 2022

1

Research Project

Maximal Grade

A 6-month research project, in which I leveraged deep learning to diagnose TB from lung ultrasounds (LUS). Ultrasound probes's practicality makes it a promising alternative. TB is nowadays only present in regions where treatment can't be afforded. However, LUS interpretation is notoriously complex and deep learning may prove pivotal in TB diagnosis. Our model proved significantly better than clinician-handcrafted features.

Time series ML | Python

Machine Learning to improve digital learning and student retention

2020-21

1

Research grant

Funded by an EPFL grant during 12 months, I worked alongside my studies in a lab leveraging Machine Learning to improve education. The first project aimed at improving early prediction of student dropout in online courses. In a second time, I worked on a deep learning framework to handle irregular time series, ubiquitous in digital education and currently requiring manual engineering. Both projects resulted in publications.

Large scale data | Spark Hive HDFS

Trip planner accounting for delays and successful connection probabilities

Spring 2022

4

Course Project

Maximal Grade

The aim of the project is to leverage the Swiss public transport data to implement a journey planner in the area of Zurich. Notably, our trip planner assesses the success rate of each connection, letting the user choose the confidence level of his or her journey. The data, stored as a Hive table, was processed using PySpark and the algorithm was implemented in Python.

Data visualization | d3.js web

Chess insights through visualizations of one million chess games

Spring 2021

3

Course Project

Maximal Grade

By collecting and parsing more than a million chess games from Lichess, we set our goal to draw insights and convey them intuitively through interactive data visualizations using D3.js. Starting from a visual history to the game basics, we show the importance of the chessboard center and analyze the effectiveness of the different chess openings in the dataset. We also let users watch any game in the dataset to follow their own interests.

Behavioral Data | Python

Song2vec: mining music listening history for meaningful song embeddings

Spring 2021

3

Course Project

5.5 / 6

Inspired by the word2vec method, we utilized the music listening history of 1000 users, composed of 20 million listens, to extract information found in songs' listening chronology. Such method is essential in music recommendation. Interestingly Karma Police - Radiohead + Metallica = Where Is My Mind? In other words, if Karma Police had been produced by Metallica rather than Radiohead, our closest result was Where Is My Mind? by The Pixies.

Recommender systems | Scala Spark

Movie recommender systems for large scale data with Apache Spark

Spring 2021

1

Course Project

5.75 / 6

Throughout this course project, we implemented multiple recommender systems of increasing complexity in Apache Spark and Scala. The project is organized as successive milestones to practice wielding Scala and Spark to handle large scale data.

Computer vision | TF Keras

Machine Learning competition of road segmentation from satellite images

Fall 2020

3

Course Project

5.5 / 6

We designed and optimized a model to discriminate roads from satellite images. We implemented a U-Net model with TensorFlow Keras, producing a 2D-grid of pixel labels. By leveraging heavy data augmentation during training and majority voting on image transformations during inference, our model reached an F1 of 90% and an accuracy of 94.5%. We used an Adam optimizer and a focal Tversky loss to address the class imbalance.

Interests

Hiking

Picture from the top of Mont Charvin, France.
I am usually up in the mountains whenever time and weather permitting. I grew up in the Alps, and mountains are typically the first thing I would miss in a new location.

Cycling

A farewell gift from a dear friend, showing Bikie my dear bike.
Being able to commute by bike is one of the simple things I appreciate and treasure.

Linguistics

My name, Julien, spelled with the International Phonetic Alphabet.
Learning Vietnamese in a self-taught manner created a deep interest in linguistics and led me to audit a linguistics class during my stay at UC Berkeley.
I am fascinated by the diversity found in languages, especially in phonetics and writing systems.

Arts and craft

These flowers were painted as a token of gratitude for some friends.
I dabble in creative crafts, mostly drawing and watercolor. I've recently tried linocut printmaking and loved it. Furthermore, I enjoy creative programming tasks, such as data visualization or designing this website.

Reading

A sunny morning at the farmer's market, snacking on strawberries while reading.
Whether in transit or waiting in line, I always have a book on me. A good book with a drink in a cute location is one of the ways I savor life. I read mostly fiction, essays and whatever my friends recommend. Additionally, I'm very happy to discuss the pros and cons of an e-reader.)

Learning

I deeply enjoy diving into subjects I find compelling that are usually outside my field of study, linguistics being one of them. I also read about psychology - I took multiple classes of social psychology during my studies - architecture and city planning, or art history. I am curious and enjoy learning more without any pretention. I rely mostly on books and Wikipedia. As such, the literature review is of my favorite parts of a project.

Traveling

As common as it sounds, I like exploring new places, witnessing different architecture styles, new landscapes and urbanism, trying out local cuisines and learning about places' history. I strive to minimize my airplane carbon footprint and I travel mostly by train. I recently traveled all over the US for two months via night trains.

Cooking

Pretty produce at Jean-Talon Market in Montréal.
While I don't consider myself a very talented cook, I always enjoy the time I spend cooking. Especially improvising a yummy dish with whatever is in the fridge. I appreciate the creativity, the educated intuitions and satisfying results. I also really like eating.

Publications

Ren, Z.†, Vignoud, J.†, Wang, Z., Rutledge-Jukes, H., Sekhar, T., Fang, K., Duhaime, E., Whitney, D. (2023).


Proficiency is Associated with Higher Idiosyncratic Biases in Medical Image Perception.


† Equal Contributions.


Working paper.

Asadi, M., Swamy, V., Frej, J., Vignoud, J., Marras, M., Käser, T. (2023).


Ripple: Concept-Based Interpretation for Raw Time Series Models in Education.


Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 15903-15911.

Marras, M., Vignoud, J., Käser, T. (2021).


Can Feature Predictive Power Generalize? Benchmarking Early Predictors of Student Success across Flipped and Online Courses.


Proceedings of the 14th International Conference on Educational Data Mining.