π‘ About Me
Hi! I am Weimin Wang (ηε«ζ). I completed my MSc in Statistics and Data Science at Leiden University with a GPA of 8.28/10. My research focuses on data-driven modeling and predictive control of dynamical systems, combining statistical learning with control theory.
Supported by the CSC Scholarship, I am currently seeking PhD opportunities. I am interested in reliable and interpretable machine learning methods, with a particular focus on their applications in complex real-world systems.
π Research & Projects
Data-Enabled Predictive Control of Greenhouse Climate
- Developed data-driven predictive controller achieving 94% performance of model-based MPC with 5Γ computational speedup
- Identified and analyzed safety-critical constraint violations under out-of-distribution conditions & Investigated robustness challenges in data-driven control
- Manuscript under review at ECC 2026
Predicting Depression Risk Using Machine Learning (CHARLS Dataset)
- Developed predictive models on large-scale health survey data (CHARLS, 15k+ individuals)
- Implemented Random Forest, Gradient Boosting, MARS, and Logistic GAM with cross-validation
- Achieved best classification accuracy of 72.9% and identified key nonlinear risk factors using interpretable ML techniques
Causal Effect of Smoking on Stroke Risk
- Estimated 2.8% absolute risk increase using propensity score weighting on Framingham data (4,434 subjects, 24-year follow-up)
- Applied DAG-based confounder selection with backdoor criterion; verified positivity and balance assumptions via Love plots
- Conducted sensitivity analyses for missing data mechanisms and weight truncation strategies
Brain Iron Accumulation and Pathology Analysis
- Applied non-parametric tests with multiple comparison corrections to assess regional brain iron differences across pathology groups
- Developed stepwise regression models using AIC-based variable selection, identifying significant predictors
Generative Models and Sequence-to-Sequence Learning
- Trained VAE and GAN models for anime face synthesis
- Built encoder-decoder architectures(RNN/LSTM) for multimodal arithmetic tasks using MNIST-based representations
Scalable Similarity Search with Locality Sensitive Hashing
- Designed a MinHash + LSH pipeline to detect similar user pairs in the Netflix Prize dataset
- Achieving efficient large-scale Jaccard similarity search without exhaustive comparisons
π Education
MSc in Statistics and Data Science, Leiden University
Sep 2023 β Aug 2025 | GPA: 8.28/10
MEng in Optical Engineering, China Jiliang University
Sep 2017 β Jul 2020
BEng in Optoelectronic Information Science, China Jiliang University
Sep 2013 β Jul 2017 | Outstanding Graduate of Zhejiang Province (Top 5%)
πΌ Experience
Product Manager
Translated customer needs into data-driven product requirements. Analyzed market trends and developed bundling strategies for enterprise networking solutions.
Laboratory Assistant
Assisted in building solar cell calibration systems. Designed and tested pulse-to-continuous light conversion devices for precision measurement.
π Teaching
Teaching Assistant, Linear and Generalized Linear Models
Leiden University, OctβDec 2024
π£οΈ Presentations
- Femtosecond pulse laser beam shaping and power control β OIT 2019, Beijing (Oct 2019)
- Optical fiber bundles converting pulsed lasers into continuous waves β AOPC 2019, Beijing (Jul 2019)
π Honors & Awards
- Outstanding Graduate of Zhejiang Province (Top 5%), 2017
- Zhejiang Provincial Government Scholarship (Top 3%), 2016
- First-Class Scholarship, China Jiliang University (Top 3%), 2015
π οΈ Skills
Python (PyTorch, NumPy, SciPy, scikit-learn), R
Linear Algebra, Probability Theory, Optimization, Statistical Modeling
Statistical Learning, Neural Networks, Deep Learning, High-dimensional modeling
Git, Jupyter, LaTeX, Markdown
Chinese (Native), English (Proficient, IELTS 6.5)