MSE5540/6640 Materials Informatics course at the University of Utah
This github repo contains coursework content such as class slides, code notebooks, homework assignments, literature, and more for MSE 5540/6640 "Materials Informatics" taught at the University of Utah in the Materials Science & Engineering department.
Below you'll find the approximate calendar for Spring 2025 and videos of the lectures are being placed on the following YouTube playlist https://youtube.com/playlist?list=PLL0SWcFqypCl4lrzk1dMWwTUrzQZFt7y0
month | day | Subject to cover | Assignment | Link |
---|---|---|---|---|
Jan | 7 | Syllabus. What is machine learning? How are materials discovered? Machine Learning vs Materials Informatics | Install software packages together in class | |
Jan | 9 | Using Github, In class example of fitting Hall-Petch data with linear model | Read 5 High Impact Research Areas in ML for MSE (paper1), Read ISLP Chapter 3, but especially Section 3.1 | paper1, ISLP |
Jan | 14 | Materials data repositories, get pymatgen running for everybody, examples of MP API, MDF, NOMAD, others | Create a new env and make sure you can get the notebooks in the "worked examples/MP_API_example" and "worked examples/foundry" folders running. | Materials Project API |
Jan | 16 | Machine Learning Tasks and Types, Featurization in ML, Composition-based feature vector | Read Is domain knowledge necessary for MI (paper1). Make sure you can get the CBFV_example notebook running in the ""worked examples/CBFV_example" folder | paper1 |
Jan | 21 | Classification and cross-validation | Read ISLP Sections 4.1-4.5 and Section 5.1. Run through classification notebook | ISLP |
Jan | 23 | Structure-based feature vector, crystal graph networks, SMILES vs SELFIES, 2pt statistics | read selfies (paper1), two-point statistics (paper2) and intro to graph networks (blog1) | paper1, paper2, blog1 |
Jan | 28 | Simple linear/nonlinear models. test/train/validation/metrics | Read linear vs non-linear (blog1), read best practices (paper1), benchmark dataset (paper2), and loco-cv (paper3). | blog1, paper1, paper2, paper3 |
Jan | 30 | in-class examples of featurization | Run through 2pt statistics, GridRDF, CBFV notebooks | HW1 due! |
Feb | 4 | ensemble models, ensemble learning | Read ensemble (blog1), and ensemble learning (paper1) | blog1, paper1 |
Feb | 6 | Extrapolation, support vector machines, clustering | Read extrapolation to extraordinary materials (paper1), clustering (blog1) , SVMs (blog2) | paper1, blog1, blog2 |
Feb | 11 | Artificial neural networks | Read the introduction to neural networks (blog1, blog2) | blog1, blog2 |
Feb | 13 | Advanced deep learning (CNNs, RNNs) | HW2 due. Read… | blog1, blog2 |
Feb | 18 | Transformers | Read the introduction to transformers (blog1, blog2) | blog1, blog2 |
Feb | 20 | Generative ML: Generative Adversarial Networks and variational autoencoders | Read about VAEs (blog1, blog2, repo1) and GANS () | blog1, blog2, repo1 |
Feb | 25 | Diffusion models and Image segmentation | Read U-net (paper1) and nuclear forensics (paper2) | CrysTens repo |
Feb | 27 | Image segmentation part 2 and in-class coding examples | Download CrysTens github repo, read Segment Anything Model (paper 3) | paper1, paper2, paper3 |
Mar | 4 | Bayesian Inference | Read the introduction to Bayesian (blog1), go through Naive Bayes notebook | blog1 |
Mar | 6 | Gaussian Processes | ||
Mar | 11 | Bayesian Inference | Read the introduction to Bayesian (blog1), go through Naive Bayes notebook | blog1 |
Mar | 13 | No CLASS, spring break | ||
Mar | 19 | No CLASS, spring break | ||
Mar | 18 | Self Driving labs part 1 | ||
Mar | 20 | Self Driving labs part 2 | ||
Mar | 25 | No CLASS, TMS Meeting | ||
Mar | 27 | NO CLASS, TMS Meeting | ||
Apr | 1 | Large Language Models part 2 | TBD | TBD |
Apr | 3 | Case study TBD | ||
Apr | 8 | Case study TBD | ||
Apr | 15 | Case study TBD | ||
Apr | 17 | Case study TBD | ||
Apr | 22 | Final project presentation |
| | | | | |
I can recommend the book Introduction to Machine Learning found here https://www.statlearning.com/