A thread on r/MachineLearning is quietly doing what most academic departments won't: crowdsourcing a rigorous, research-ready curriculum for applying AI to materials science. The post, framed as a starter kit request, has the potential to become a genuine reference point for anyone trying to break into computational chemistry or cheminformatics from an ML background.
What's on the table
The thread's author — someone already comfortable with deep learning fundamentals — is asking for papers, courses, tutorials, and talks sufficient to conduct real research in AI for materials science, not just dabble. The one resource they've already surfaced: a GitHub-hosted course from the University of Chicago, applied-ai-for-materials, built by the Ward Lab. It covers graph neural networks for molecular property prediction, force fields, and active learning workflows — the kind of content that actually appears in NeurIPS materials science workshops.
Why it matters
AI for materials science sits at a genuinely high-value intersection: faster drug discovery, next-gen battery design, catalyst optimization. But the entry ramp is steep. The field demands fluency in both ML architecture and domain chemistry — crystal symmetry, density functional theory, atomic representations like SOAP and ACSF. Most ML practitioners hit a wall fast without a structured path in. A community-vetted syllabus lowers that barrier meaningfully.
What to watch
The comments are where this gets useful. If the thread attracts responses from active researchers — and r/MachineLearning has a solid contingent of them — it could surface lesser-known resources like the Open Catalyst Project, MatBench benchmarks, or tutorials around tools like JARVIS and the Materials Project API. Worth bookmarking the thread if you're anywhere near this space.