Statistics and Machine Learning
As the field of data science grows and opens new opportunities in many different disciplines, Princeton University’s CSML has kept apace by fostering cutting-edge research and engaging in deep collaboration with faculty members, centers and departments across campus and with outside practitioners in varied industries. Graduate students would be well positioned at Princeton due to the center’s strong interdisciplinary nature and the wide array of exciting research happening on campus.
Princeton graduate students can now earn a Graduate Certificate in Statistics and Machine Learning as a complement to their departmental graduate studies. This certificate is overseen by CSML. There are three requirements in completing the certificate: completing appropriate course work, engaging in research involving statistics or machine learning, and participating in the CSML graduate seminar. (See link on the left-hand menu.)
Data-driven research increasingly involves large-scale complex data that needs to be analyzed using innovative methods from the fields of statistics and machine learning. This approach, coupled with advancing data science’s underlying methods and algorithms, has become an essential component of modern scientific discovery. Princeton University is committed to playing a vital role in preparing students to lead in these areas, and the certificate should deepen and enhance students’ understanding and application of data science techniques.
The Graduate Certificate Program in Statistics and Machine Learning is designed to formalize the training of students who contribute to or make use of statistics and machine learning as a significant part of their degree program. In addition, it serves to recognize the accomplishments of graduate students across the University who acquire additional training in statistics and machine learning, going beyond the requirements of their own degree programs. This certificate program is open to Princeton University students currently enrolled in a Ph.D. or master’s program at the University. Students must enroll by completing an online application form on the CSML website. The application will include a tentative plan and timeline for completing all the course requirements. Students are encouraged to sign up as soon as possible, and no later than one semester prior to graduation. Because Ph.D. students who have entered Dissertation Completion Enrollment (DCE) status are not eligible to enroll in courses, Ph.D. students must enroll in the CSML graduate certificate program in time to complete the course requirements while they are still in their regular degree program length.
For enrollment, please use this form: Graduate Certificate Enrollment Form
For questions, contact us at firstname.lastname@example.org
For students enrolled in a graduate degree program with a thesis or dissertation requirement, the certificate is comprised of three components: (a) completion of three appropriate graduate courses, (b) a relevant research contribution, and (c) a research seminar. We expect that the core courses can be taken as graduate electives, in partial fulfillment of the various course requirements in home departments, and that item (b) will naturally form as part of the student’s thesis or dissertation. For non-thesis master’s students, item (b) is replaced by an appropriate technical presentation. The certificate will appear on a student’s official transcript after all requirements for the certificate have been fulfilled and a graduate degree has been awarded. Students who earn the certificate will also be recognized on the CSML website.
Take for credit and receive an average GPA of B+ (3.3) or better in three courses from the approved list that has three categories: core machine learning, core statistics and probabilistic modeling, and electives. One course must be selected from each category. With the permission of the certificate director, the elective course can be selected from a core category provided it does not significantly overlap with the other course selected from that category. At least one of the three courses must be outside the student’s home department and at most one course can be below the 500 level.
The core curriculum is intended to provide training in the foundations of statistics and machine learning while ensuring that certificate students have some breadth across the core of statistics and machine learning. A list of approved core courses in the two areas is included below. In addition, a certificate student selects the third course from a listed set of elective courses that expands on the core courses. These electives delve more deeply into supporting material (e.g., optimization) or focus on applications in a specific domain.
Students may not count courses that are used to satisfy core requirements in their home department concentration toward this certificate, however, they may count up to two electives that were taken for their degree requirements.
Seminar series (SML 510)
The CSML graduate seminar, SML 510 serves as a venue for discussing current methods and results and the integration of different research approaches to data analysis. Attendance and participation in the CSML graduate seminar for at least one semester is required. It helps teach students how to communicate technical ideas to a broad audience and encourages the development of skills for interacting with other students, postdoctoral fellows, and faculty who are investigating data analysis problems. It also serves to build a supporting community of young scholars with shared interests.
For students completing a thesis or dissertation as part of their degree, the thesis or dissertation should include a significant component making contributions to statistics or machine learning, or rigorous use of such methods in an application domain. To ensure that an important component of a Ph.D. student's dissertation involves either rigorous data analysis and/or mathematical or computational modeling of data or machine learning problems, one of the dissertation readers or FPO committee members must be a participating graduate certificate faculty member. This reader or committee member is required to send either a letter or the reader's report to the program director to verify that the dissertation satisfies this requirement. Master's students who complete a thesis follow the same requirement.
For non-thesis master's degree students, the original research requirement can be satisfied by completing a relevant graded research project while enrolled in FIN 561 - Master's Project II, or through applied research performed in a professional setting as part of an approved internship. Such research performed in a professional setting must be submitted as a technical presentation and is reviewed for approval by the certificate director.
The original research requirement can also be met by any student through a publishable research paper that is approved by the certificate director.
- Ryan P. Adams
- Peter M. Melchior
- Ryan P. Adams, Computer Science
- Prateek Mittal, Electrical & Comp Engineering
- John M. Mulvey, Oper Res and Financial Eng
- Peter J. Ramadge, Electrical & Comp Engineering
- Marc Ratkovic, Politics
- Mengdi Wang, Electrical & Comp Engineering
- Amir Ali Ahmadi, Oper Res and Financial Eng
- Sanjeev Arora, Computer Science
- Yacine Aït-Sahalia, Economics
- Matias D. Cattaneo, Oper Res and Financial Eng
- Danqi Chen, Computer Science
- Yuxin Chen, Electrical & Comp Engineering
- Jonathan D. Cohen, Psychology
- Jia Deng, Computer Science
- Adji Bousso Dieng, Computer Science
- Barbara E Engelhardt, Computer Science
- Jianqing Fan, Oper Res and Financial Eng
- Jaime Fernandez Fisac, Electrical & Comp Engineering
- Filiz Garip, Sociology
- Tom Griffiths, Psychology
- Boris Hanin, Oper Res and Financial Eng
- Elad Hazan, Computer Science
- Bo E. Honoré, Economics
- Niraj K. Jha, Electrical & Comp Engineering
- Chi Jin, Electrical & Comp Engineering
- Jason Matthew Klusowski, Oper Res and Financial Eng
- Michal Kolesár, Economics
- Sun-Yuan Kung, Electrical & Comp Engineering
- Ching-Yao Lai, Geosciences
- Jason D. Lee, Electrical & Comp Engineering
- Naomi E. Leonard, Mechanical & Aerospace Eng
- Mariangela Lisanti, Physics
- John B. Londregan, Schl of Public & Int'l Affairs
- Anirudha Majumdar, Mechanical & Aerospace Eng
- Meredith A. Martin, English
- William A. Massey, Oper Res and Financial Eng
- Reed M. Maxwell, Civil and Environmental Eng
- Peter M. Melchior, Astrophysical Sciences
- Ulrich K. Mueller, Economics
- Karthik Narasimhan, Computer Science
- Kenneth A. Norman, Psychology
- Jonathan W. Pillow, Psychology
- Mikkel Plagborg-Moller, Economics
- H. Vincent Poor, Electrical & Comp Engineering
- Yuri Pritykin, Computer Science
- Miklos Z. Racz, Oper Res and Financial Eng
- Ben Raphael, Computer Science
- Olga Russakovsky, Computer Science
- Matthew J. Salganik, Sociology
- H. Sebastian Seung, Computer Science
- Amit Singer, Mathematics
- Mona Singh, Computer Science
- Bartolomeo Stellato, Oper Res and Financial Eng
- Brandon M. Stewart, Sociology
- John D. Storey, Integrative Genomics
- Michael A. Strauss, Astrophysical Sciences
- Rocío Titiunik, Politics
- Jeroen Tromp, Geosciences
- Olga G. Troyanskaya, Computer Science
- Robert J. Vanderbei, Oper Res and Financial Eng
- Mark W. Watson, Schl of Public & Int'l Affairs
- Yu Xie, Sociology
Sits with Committee
- Daisy Yan Huang
- Ricardo Pereira Masini
Courses listed below are graduate-level courses that have been approved by the program’s faculty as well as the Curriculum Subcommittee of the Faculty Committee on the Graduate School as permanent course offerings. Permanent courses may be offered by the department or program on an ongoing basis, depending on curricular needs, scheduling requirements, and student interest. Not listed below are undergraduate courses and one-time-only graduate courses, which may be found for a specific term through the Registrar’s website. Also not listed are graduate-level independent reading and research courses, which may be approved by the Graduate School for individual students.