Center for Statistics and Machine Learning

 

 

Featured News

Oluwatamilore “Tamilore” Ajeigbe: harnessing data science to look at impact of race on STEM retention rates
Sept. 30, 2022

For her CSML independent work project, Ajeigbe decided to compare data from Predominantly White Institutions (PWIs) versus Historically Black College and Universities (HBCUs). She used pruned decision trees and OLS regression to uncover factors that would make a student more likely to stay in a STEM major and whether these factors changed depending on the institution, PWI or HBCU.

Featured Event

Introduction to Data Analysis using Python
Fri, Oct 7, 2022, 10:00 am

This workshop will get students started in data analysis using the pandas Python package. It will briefly cover different components of data analysis and connect them with the goal of extracting meaning from data. We will go over an example to illustrate the data analysis process from beginning to end.

Open Positions

DataX Postdoctoral Fellowships Available at Princeton
Sept. 21, 2022

The Center for Statistics and Machine Learning (CSML) invites applications for DataX Postdoctoral Fellowships. The DataX Postdoctoral Fellowships are intended for early-career scientists with a research interest in data science, statistics, and machine learning. 


 

Latest News

Oluwatamilore “Tamilore” Ajeigbe: harnessing data science to look at impact of race on STEM retention rates

For her CSML independent work project, Ajeigbe decided to compare data from Predominantly White Institutions (PWIs) versus Historically Black College and Universities (HBCUs). She used pruned decision trees and OLS regression to uncover factors that would make a student more likely to stay in a STEM major and whether these factors changed depending on the institution, PWI or HBCU.

CSML hosts "Welcome Back" event

The Center for Statistics and Machine Learning’s (CSML) "Welcome Back" reception for undergraduate and graduate certificate students, researchers and faculty was a chance to mark the beginning of the academic year, catch up with colleagues, and strengthen ties within the campus data science community. The event was held on September 19th. Check the post for a slideshow of pictures. 

DataX Postdoctoral Fellowships Available at Princeton

The Center for Statistics and Machine Learning (CSML) invites applications for DataX Postdoctoral Fellowships. The DataX Postdoctoral Fellowships are intended for early-career scientists with a research interest in data science, statistics, and machine learning. 

Hannah To: using data science to look at impact of gangs in El Salvador

For her CSML independent work project, Hannah To worked on a study to see how gangs in El Salvador impacted labor, and was advised by Thomas Fujiwara, associate professor of economics and international affairs. This project also fulfilled her senior thesis requirement.

Princeton University is actively monitoring the situation around coronavirus (COVID-19) and the evolving guidance from government and health authorities. The latest guidance for Princeton members and visitors is available on the University’s Emergency Management website

Upcoming Events

Introduction to Data Analysis using Python
Fri, Oct 7, 2022, 10:00 am

This workshop will get students started in data analysis using the pandas Python package. It will briefly cover different components of data analysis and connect them with the goal of extracting meaning from data. We will go over an example to illustrate the data analysis process from beginning to end.

A Language-Based Model of Organizational Identification Demonstrates How Within-Person Changes in Identification Relate to Network Position
Mon, Oct 10, 2022, 12:00 pm

Shifting attachments to social groups are a constant in the modern era.They are especially pronounced in the contemporary workplace. What accounts for variation in the strength of organizational identification?

Location
Aaron Burr Hall 219
Speaker
Data Visualization in Python
Tue, Oct 11, 2022, 4:30 pm

This workshop provides an introduction to effective data visualization in Python. The training focuses on three plotting packages: Matplotlib, Seaborn and Plotly. Examples may include simple static 1D plots, 2D contour maps, heat maps, violin plots, and box plots. The session may also touch on more advanced interactive plots.

The Limits of the Quantitative Approach to Discrimination
Tue, Oct 11, 2022, 5:00 pm

Discrimination is obvious to the people facing discrimination. Given this, do we even need quantitative studies to test if it exists? Regardless of the answer, quantitative studies such as ProPublica’s “Machine Bias” have had a galvanizing effect on racial justice, especially in the context of automated decision-making. 

Location
East Pyne 010
Speaker