Format: There is a digital book for each year of new students at Carleton starting in 1955. Each book contains student names, high schools, and hometowns. We need to convert this into a dataset with just high schools and hometowns. In order to scrape the data, we are using a python library called pdfminer and will clean the data using excel.
Rights: Carleton College owns this data. Under the rights management section, it says the images from the zoobooks may not be reproduced, but doesn’t say anything about using the data within the zoobook.
Privacy: Carleton College students are depicted in the data, and there is personal information such as full name, high school and hometown. When we use the data, we won’t attach any names to the data so it remains anonymous.
Citation: Carleton College Archives. (1991). Carleton College Zoobooks. Carleton Digital Collections. https://contentdm.carleton.edu/digital/collection/Zoobooks
National Center for Education Statistics and Institute for Education Sciences
Format: These databases allow you to search for school information about public and private schools. In order to determine if the schools from the zoobook are public or private, we will run it against these databases.
Rights: The data is owned by the National Center for Education Statistics. The only information we plan on using from the website is if the schools are public or private. There is no information easily available on the website about rights to the data, so we believe it is okay to use this information.
Privacy: High schools are being depicted in the dataset, so there is no personal information. Therefore, we don’t have any ethical concerns about using this data.
Public SchoolCitation: National Center for Education Statistics. (1965a). Search for Public Schools. https://nces.ed.gov/ccd/schoolsearch/
Private School Citation: National Center for Education Statistics. (1965). Search for Private Schools. Search for private schools. https://nces.ed.gov/surveys/pss/privateschoolsearch/index.asp
Format: This is an article from the Education Writers Association describing the history and background of college access and admissions.
Rights: There are no rights. We plan on drawing insights from this article.
Privacy: No privacy issues since it is a public article.
Citation: Selingo, J. (2022, July 5). History and background: College Access & Admissions. ewa.org. https://ewa.org/issues/higher-ed/history-and-background-college-access-admissions
Format: This is a research paper about socioeconomic gaps in education. It looks at socioeconomic data across states.
Rights: There are no rights. We plan on drawing insights from this article.
Privacy: No privacy issues since it is a public article.
Citation: Jang, H., & Reardon, S. F. (2019). States as sites of educational (in)equality: State contexts and the socioeconomic achievement gradient. AERA open. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7413034/