Click on the title links to download the data. Email your instructors if you have any problems downloading.
berkeley_collisions.csv Data on injury and fatal traffic accidents in Berkeley from 2006 to 2014, from the Transportation Injury Mapping System. The data comes from the California Highway Patrol’s Statewide Integrated Traffic Records System and was then geocoded for mapping by UC Berkeley’s Safe Transportation Research & Education Center.
mlb_salaries_2015.csv Salaries of players in Major League Baseball at the start of the 2015 season, from the Lahman Baseball Database.
calpads_cohort16_alameda.csvThe State of California publishes quite a bit of high school graduation data statewide, here filtered for Alemeda county only.
USGS_2.5_month.csvUSGS publishes real time earthquake data.
311_Cases_Dec2017.csvSan Francisco’s 311 call records, from SF’s Open Data Portal, filtered for cases opened between 12/01/2017 12:00:00 AM and 01/01/2018 12:00:00 AM.
techexports.xls High-technology exports from 1990 to 2015, in current US dollars, from the UN Comtrade database, supplied via the World Bank. High-technology exports include products in aerospace, computers, pharmaceuticals, scientific instruments, and electrical machinery.
ucb_stanford_2014.csv Data on federal government grants to UC Berkeley and Stanford University in 2014, downloaded from USASpending.gov.
alerts-actions_2017.xls Records of disciplinary alerts issued and actions taken by the Medical Board of California in 2017.
ca_discipline.csvDisciplinary alerts and actions issued by the Medical Board of California from 2008 to 2017. Processed from downloads available here.
ca_medicare_opioids.csvData on prescriptions of opioid drugs under the Medicare Part D Prescription Drug Program by doctors in California, from 2013 to 2015. Filtered from the national data downloads available here. This is the public release of the data that ProPublica used FOIA to obtain for earlier years for the story we discussed in Week 2.
npi_license.csvCrosswalk file to join National Provider Identifier codes to state license numbers, processed from the download available here to include license numbers potentially matching California doctors.
pfizer.csv Payments made by Pfizer to doctors across the United States in the second half on 2009. Contains the following variables:
org_indivFull name of the doctor, or their organization.
first_plusDoctor’s first and middle names.
last_name. First and last names.
stateCity and state.
category of paymentType of payment, which include
Expert-led Forums, in which doctors lecture their peers on using Pfizer’s drugs, and `Professional Advising.
cashValue of payments made in cash.
otherValue of payments made in-kind, for example puschase of meals.
totalvalue of payment, whether cash or in-kind.
fda.csv Data on warning letters sent to doctors by the US Food and Drug Administration, because of problems in the way in which they ran clinical trials testing experimental treatments. Contains the following variables:
name_middleDoctor’s last, first, and middle names.
issuedDate letter was sent.
officeOffice within the FDA that sent the letter.