Skip to content

Smoking/incarceration updates#310

Open
lyu039 wants to merge 14 commits intomainfrom
smoking/incarceration_updates
Open

Smoking/incarceration updates#310
lyu039 wants to merge 14 commits intomainfrom
smoking/incarceration_updates

Conversation

@lyu039
Copy link
Copy Markdown
Collaborator

@lyu039 lyu039 commented Feb 13, 2026

@Makosak please review for following changes:

  • edited on smoking population data, prison incarceration rates, and jail incarceration rates
  • finalized data information, limitations, and comments
  • added state and county maps

- finalized data description, limitations, and comments
- add state and county level maps
- finalized data info and limitations
- add state and county maps
- finalized data info and limitations
- add state and county maps
@netlify
Copy link
Copy Markdown

netlify bot commented Feb 13, 2026

Deploy Preview for oeps canceled.

Name Link
🔨 Latest commit 8cc8a08
🔍 Latest deploy log https://app.netlify.com/projects/oeps/deploys/69cc593e0fdea40008e67f9c

Added two data cleaning R scritps of smoking data and incarceration (prison & jail) data
Pyhton notebook for calculating RAAM scores in Illinois
@Makosak Makosak self-requested a review March 31, 2026 22:12
@Makosak
Copy link
Copy Markdown
Collaborator

Makosak commented Mar 31, 2026

Please address these items so we can move this forward, @lyu039

  • Add Data: Currently the data to be merged is missing. It should be in the "data_to_merge" folder of the branch you're working on. Please use the appropriate naming convention for each dataset, for ex 'internet_tract23.csv'
  • Script Naming Convention: Please update your script names to be consistent with the others. Since the smoking and incarceration scripts are more about cleaning, consider something like 'clean_SmokingRates.R'
  • Metadata Registry is not correct (see registry info here, using this branch). While variables indicate measures available from 2017-2024 on the official Metadata documentation, the only year shown in the Metadata registry is 2025. This is not consistent. The same was found true for Prison measures. Also, the "long" format source is the same as the "short" format for Smoking Rates -- that is not correct. Please update.

When completed, please list all the items you corrected as a comment. In addition, please provide, as a comment:

  • Metadata concept, with a bulleted list of new variables to be added/updated below.
  • Indicate which year and spatial scale is updated
  • Add a link to the correct dataset to be added, as you can link via Github/branch.

Copy link
Copy Markdown
Collaborator

@Makosak Makosak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please see changes requested in the comments.

@lyu039
Copy link
Copy Markdown
Collaborator Author

lyu039 commented Mar 31, 2026

Metadata Variables:

  1. Smoking
    - Percentage of smoking population
  2. Prison Incarceration
    - Total prison population rate
    - Total prison admission rate
    - Total prison population count
    - Total prison admission count
  3. Jail Incarceration
    - Total jail population rate
    - Total jail admission rate
    - Pretrial jail population rate
    - Total jail population count
    - Total jail admission count
    - Pretrial jail population count

Update 1 - Script Naming Convention:

  1. clean_SmokingRates.R
  2. clean_IncarcerationRates.R

Update 2 - Metadata Registry:

  1. Smoking
    - Updated the "short" format source
  2. Prison & Jail Incarceration
    - Updated years shown in the Metadata registry, now consistent with Metadata documentation

Update 3 - Add Data (some explanation below):
https://github.com/healthyregions/oeps/tree/smoking/incarceration_updates/data_to_merge

  1. Smoking (2 data files, updated with 2025 state and county data)
    - State 2025
    - County 2025
  2. Prison Incarceration (3 data files, updated with 2024 & 2022 state data and 2019 county data)
    - State 2024: includes total prison population count and rate only, no data for the total prison admission count and rate; same for 2023, so skip 2023 state data and only keep 2024)
    - State 2022: includes data for all variables, both total prison population count and rate, as well as the total prison admission count and rate
    - County 2019: most recent year for available data
  3. Jail Incarceration (4 data files, updated with 2023 & 2022 state and 2024 & 2023 county data)
    - State 2023: includes only total jail population & pretrial jail population, but no total jail admission
    - State 2022: includes all
    - County 2024: most recent available data, but only include quarter 1 data
    - County 2023: includes all

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants