@spopelka-dsac
Contributor

module-name: Core Data Model - NPPES PUF Output Discovery

Jira Ticket #262

Problem

We need to ensure that our core data model supports the entities, data elements, and relationships of the outputs that we are expected to produce. Those outputs are the NPPES PUFs and the FHIR R4B JSON standard.

Solution

This draft PR introduces a Jupyter notebook that documents the process used to analyze the NPPES PUF files, break them into more normalized entities and data elements, and understand the relationships between individuals/organizations and those entities.

Result

The result of the notebook is an Excel spreadsheet with two sheets:

  • "Normalized Data Elements," which lists each data element only once, irrespective of the source field(s) that map to it
  • "Raw Mapping," which preserves the mapping between data elements and source field(s)
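
As a rough illustration of how the two sheets relate, here is a minimal sketch. It is not the notebook's actual code: the sample headers, the function names, and the rule of stripping a trailing `_N` suffix to find the data element are all assumptions made for this example.

```python
import re
from collections import defaultdict

# Hypothetical sample of NPPES PUF column headers (the real file has many more).
SOURCE_FIELDS = [
    "NPI",
    "Entity Type Code",
    "Healthcare Provider Taxonomy Code_1",
    "Healthcare Provider Taxonomy Code_2",
    "Provider License Number_1",
    "Provider License Number_2",
]

# Assumption: repeated fields differ only by a trailing "_<number>" suffix.
_SUFFIX = re.compile(r"_\d+$")


def build_raw_mapping(fields):
    """Return (data_element, source_field) pairs, one per source field."""
    return [(_SUFFIX.sub("", field), field) for field in fields]


def normalize(raw_mapping):
    """Collapse the raw mapping so each data element appears only once."""
    grouped = defaultdict(list)
    for element, source in raw_mapping:
        grouped[element].append(source)
    return dict(grouped)


raw_mapping = build_raw_mapping(SOURCE_FIELDS)  # analogous to "Raw Mapping"
normalized = normalize(raw_mapping)             # analogous to "Normalized Data Elements"

for element, sources in normalized.items():
    print(f"{element}: {len(sources)} source field(s)")
```

In this sketch, `raw_mapping` keeps one row per source field, while `normalized` keeps one entry per data element along with the source fields that feed it.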

Test Plan

  1. Review the notebook to understand the process
  2. Review the output Excel document to ensure that the proposed core data model covers the entities and data elements that will be necessary to generate the NPPES PUF files

N.B. Since this is discovery work, I don't think this PR should be merged into the code base, but it provides context for the process that was used to generate the NPPES Data Elements Excel Workbook (which I'll upload to Google Drive somewhere). This output may also inform the mapping of the core data model to the NPPES PUF output, a process that will eventually be incorporated into the code base.

2 participants