Visualizing and Investigating Distances to Understand the Reliability of Weather Data #35

@rectheworld

Goals

Goal: We have matched a subset of our volunteer-submitted weather data to NOAA weather data. However, we have not yet filtered on the distance between a CBC and its closest NOAA station, so we do not know how far apart a pair can be before the comparison stops being meaningful.

Inputs

cbc_cleaned_usa_merged.csv - Contains volunteer-submitted CBC data and the weather data for the closest NOAA station. Distances are measured in meters.

Outputs

  • A visualization of how distance affects the agreement between volunteer-submitted weather data and NOAA data
  • A suggestion for what distance cutoff should be used when deciding whether volunteer-submitted weather data at a CBC can be checked against NOAA data
  • IPython notebooks are the preferred output format
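
One possible starting point for the visualization is to bin the matched rows by distance and look at how far the volunteer and NOAA temperatures disagree within each bin. The sketch below uses only the standard library; the column names (`distance_m`, `volunteer_temp`, `noaa_temp`) and the bin edges are assumptions for illustration, not the actual headers in cbc_cleaned_usa_merged.csv.

```python
from bisect import bisect_right
from statistics import mean

# Hypothetical column names; the real headers in the merged CSV may differ.
DIST_COL = "distance_m"
VOL_COL = "volunteer_temp"
NOAA_COL = "noaa_temp"


def summarize_by_distance(rows, bin_edges_km):
    """Group rows into [low, high) distance bins and report the mean
    absolute volunteer-vs-NOAA temperature difference for each bin."""
    n_bins = len(bin_edges_km) - 1
    diffs = {i: [] for i in range(n_bins)}
    for row in rows:
        try:
            dist_km = float(row[DIST_COL]) / 1000.0  # distances are stored in meters
            diff = abs(float(row[VOL_COL]) - float(row[NOAA_COL]))
        except (KeyError, TypeError, ValueError):
            continue  # skip rows with missing or non-numeric values
        idx = bisect_right(bin_edges_km, dist_km) - 1
        if 0 <= idx < n_bins:  # ignore rows outside the requested range
            diffs[idx].append(diff)
    return [
        {
            "bin_km": (bin_edges_km[i], bin_edges_km[i + 1]),
            "n": len(values),
            "mean_abs_diff": mean(values) if values else None,
        }
        for i, values in diffs.items()
    ]
```

The rows can come from `csv.DictReader`, and the resulting per-bin summary can be plotted as a bar or line chart in a notebook. If the mean difference rises sharply past some bin, that bin edge is a candidate for the distance cutoff.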

Inspiration and Support

From Francisco Vannini:

Would be nice to get a plot of this sort
https://plot.ly/python/choropleth-maps/

It would be even better if the map had dynamic parameters (day, month, year) so that it updates whenever they change.
https://medium.com/@williamkoehrsen

From Ian Davis:

Hi all, I'd like to add that we will need to consider how far away the closest NOAA station is. It turns out there is someone in "Midway Atoll" (out in the middle of the Pacific Ocean!) who collected data for a number of years. The closest station distance is 1.9e6 m (~1180 miles)
EDIT:
So I did a quick Google search asking at what radial distance a temperature measurement would still be considered accurate. The answers seem to be quite varied and depend on the terrain, among other things. I don't think we'll be able to consider those factors. However, one particular discussion referenced a paper from the World Meteorological Organization (WMO) about weather observations. One line states "but for small-scale or local applications the considered area may have dimensions of 10 km or less." Not sure if that's enough for us to go on, but it's a start.
https://www.researchgate.net/post/Data_and_weather_station_distance
https://www.researchgate.net/deref/https%3A%2F%2Fwww.wmo.int%2Fpages%2Fprog%2Fwww%2FIMOP%2Fmeetings%2FCB%2FEd-Board-2%2FEdBd-2_P-I_Ch-1.doc
Using paired.csv with the distance limit set to 10 km, the number of rows drops from 53811 to 38178, a drop of 29%.
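
The 10 km filter and the figures quoted above can be reproduced with a small sketch like the following. The column name `distance_m` and the helper names are hypothetical; only the row counts, the 10 km limit, and the 1.9e6 m distance come from the comment.

```python
METERS_PER_MILE = 1609.344


def filter_by_distance(rows, max_km, dist_col="distance_m"):
    """Keep rows whose closest-station distance is at most max_km.
    dist_col is a hypothetical column name holding meters."""
    limit_m = max_km * 1000.0
    return [row for row in rows if float(row[dist_col]) <= limit_m]


def percent_dropped(n_before, n_after):
    """Percentage of rows removed by a filter."""
    return 100.0 * (n_before - n_after) / n_before


def meters_to_miles(meters):
    return meters / METERS_PER_MILE


# Row counts reported for paired.csv at a 10 km limit.
print(round(percent_dropped(53811, 38178), 1))  # 29.1
# The Midway Atoll distance, in miles.
print(round(meters_to_miles(1.9e6)))  # 1181
```

Trying several values of `max_km` and recording `percent_dropped` for each gives a quick view of how much data each candidate cutoff would cost.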
