CS 677 Big Data

Project 3: Data Dictionary

Some general information about the dataset:

Files are named nam_YYYYMM.tdv, where YYYY is the four-digit year and MM is the two-digit month. nam_tiny.tdv is a smaller sample dataset that you can use to test your programs before running them across the entire dataset.
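
If you need to pick out the monthly files or group them by year, the naming convention can be matched with a regular expression. This is a minimal sketch: the helper name parse_nam_filename and the example file name in the usage note are hypothetical, and the month range check is an added assumption.

```python
import re

# Pattern derived from the nam_YYYYMM.tdv naming convention.
NAM_FILE = re.compile(r"nam_(\d{4})(\d{2})\.tdv")

def parse_nam_filename(name):
    """Return (year, month) for a monthly file, or None for any other name."""
    match = NAM_FILE.fullmatch(name)
    if match is None:
        return None  # e.g. nam_tiny.tdv does not carry a year and month
    year, month = int(match.group(1)), int(match.group(2))
    if not 1 <= month <= 12:
        return None  # assumption: months outside 01-12 are treated as invalid
    return year, month
```

For a hypothetical file called nam_201501.tdv this returns (2015, 1); for nam_tiny.tdv it returns None, so the sample file is easy to exclude from a full run.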

These .tdv (tab-delimited values) files contain one complete observation per line. Each observation begins with a timestamp, followed by a Geohash that encodes its spatial location, and then 56 atmospheric features, for a total of 58 tab-separated fields per line.
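
A line in this layout can be split as sketched below. The function name parse_observation is hypothetical. The timestamp is kept as a string because its format is not specified here, and the sketch assumes every feature value is numeric text that float() accepts; adjust it if your data contains missing or non-numeric values.

```python
NUM_FEATURES = 56  # atmospheric features that follow the timestamp and Geohash

def parse_observation(line):
    """Split one .tdv line into (timestamp, geohash, features)."""
    fields = line.rstrip("\r\n").split("\t")
    expected = 2 + NUM_FEATURES
    if len(fields) != expected:
        raise ValueError(f"expected {expected} fields, got {len(fields)}")
    timestamp, geohash = fields[0], fields[1]
    # Assumption: feature values are numeric; float() raises ValueError otherwise.
    features = [float(value) for value in fields[2:]]
    return timestamp, geohash, features
```

Checking the field count up front makes truncated or malformed lines fail loudly instead of silently shifting every column.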

Here’s the complete list of fields, in column order:

  1. Timestamp
  2. Geohash
  3. geopotential_height_lltw
  4. water_equiv_of_accum_snow_depth_surface
  5. drag_coefficient_surface
  6. sensible_heat_net_flux_surface
  7. categorical_ice_pellets_yes1_no0_surface
  8. visibility_surface
  9. number_of_soil_layers_in_root_zone_surface
  10. categorical_freezing_rain_yes1_no0_surface
  11. pressure_reduced_to_msl_msl
  12. upward_short_wave_rad_flux_surface
  13. relative_humidity_zerodegc_isotherm
  14. categorical_snow_yes1_no0_surface
  15. u-component_of_wind_tropopause
  16. surface_wind_gust_surface
  17. total_cloud_cover_entire_atmosphere
  18. upward_long_wave_rad_flux_surface
  19. land_cover_land1_sea0_surface
  20. vegitation_type_as_in_sib_surface
  21. v-component_of_wind_pblri
  22. albedo_surface
  23. lightning_surface
  24. ice_cover_ice1_no_ice0_surface
  25. convective_inhibition_surface
  26. pressure_surface
  27. transpiration_stress-onset_soil_moisture_surface
  28. soil_porosity_surface
  29. vegetation_surface
  30. categorical_rain_yes1_no0_surface
  31. downward_long_wave_rad_flux_surface
  32. planetary_boundary_layer_height_surface
  33. soil_type_as_in_zobler_surface
  34. geopotential_height_cloud_base
  35. friction_velocity_surface
  36. maximumcomposite_radar_reflectivity_entire_atmosphere
  37. plant_canopy_surface_water_surface
  38. v-component_of_wind_maximum_wind
  39. geopotential_height_zerodegc_isotherm
  40. mean_sea_level_pressure_nam_model_reduction_msl
  41. temperature_surface
  42. snow_cover_surface
  43. geopotential_height_surface
  44. convective_available_potential_energy_surface
  45. latent_heat_net_flux_surface
  46. surface_roughness_surface
  47. pressure_maximum_wind
  48. temperature_tropopause
  49. geopotential_height_pblri
  50. pressure_tropopause
  51. snow_depth_surface
  52. v-component_of_wind_tropopause
  53. downward_short_wave_rad_flux_surface
  54. u-component_of_wind_maximum_wind
  55. wilting_point_surface
  56. precipitable_water_entire_atmosphere
  57. u-component_of_wind_pblri
  58. direct_evaporation_cease_soil_moisture_surface
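
The numbers in the list above are 1-based, while most languages index arrays from 0. One way to avoid off-by-one mistakes is to look fields up by name. The sketch below is illustrative only: FIELD_POSITION and get_field are hypothetical names, and only a handful of entries from the list are copied in.

```python
# 1-based positions copied from the field list; extend with the fields you need.
FIELD_POSITION = {
    "Timestamp": 1,
    "Geohash": 2,
    "visibility_surface": 8,
    "temperature_surface": 41,
    "snow_depth_surface": 51,
}

def get_field(fields, name):
    """Return the raw string value of `name` from an already-split line."""
    # Subtract 1 to convert the 1-based list position to a 0-based index.
    return fields[FIELD_POSITION[name] - 1]
```

Given fields = line.split("\t"), get_field(fields, "temperature_surface") reads the 41st column, which is fields[40].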