Database¶
CHANGE NOTICE: We are still working on operationalization. The database currently only goes through 2001
The Spectral Trend Database tracks over 14 thousand points in the mid-western United States from 2000 to present. Below we have a detailed description of each of the tables. The database lives on Google Big Query and can be accessed directly using big query. However, we’ve built a number of python tools to make accessing the data eaiser (see these examples).
Table |
Keys |
Dates |
Daily |
Description |
---|---|---|---|---|
sample_id |
False |
False |
location information such as lat, lon, h3-cels, and administrative information such as state and county |
|
sample_id, year |
False |
False |
CDL derived crop-type (corn, soy, NA, other) for year |
|
sample_id, year |
True |
False |
yield estimations for year |
|
sample_id, year |
True |
False |
masked landsat band values and spectral indices for year |
|
sample_id, year |
True |
True |
interpolated and smoothed daily values for indices contained in |
|
sample_id, year |
True |
True |
additional indices dervived from |
|
sample_id, year |
True |
False |
statistical (min, max, mean, median, skew, kurtosis) aggregation of |
|
sample_id, year |
True |
False |
same as |
|
sample_id, year |
True |
False |
same as |
TABLES¶
SAMPLE_POINTS
¶
This table contains locational information for sample-points such as lat, lon and h3-cells. Additionally, administrative information from the US Census, such as state and county name, are provided. A detailed description on of how the locations have been selected can be found here.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm. |
lon |
longitude |
lat |
latitude |
|
h3-cell at resolution |
|
water area (square meters) |
|
land area (square meters) |
|
legal/statistical area description code for county |
|
state name |
|
United States Postal Service state abbreviation |
|
county name |
|
County identifier; a concatenation of current state Federal Information Processing Series (FIPS) code and county FIPS code |
|
Fully qualified geographic identifier; a concatenation of census survey summary level information with the GEOID attribute value. The GEOIDFQ attribute is calculated to facilitate joining census spatial data to census survey summary files. |
|
county Federal Information Processing Series (FIPS) code |
|
name and the translated legal/statistical area description for county |
|
county Geographic Names Information System (GNIS) code |
|
state Federal Information Processing Series (FIPS) code |
CDL_CROP_TYPE
¶
CDL derived crop-type (corn, soy, NA, other) for year.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm |
|
year of yield |
|
CDL derived crop-type: one of corn, soy, other, na |
|
CDL derived crop-label: integer crop identifier (corn: 0, soy: 1, other: 2, na: 3) |
QDANN_YIELD
¶
Modeled yield data from 2008 to Present using QDANN.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm |
|
year of yield |
|
modeled biomass yield |
|
QDANN derived crop-type: one of corn or soy |
|
QDANN derived crop-label: integer crop identifier (corn: 0, soy: 1 ) |
RAW_INDICES_V1
¶
Masked Landsat (optical) band values and derived spectral-indices. See config/spectral_indices/v1 for a list of indices and how they are calculated.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm. |
|
agricultural year (from Sept 1. of the prior year, through December 1 of the listed year) |
|
date (format: YYYY-MM-DD) for each value |
|
cloud-masked landsat value (blue, green, red, nir, swir1, swir2) |
|
one of: ndvi, ndbr, ndmi, ndwi, msi, rdi, srmir, slavi, wdrvi, bwdrvi, savi, gsavi, mnli, tdvi, evi, evi2, evi22, atsavi, afri1600, cm, cig, gndvi, msavi, gvi, wet, tvi, osavi, rdvi, rvi, grvi, si, si1, gari, gli, msr, nli (see config/spectral_indices/v1 for details) |
SMOOTHED_INDICES_V1
¶
Smoothed daily values computed from RAW_INDICES_V1
. Smoothing computed here, leveraging the spectral_trend_database.smoothing module.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm. |
|
agricultural year (from Sept 1. of the prior year, through December 1 of the listed year) |
|
list of dates for each value |
|
one of: ndvi, ndbr, ndmi, ndwi, msi, rdi, srmir, slavi, wdrvi, bwdrvi, savi, gsavi, mnli, tdvi, evi, evi2, evi22, atsavi, afri1600, cm, cig, gndvi, msavi, gvi, wet, tvi, osavi, rdvi, rvi, grvi, si, si1, gari, gli, msr, nli (see config/spectral_indices/v1 for details) |
MACD_INDICES_V1
¶
Exponential Moving Averages (ema), and Moving Averge Convergence Divergence (macd/macd-div) values computed from smoothed values of NDVI, EVI and EVI2. These have been shown to be useful metrics in determining green-up dates.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm. |
|
list of dates for each value |
|
agricultural year (from December 1st of the prior year, through December 1 of the listed year) |
|
where |
INDICES_STATS_V1 (_GROWING/OFF_SEASON)
¶
Annual (12/1 through 12/1) and sub-annual aggregation statistics (min, max, mean, median, skew, kurtosis) determined for each index in SMOOTHED_INDICES_V1
. INDICES_STATS_V1
uses all available data. INDICES_STATS_V1_OFF_SEASON
uses data from December 1st of the prior-year to March 15th of the listed year. INDICES_STATS_V1_GROWING_SEASON
uses data from April 15th to November 1st of the listed year.
Column(s) |
Description |
---|---|
|
(key-column) 11-character geohash. specifies location down to about 15 cm. |
|
list of dates for each value |
|
agricultural year (from Sept 1. of the prior year, through December 1 of the listed year) |
|
where |