Data Type | Source | Generation Method | Format | Approx. File Size | License |
---|---|---|---|---|---|
Imagery | DigitalGlobe | Satellite (RGB) | GeoTIFF | ~3 TB | Creative Commons 4.0 |
Imagery | NOAA | Aerial (RGB) | GeoTIFF | ~60 GB | US Government Works |
Damage Annotations | TOMNOD | Crowdsourced | Vector | ~1 MB | Creative Commons 4.0 |
Damage Annotations | FEMA | Assessed by FEMA | Vector | ~20 MB | US Government Works |
Building Footprints | Oak Ridge National Lab | Proprietary Algorithm | Vector | ~2 GB | US Government Works |
Building Footprints | Microsoft | Proprietary Algorithm | Vector | ~3 GB | Open Data Commons Open Database License |
Parcel Data* | Affected County Appraisal Districts | Assessed by Appraisers | Vector | ~1 GB | Variable |
*Parcel data was collected by contacting each County Appraisal District Office.
This project utilized DigitalGlobe data, which included GeoTIFF images of Hurricane Harvey and a GeoJSON of volunteer crowdsourced damage annotations from TOMNOD, as well as aerial imagery from the National Oceanic and Atmospheric Administration (NOAA).
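As a rough illustration of how these two pieces fit together, the sketch below loads a post-event GeoTIFF tile with `rasterio` and the TOMNOD annotations with `geopandas`, then reprojects the points into the imagery's coordinate reference system. The file names are hypothetical placeholders, not the actual dataset names.

```python
import rasterio
import geopandas as gpd

# Hypothetical paths; the actual DigitalGlobe tile and TOMNOD export
# names depend on how the data was downloaded.
IMAGE_PATH = "digitalglobe_harvey_tile.tif"
ANNOTATIONS_PATH = "tomnod_damage_annotations.geojson"

with rasterio.open(IMAGE_PATH) as src:
    print("CRS:", src.crs)                      # coordinate reference system
    print("Bounds:", src.bounds)                # spatial extent of the tile
    print("Size:", src.width, "x", src.height)  # pixel dimensions

    # Load the crowdsourced damage points and reproject them to the
    # imagery's CRS so they can be overlaid on the tile.
    annotations = gpd.read_file(ANNOTATIONS_PATH).to_crs(src.crs)

print(annotations.head())
```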
As our goal was to train a model to automatically detect damage in post-hurricane satellite imagery, we needed to gather all the data required for our chosen machine learning algorithms: SSD and Faster R-CNN. Since both require bounding boxes around the annotated damage points, we gathered parcel data and building footprints from the affected counties to create an additional layer from which the bounding boxes for the features (damaged and undamaged buildings) could be generated, as sketched below. Find more details on the data collection process here.
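One possible way to derive those bounding boxes is sketched below: spatially join the TOMNOD damage points to the building footprint polygons, label each footprint as damaged or undamaged, and convert the footprint bounds into pixel-space boxes on an imagery tile. The file paths and the `label` column are assumptions for illustration; this is not the project's exact pipeline.

```python
import rasterio
import geopandas as gpd

# Hypothetical paths; the real footprint and annotation files come from the
# sources listed in the table above.
FOOTPRINTS_PATH = "building_footprints.geojson"
ANNOTATIONS_PATH = "tomnod_damage_annotations.geojson"
IMAGE_PATH = "digitalglobe_harvey_tile.tif"

footprints = gpd.read_file(FOOTPRINTS_PATH)
points = gpd.read_file(ANNOTATIONS_PATH).to_crs(footprints.crs)

# Footprints containing at least one damage point are labeled "damaged";
# all others are labeled "undamaged". (Older geopandas versions use `op=`
# instead of `predicate=`.)
damaged_idx = gpd.sjoin(footprints, points, predicate="contains").index.unique()
footprints["label"] = footprints.index.isin(damaged_idx)
footprints["label"] = footprints["label"].map({True: "damaged", False: "undamaged"})

# Convert each footprint's geographic bounds into pixel-space bounding boxes
# (col_min, row_min, col_max, row_max) on the imagery tile.
with rasterio.open(IMAGE_PATH) as src:
    fp = footprints.to_crs(src.crs)
    boxes = []
    for geom, label in zip(fp.geometry, fp["label"]):
        minx, miny, maxx, maxy = geom.bounds
        row_max, col_min = src.index(minx, miny)  # lower-left corner
        row_min, col_max = src.index(maxx, maxy)  # upper-right corner
        boxes.append((col_min, row_min, col_max, row_max, label))

print(boxes[:5])
```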
Now for a closer look at the imagery data: