Skip to content

LandCoverNet Training Labels v1.0

LandCoverNet is a global annual land cover classification training dataset with labels for the multi-spectral satellite imagery from Sentinel-1, Sentinel-2 and Landsat-8 missions in 2018. Image chips of 256 x 256 pixels in LandCoverNet spanning across multiple tiles. Each image chip contains temporal observations from the following satellite products with an annual class label, all stored in raster format (GeoTIFF files):

  • Sentinel-1 ground range distance (GRD) with radiometric calibration and orthorectification at 10m spatial resolution
  • Sentinel-2 surface reflectance product (L2A) at 10m spatial resolution
  • Landsat-8 surface reflectance product from Collection 2 Level-2

Radiant Earth Foundation designed and generated this dataset with a grant from Schmidt Futures with additional support from NASA ACCESS, Microsoft AI for Earth and in kind technology support from Sinergise.

One of the strongest feature of this dataset is Consensus labeling where each image chip was validated by three independent users. The accuracy of each user was assessed using chips that were separately labeled by experts from Radiant Earth’s team. To generate the consensus label for each pixel a Bayesian model averaging approach was implemented taking into account the accuracy of each user. The resulting labels are accompanied by a “consensus score” between 0 and 100 which indicates the degree of agreement among the three users. This forms b2 for the dataset while b1 is the class value.

You can read a sample detailed methodology here and you can go to the sample dataset page here. You can read about the approach in the paper here

Tutorials on this can be further accessed here

Disclaimer: Whole or parts of the dataset description were provided by the author(s) or their works.


Alemohammad, Hamed, and Kevin Booth. "LandCoverNet: A global benchmark land cover classification training dataset."
arXiv preprint arXiv:2012.03111 (2020).

Dataset Citation

Alemohammad S.H., Ballantyne A., Bromberg Gaber Y., Booth K., Nakanuku-Diggs L., & Miglarese A.H. (2020) "LandCoverNet Africa: A Geographically
Diverse Land Cover Classification Training Dataset", Version 1.0, Radiant MLHub. [Date Accessed]

Data structure and preprocessing

The datasets are provided as raster chips with 256 x 256 pixel resolution meaning a total of 65,536 pixels. Overall dataset distribution per region is summarized below

Abbreviation Image Chip Count Ref ID Region Proportion Global
AU 600 ref_landcovernet_au_v1_labels Australia 6.72
AS 2753 ref_landcovernet_as_v1_labels Asia 30.81
AF 1980 ref_landcovernet_af_v1_labels Africa 22.16
EU 840 ref_landcovernet_eu_v1_labels Europe 9.4
NA 1561 ref_landcovernet_na_v1_labels North America 17.47
SA 1200 ref_landcovernet_sa_v1_labels South America 13.43

The datasets do consist of a STAC representation and while the command line tool is the way to access this data, I wrote some custom script for parsing the properties in STAC metadata as well as to download the raster objects and the source imagery CSVs for use as Google Earth Engine assset level property.

Retained metadata includes date which is used for both start and end date.

Additional Metadata fields

  • Distinct Classes: There is a class count as part of the property of the STAC metadata for each tile, in this case a simple count was used to estimate how many distinct classes were available for chip and how that might effect your overall training.
  • system:index/id_no was the name of the tile retained from the imagery as is and has not been modified
  • source_imagery_start/end: Using the source imagery CSV files source imagery start and end dates were selected post sorting of the dates per chip and added to allow for creating a time filter.
  • source_imagery_datelist: This is the complete source imagery date list you can use the list to identify and select specific days of S2 imagery from those days for analysis or comparison.


Earth Engine Snippet

var au = ee.ImageCollection("projects/sat-io/open-datasets/LandCoverNet/LABELS/ref_landcovernet_au_v1_labels");
var af = ee.ImageCollection("projects/sat-io/open-datasets/LandCoverNet/LABELS/ref_landcovernet_af_v1_labels");
var as = ee.ImageCollection("projects/sat-io/open-datasets/LandCoverNet/LABELS/ref_landcovernet_as_v1_labels");
var eu = ee.ImageCollection("projects/sat-io/open-datasets/LandCoverNet/LABELS/ref_landcovernet_eu_v1_labels");
var na = ee.ImageCollection("projects/sat-io/open-datasets/LandCoverNet/LABELS/ref_landcovernet_na_v1_labels");
var sa = ee.ImageCollection("projects/sat-io/open-datasets/LandCoverNet/LABELS/ref_landcovernet_sa_v1_labels");

Sample Code:


The dataset is released under (CC BY 4.0) license. You can find license summary here

Produced by: Radiant Earth Foundation

Curated in GEE by: Samapriya Roy

Keywords: Land Use, Land Cover, Remote Sensing, landsat-8, sentinel-1, sentinel-2, segmentation

Last updated on GEE: 2022-07-17