Frequently Asked Questions

Data is aggregated from many different data sources. The majority of the data sources come from government portals and health authorities. We also have data from journalistic data sources as well as crowd-sourced efforts.
The aggregated output is licensed under the CC-BY-4.0 license, but each individual data source has its own license. For a list of data sources and their respective licenses, see the documentation for the table you are interested in using. For example, here are the epidemiology table data sources.
Different data sources have different update schedules. The dataset is synchronized with all data sources and updated no less than once per day.
All of the code used to process the data is available in our Github repository.

If you are using this dataset in derivative works, please cite our dataset as per the CC-BY-4.0 license. We ask that you use the name "Google COVID-19 Open Data" with the link goo.gle/covid-19-open-data.

If you are using this dataset in a research publication, you can use the following for citation in bibtex format:

@article{Wahltinez2020,
  author = "O. Wahltinez and others",
  year = 2022,
  title = "COVID-19 Open-Data a global-scale spatially granular meta-dataset for coronavirus disease",
  note = "",
  url = {https://goo.gle/covid-19-open-data},
}
The output data files are published under the CC-BY-4.0 license. All data is subject to the terms of agreement individual to each data source; refer to the Sources of data table for more details. All other code and assets are published under the Apache License 2.0. The data produced by third parties and made available by the COVID-19 Open Data Repository is subject to the license terms from the original third-party authors. We will always indicate the original source of the data in the database, and you should always check the license of any such third-party data before use.