This dataset was curated from the Bing search logs (desktop users only) over the period of Jan 1st, 2020 – (Current Month - 1). Only searches that were issued many times by multiple users were included. The dataset includes queries from all over the world that had an intent related to the Coronavirus or Covid-19. In some cases this intent is explicit in the query itself, e.g. “Coronavirus updates Seattle”; in other cases it is implicit, e.g. “Shelter in place”. The implicit intent of search queries (e.g. “Toilet paper”) was extracted using the random walks on the click graph approach, as outlined in this paper by Microsoft Research. All personal data was removed.
Inside the data folder there is a folder named 2020 (for the year), which contains two kinds of files.
- QueriesByCountry_DateRange.tsv : A tab-separated text file that contains queries with Coronavirus intent by Country.
- QueriesByState_DateRange.tsv : A tab-separated text file that contains queries with Coronavirus intent by State.
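As a minimal sketch of how these files might be located after cloning the repository, the snippet below globs for both kinds of file. It assumes that `DateRange` in the file names stands for a varying date suffix; the function name `find_query_files` and its parameters are illustrative and not part of the dataset.

```python
from pathlib import Path


def find_query_files(data_root, year="2020"):
    """Return (country_files, state_files) found under data_root/year.

    Assumption: file names start with QueriesByCountry_ or QueriesByState_
    and end in .tsv; the part in between is treated as an opaque date range.
    """
    folder = Path(data_root) / year
    country_files = sorted(folder.glob("QueriesByCountry_*.tsv"))
    state_files = sorted(folder.glob("QueriesByState_*.tsv"))
    return country_files, state_files
```

Calling `find_query_files("data")` from the repository root would then return two sorted lists of `Path` objects, one per kind of file.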
Fields in QueriesByCountry_DateRange.tsv:
Date : string, Date on which the query was issued.
Query : string, The actual search query issued by user(s).
IsImplicitIntent : bool, true if the query did not mention covid, coronavirus, or sarsncov2 (e.g. “Shelter in place”), false otherwise.
Country : string, Country from where the query was issued.
PopularityScore : int, value between 1 and 100. 1 indicates the least popular query with Coronavirus intent for that Country on that day, and 100 indicates the most popular query for the same Country on the same day.
Fields in QueriesByState_DateRange.tsv:
Date : string, Date on which the query was issued.
Query : string, The actual search query issued by user(s).
IsImplicitIntent : bool, true if the query did not mention covid, coronavirus, or sarsncov2 (e.g. “Shelter in place”), false otherwise.
State : string, State from where the query was issued.
Country : string, Country from where the query was issued.
PopularityScore : int, value between 1 and 100. 1 indicates the least popular query with Coronavirus intent for that State/Country on that day, and 100 indicates the most popular query for the same geography on the same day.
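The fields above can be read with Python's standard `csv` module. The following is a sketch under stated assumptions, not a reference loader: it assumes each file starts with a header row using the field names listed above, that booleans are written as `true`/`false`, and that dates look like `2020-03-20`. The sample rows are invented for illustration (the scores are not real data), and the helper names `read_queries` and `top_implicit` are hypothetical.

```python
import csv
import io

# Invented sample rows for illustration only; the scores are not real data.
SAMPLE_TSV = (
    "Date\tQuery\tIsImplicitIntent\tState\tCountry\tPopularityScore\n"
    "2020-03-20\tcoronavirus updates seattle\tfalse\tWashington\tUnited States\t100\n"
    "2020-03-20\tshelter in place\ttrue\tWashington\tUnited States\t40\n"
    "2020-03-20\ttoilet paper\ttrue\tWashington\tUnited States\t1\n"
)


def read_queries(handle):
    """Parse a tab-separated query file into a list of typed dicts."""
    rows = []
    for row in csv.DictReader(handle, delimiter="\t"):
        # Assumption: booleans are serialized as "true"/"false" (any case).
        row["IsImplicitIntent"] = row["IsImplicitIntent"].strip().lower() == "true"
        row["PopularityScore"] = int(row["PopularityScore"])
        rows.append(row)
    return rows


def top_implicit(rows, n=10):
    """Return the n most popular implicit-intent queries, highest score first."""
    implicit = [r for r in rows if r["IsImplicitIntent"]]
    return sorted(implicit, key=lambda r: r["PopularityScore"], reverse=True)[:n]


rows = read_queries(io.StringIO(SAMPLE_TSV))
for r in top_implicit(rows, n=2):
    print(r["Query"], r["PopularityScore"])
```

To read a real file instead of the inline sample, pass a handle opened with `open(path, encoding="utf-8", newline="")` to `read_queries`. The same code works for the country-level files, since it does not rely on the State column.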
New data will be added once a month for as long as the Coronavirus remains in the news.
Please see the LICENSE file for more details. If you choose to use the data, please attribute it to Microsoft as follows: "Data Source: Bing Coronavirus Query set (https://github.com/microsoft/BingCoronavirusQuerySet)".
This project is not open for contributions.
This project has adopted the Microsoft Open Source Code of Conduct.
For more information see the Code of Conduct FAQ or
contact [email protected] with any additional questions or comments.