Respiratory syncytial virus tracking using internet search engine data
Affiliation
Univ Arizona, Coll Publ Hlth, Div Epidemiol & Biostat, Tucson, AZ 85721 USAIssue Date
2018-04-03
Metadata
Show full item recordPublisher
BIOMED CENTRAL LTDCitation
Oren et al. BMC Public Health (2018) 18:445 https://doi.org/10.1186/s12889-018-5367-zJournal
BMC PUBLIC HEALTHRights
© The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Background: Respiratory Syncytial Virus (RSV) is the leading cause of hospitalization in children less than 1 year of age in the United States. Internet search engine queries may provide high resolution temporal and spatial data to estimate and predict disease activity. Methods: After filtering an initial list of 613 symptoms using high-resolution Bing search logs, we used Google Trends data between 2004 and 2016 for a smaller list of 50 terms to build predictive models of RSV incidence for five states where long-term surveillance data was available. We then used domain adaptation to model RSV incidence for the 45 remaining US states. Results: Surveillance data sources (hospitalization and laboratory reports) were highly correlated, as were laboratory reports with search engine data. The four terms which were most often statistically significantly correlated as time series with the surveillance data in the five state models were RSV, flu, pneumonia, and bronchiolitis. Using our models, we tracked the spread of RSV by observing the time of peak use of the search term in different states. In general, the RSV peak moved from south-east (Florida) to the north-west US. Conclusions: Our study represents the first time that RSV has been tracked using Internet data results and highlights successful use of search filters and domain adaptation techniques, using data at multiple resolutions. Our approach may assist in identifying spread of both local and more widespread RSV transmission and may be applicable to other seasonal conditions where comprehensive epidemiological data is difficult to collect or obtain.Note
Open access journal.ISSN
1471-2458PubMed ID
29615018Version
Final published versionae974a485f413a2113503eed53cd6c53
10.1186/s12889-018-5367-z
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License.
Related articles
- Using Google Trends to Predict Pediatric Respiratory Syncytial Virus Encounters at a Major Health Care System.
- Authors: Crowson MG, Witsell D, Eskander A
- Issue date: 2020 Jan 30
- Correlation between respiratory syncytial virus (RSV) test data and hospitalization of children for RSV lower respiratory tract illness in Florida.
- Authors: Light M, Bauman J, Mavunda K, Malinoski F, Eggleston M
- Issue date: 2008 Jun
- Association between respiratory syncytial virus activity and pneumococcal disease in infants: a time series analysis of US hospitalization data.
- Authors: Weinberger DM, Klugman KP, Steiner CA, Simonsen L, Viboud C
- Issue date: 2015 Jan
- Respiratory syncytial virus activity-- United States, July 2007-December 2008.
- Authors: Centers for Disease Control and Prevention (CDC)
- Issue date: 2008 Dec 19
- Predicting Lead-Time RSV-Related Pediatric Hospitalizations From Historic Google Trend Search.
- Authors: Bhatt P, Parmar N, Parekh T, Pemmasani S, Shaikh N, Boateng AJ, Donda K, Doshi H, Dapaah-Siakwan F
- Issue date: 2023 Nov 1