Assessing the quality of annotations in asthma gene expression experiments
| dc.contributor.author | Lacson, Ronilda | |
| dc.contributor.author | Mbagwu, Michael | |
| dc.contributor.author | Yousif, Hisham | |
| dc.contributor.author | Ohno-Machado, Lucila | |
| dc.date.accessioned | 2016-05-20T09:00:07Z | |
| dc.date.available | 2016-05-20T09:00:07Z | |
| dc.date.issued | 2010 | en |
| dc.identifier.citation | Lacson et al. BMC Bioinformatics 2010, 11(Suppl 9):S8 http://www.biomedcentral.com/1471-2105/11/S9/S8 | en |
| dc.identifier.doi | 10.1186/1471-2105-11-S9-S8 | en |
| dc.identifier.uri | http://hdl.handle.net/10150/610165 | |
| dc.description.abstract | BACKGROUND:The amount of data deposited in the Gene Expression Omnibus (GEO) has expanded significantly. It is important to ensure that these data are properly annotated with clinical data and descriptions of experimental conditions so that they can be useful for future analysis. This study assesses the adequacy of documented asthma markers in GEO. Three objective measures (coverage, consistency and association) were used for evaluation of annotations contained in 17 asthma studies.RESULTS:There were 918 asthma samples with 20,640 annotated markers. Of these markers, only 10,419 had documented values (50% coverage). In one study carefully examined for consistency, there were discrepancies in drug name usage, with brand name and generic name used in different sections to refer to the same drug. Annotated markers showed adequate association with other relevant variables (i.e. the use of medication only when its corresponding disease state was present).CONCLUSIONS:There is inadequate variable coverage within GEO and usage of terms lacks consistency. Association between relevant variables, however, was adequate. | |
| dc.language.iso | en | en |
| dc.publisher | BioMed Central | en |
| dc.relation.url | http://www.biomedcentral.com/1471-2105/11/S9/S8 | en |
| dc.rights | © 2010 Lacson et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0). | en |
| dc.rights.uri | https://creativecommons.org/licenses/by/2.0/ | |
| dc.title | Assessing the quality of annotations in asthma gene expression experiments | en |
| dc.type | Article | en |
| dc.identifier.eissn | 1471-2105 | en |
| dc.contributor.department | Decision Systems Group, Brigham & Women’s Hospital, Harvard Medical School, Boston, MA, USA | en |
| dc.contributor.department | Ohio State University, Columbus, OH, USA | en |
| dc.contributor.department | University of Arizona, Tucson, AZ, USA | en |
| dc.contributor.department | Division of Biomedical Informatics, University of California San Diego, La Jolla, CA, USA | en |
| dc.identifier.journal | BMC Bioinformatics | en |
| dc.description.collectioninformation | This item is part of the UA Faculty Publications collection. For more information this item or other items in the UA Campus Repository, contact the University of Arizona Libraries at repository@u.library.arizona.edu. | en |
| dc.eprint.version | Final published version | en |
| refterms.dateFOA | 2018-09-11T10:51:14Z | |
| html.description.abstract | BACKGROUND:The amount of data deposited in the Gene Expression Omnibus (GEO) has expanded significantly. It is important to ensure that these data are properly annotated with clinical data and descriptions of experimental conditions so that they can be useful for future analysis. This study assesses the adequacy of documented asthma markers in GEO. Three objective measures (coverage, consistency and association) were used for evaluation of annotations contained in 17 asthma studies.RESULTS:There were 918 asthma samples with 20,640 annotated markers. Of these markers, only 10,419 had documented values (50% coverage). In one study carefully examined for consistency, there were discrepancies in drug name usage, with brand name and generic name used in different sections to refer to the same drug. Annotated markers showed adequate association with other relevant variables (i.e. the use of medication only when its corresponding disease state was present).CONCLUSIONS:There is inadequate variable coverage within GEO and usage of terms lacks consistency. Association between relevant variables, however, was adequate. |

