VFDB 2016: hierarchical and refined dataset for big data analysis—10 years on
Chinese Academy of Medical Sciences & Peking Union Medical College
Abstract
The virulence factor database (VFDB, http://www.mgc.ac.cn/VFs/) is dedicated to providing up-to-date knowledge of virulence factors (VFs) of various bacterial pathogens. Since its inception, the VFDB has served as a comprehensive repository of bacterial VFs for over a decade. The exponential growth of biological data challenges the current database with regard to big data analysis. We recently improved two aspects of the infrastructural dataset of VFDB: (i) we removed the redundancy introduced by previous releases and generated two hierarchical datasets, a core dataset containing only experimentally verified VFs and a full dataset including all known and predicted VFs, and (ii) we refined the gene…
Citation impact
- FWCI: 29.81
- Percentile: 100%
- References: 17
Authors
- Lihong Chen (corresponding author)
Chinese Academy of Medical Sciences & Peking Union Medical College
- Dandan Zheng
Chinese Academy of Medical Sciences & Peking Union Medical College
- Bo Liu
Chinese Academy of Medical Sciences & Peking Union Medical College
- Jian Yang
Chinese Academy of Medical Sciences & Peking Union Medical College
- Qi Jin
Chinese Academy of Medical Sciences & Peking Union Medical College
Topics & keywords
- Annotation
- Biology
- Big data
- Redundancy (engineering)
- Exponential growth
- Usability
- Database
- Data mining