Machine learning in medicine: a practical introduction

Sidey-Gibbons, Jenni A. M.; Sidey‐Gibbons, Chris

doi:10.1186/s12874-019-0681-4

Article · BMC Medical Research Methodology · Mar 19, 2019 · Gold OA

University of Cambridge · Brigham and Women's Hospital · +1 more institution

Indexed in: Crossref, DOAJ, PubMed

Abstract

Background

Following visible successes on a wide range of predictive tasks, machine learning techniques are attracting substantial interest from medical researchers and clinicians. We address the need for capacity development in this area by providing a conceptual introduction to machine learning alongside a practical guide to developing and evaluating predictive algorithms using freely available open-source software and public domain data.

Methods

We demonstrate the use of machine learning techniques by developing three predictive models for cancer diagnosis using descriptions of nuclei sampled from breast masses. These algorithms include regularized General Linear Model regression (GLMs), Support Vector Machines (SVMs) with a radial basis function kernel, and single-layer Artificial Neural Networks. The publicly available dataset describing the breast mass samples (N=683) was randomly split into evaluation (n=456) and validation (n=227) samples. We trained algorithms on data from the evaluation sample before they were used to predict the diagnostic outcome in the validation sample. We compared the predictions made on the validation sample with the real-world diagnostic decisions to calculate the accuracy, sensitivity, and specificity of the three models. We explored the use of averaging and voting ensembles to improve predictive performance. We provide a step-by-step guide to developing algorithms using the open-source R statistical programming environment.
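
The evaluation steps named in the Methods (a random split, accuracy, sensitivity and specificity, and voting or averaging ensembles) can be illustrated with a minimal sketch. It is written in Python with the standard library only, whereas the paper itself works in R, and it is not the authors' code. The function names (`split_indices`, `diagnostic_metrics`, `majority_vote`, `average_ensemble`), the seed, the 0.5 threshold, and the toy labels are all assumptions made for illustration; only the sample sizes come from the abstract.

```python
import random


def split_indices(n_total, n_train, seed=0):
    """Randomly partition range(n_total) into n_train training indices and the rest."""
    indices = list(range(n_total))
    random.Random(seed).shuffle(indices)
    return indices[:n_train], indices[n_train:]


def diagnostic_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity and specificity from paired true/predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # assumes at least one positive case
        "specificity": tn / (tn + fp),  # assumes at least one negative case
    }


def majority_vote(*model_predictions):
    """Voting ensemble over 0/1 labels: predict 1 when more than half the models do."""
    return [1 if 2 * sum(votes) > len(votes) else 0 for votes in zip(*model_predictions)]


def average_ensemble(*model_probabilities, threshold=0.5):
    """Averaging ensemble: mean predicted probability per case, then threshold."""
    return [
        1 if sum(probs) / len(probs) >= threshold else 0
        for probs in zip(*model_probabilities)
    ]


# Same sample sizes as the abstract: 683 cases, 456 for training, 227 held out.
train_idx, holdout_idx = split_indices(683, 456)

# Invented toy labels (1 = malignant, 0 = benign), NOT taken from the dataset.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
model_a = [1, 1, 0, 0, 0, 1, 0, 1]
model_b = [1, 0, 1, 0, 0, 0, 0, 1]
model_c = [1, 1, 1, 0, 1, 0, 0, 0]

single = diagnostic_metrics(y_true, model_a)
voted = diagnostic_metrics(y_true, majority_vote(model_a, model_b, model_c))
```

In this contrived example the three hypothetical models err on different cases, so the majority vote recovers every label while `model_a` alone does not. That is a property of the invented data, not a result reported by the paper.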

Citation impact

Total citations: 1,261

FWCI: 68.77
Percentile: 100%
References: 40

Keywords

Machine learning
Artificial intelligence
Computer science
Support vector machine
Artificial neural network
Cross-validation
Sensitivity (control systems)
Data mining
