Data Mining as an Essential Informatics Skill Set

November 30, 2012
4 Views

Clinical Integrated Data Repositories are now become common at academic medical centers. With tools like i2b2 and RemedyMD, plus a broad range of analytic tools, access to large volumes of clinical data for research and population management is coming to maturity. The opportunities for use of this data in enabling clinical trials and accelerating research are promising.

Clinical Integrated Data Repositories are now become common at academic medical centers. With tools like i2b2 and RemedyMD, plus a broad range of analytic tools, access to large volumes of clinical data for research and population management is coming to maturity. The opportunities for use of this data in enabling clinical trials and accelerating research are promising. Quality and patient safety can also be enhanced through use of electronic medical records; a recent New England Journal of Medicine article by Dean Sittig details how to “Use EHRs to Monitor and Improve Patient Safety.”  ”Organizations must leverage EHRs to facilitate rapid detection of common errors (including EHR-related errors), to monitor the occurrence of high-priority safety events, and to more reliably track trends over time.”

To maximize these opportunities, physicians and other health professionals must develop skills in understanding and utilizing this data. Medical informatics has been successful in developing tools for data mining, but translating raw data into research questions and disease trends requires training medical professionals in new ways of thinking. Understanding clinical workflow in an EMR does not directly translate into this type of research. One must understand how the data is organized and coded to create disease cohorts for analysis. Informaticists are key in training a new generation of physicians in this skill. Because of the complexity of this clinical data, there are three approaches to this data mining and analysis:

  1. Self-service data mining enabled by cohort definition tools, both vendor developed and open source
  2. Analyst provided data – skilled data analysts can pull relevant data sets based on their understanding of the research question and the data. However, there are limitations on the number of experienced data analyst any organization can afford to meet the coming demand
  3. Predictive analytics – this is the realm of the biostatistician who will be key consumers of large data sets to create predictive models to be used in clinical practice. This is also a limited resource, so prioritizing predictive modeling projects which major impact is key

Data mining and analytics should be taught in medical schools for the next generation of providers.  Data visualization will be helpful in exploring this complex, big data. More on this in a future post.

You may be interested

3 Surprising Facts About the American Healthcare System
Health care
0 shares208 views
Health care
0 shares208 views

3 Surprising Facts About the American Healthcare System

Ryan Kh - May 24, 2017

The status of American healthcare has been in the news frequently over the past several months due to the new…

SEO vs Paid Search vs Social Media – Which is Better for Healthcare Marketing
eHealth
0 shares350 views
eHealth
0 shares350 views

SEO vs Paid Search vs Social Media – Which is Better for Healthcare Marketing

Rehan Ijaz - May 23, 2017

Digital media has transformed marketing practices in just about every industry. Healthcare marketing has surprisingly been affected by the digital…

New Protein Could Aid in Therapy for Mesothelioma
Medical Innovations
0 shares1813 views
Medical Innovations
0 shares1813 views

New Protein Could Aid in Therapy for Mesothelioma

jennacyprus - May 22, 2017

In the world of oncology, there’s a common consensus that cancer rarely has a single cause. Instead, the majority of…