In anticipation of his upcoming keynote presentation at Predictive Analytics World Healthcare, Las Vegas, June 19-24, 2022, we asked David Talby, PhD, Chief Technology Officer at John Snow Labs, a few questions about their deployment of predictive analytics. Catch a glimpse of his keynote presentation, New Frontiers in Applied Natural Language Processing in Healthcare, and see what’s in store at the Predictive Analytics World Healthcare conference.
Q: In your work with predictive analytics, what behavior or outcome do your models predict?
A: At John Snow Labs, our Spark NLP library offers state-of-the-art clinical and biomedical Natural Language Processing (NLP) models that can be used to predict and optimize healthcare outcomes. Some of these use cases include accelerating clinical trials, predicting disease progression, providing clinical decision support, analyzing real-world evidence, detecting and preventing adverse drug events (ADE), and creating a more comprehensive view of each patient’s journey.
Q: How does predictive analytics deliver value at your organization – what is one specific way in which it actively drives decisions or operations?
A: The biggest advantage of NLP models in the Spark NLP library is that they enable healthcare organizations to see a more holistic view of their patients, providers, operations, and costs. The technology achieves this by bridging the gap between structured data — claims, electronic medical records — and unstructured data — free-text clinical notes, pathology and radiology reports, lab results, research papers, clinical trial documents, social media posts, etc. Serving as the connective tissue, Spark NLP can accurately understand information from all of these unstructured formats and systems to create a clearer, more accurate picture. This enables data scientists or domain experts, in this case clinicians, to make better decisions, whether it be a form of recommended treatment or how to appropriately staff an emergency room.
Q: Can you describe a quantitative result, such as the predictive lift of your model or the ROI of an analytics initiative?
A: Used by 33% of all enterprise AI teams and 59% of healthcare AI teams, Spark NLP has secured the spot as the most widely used NLP library in the enterprise in just 6 years on the market. Additionally, Spark NLP for Healthcare has established new state-of-the-art accuracy in peer-reviewed papers on public academic benchmarks over the last year. This includes providing the most accurate models ever for medical named entity recognition, relation extraction, assertion status detection, and ADE detection. Customers include half of the world’s top 10 pharmaceutical companies, the 5 largest US payers, and 5 of the largest US healthcare systems, among others.
A few examples of what organizations have been able to achieve working with John Snow Labs include:
Q: What surprising discovery or insight have you unearthed in your data?
A: As the field of AI matures, we now have cases in which algorithms are bypassing human accuracy in certain tasks. One example is de-identification, or anonymizing data automatically. But how do you know when this is happening? All you can really do is compare what the algorithm says to what the human expert says and analyze the differences between them. As such, when the model disagrees with your validated data set you need to start going one-by-one through the disagreements to see which is right. This is not uncommon: some of the most well-known academic datasets have material well-known mistakes.
When the state-of-the-art accuracy of an NLP model is 92%, and it is estimated that 4% of the labels are incorrect, then roughly half of the remaining error is matching incorrect labels. This means that it’s very likely that current state-of-the-art models already learned a lot of incorrect labels. This is not necessarily a problem, but it should be taken into consideration when setting expectations and remembering that — at least for now — AI is an imperfect science.
Q: Sneak preview: Please tell us a take-away that you will provide during your talk at Predictive Analytics World.
A: This session will help attendees understand what is possible right now for NLP in healthcare, as told by early adopters in the industry. As new advances in NLP have started to move from research to real-world production implementations, practitioners have learned important lessons along the way. From better answering medical questions and enabling real-world data, to predicting patient outcomes and population health, we can start learning from how some of the largest healthcare systems and pharmaceutical companies in the US are putting NLP to good use.
Don’t miss David’s keynote presentation, New Frontiers in Applied Natural Language Processing in Healthcare, Tuesday, June 21, 2022, from 1:30 pm to 2:15 pm. Click here to register for attendance.
The Machine Learning Times © 2020 • 1221 State Street • Suite 12, 91940 •
Santa Barbara, CA 93190
Produced by: Rising Media & Prediction Impact