Clinical data science when your patients are kids | Nikolay Braykov | Data Science Hangout

Transcript

This transcript was generated automatically and may contain errors.

Hey there, welcome to the Posit Data Science Hangout. I'm Libby Herron, and this is a recording of our weekly community call that happens every Thursday at 12pm US Eastern Time. If you are not joining us live, you miss out on the amazing chat that's going on. So find the link in the description where you can add our call to your calendar and come hang out with the most supportive, friendly, and funny data community you'll ever experience.

I think we can go ahead and introduce our featured leader for today, Nikolay Braykov, Manager of Advanced Analytics and Outcomes at Children's Healthcare of Atlanta. Nikolay, it's so great to have you here today. Could you introduce yourself? Tell us a little bit about what you do, maybe a little bit about Children's Healthcare of Atlanta and something you like to do for fun.

Hey everyone, good afternoon. Thanks for the opportunity to be here. So my name is Nikolay. I manage our data science team here at Children's Healthcare of Atlanta. We are one of the largest pediatric health systems in the country, the largest in Georgia and one of the largest in the Southeast. My team is part of a bigger data and analytics group, or rather we sit alongside data and analytics under the IT part of the organization. We're officially called the Advanced Analytics Team, and we take on essentially any kind of data science question, things that go beyond reporting.

And most of our questions or problems revolve around healthcare quality, or clinical and operational questions. We're not a research team. We're very focused on applied data science. And I'd say there are three main pillars of what we do. The traditional advanced analytics and data science work is, as I said, anything that our reporting teams can't handle: something that requires more advanced data visualization, or data wrangling from different sources. And then eventually you might have some hypothesis testing or some kind of statistical analysis that needs to get done. So that's where we come in.

The other pillar is machine learning and the evaluation and also the development of custom predictive models. Most of those are around clinical decision support. And so we partner closely with clinical informaticists. Those are the folks that can build things in our electronic health record. And the third pillar, this is a relatively new area, but we also develop custom solutions that might be powered by LLMs. So if there are any applications that sort of lean on large language models and do, for example, NLP kind of work, we would develop and support these.

So I love being outside. Atlanta is a very green city. So I live on the bike trail. I love going on long bike rides and I've been trying to get into pickleball. I'm still new to it, but it's addictive. There's no other way of saying it. It's easy to get into it and it's such a wonderful community. So that's been kind of my newer hobby. But yeah, I like to stay active and enjoy the outdoors.

So, we can show them what their sensitivity looks like in their cohort. We can show them what their precision looks like. And that's been really important for making some implementation decisions, right?

So, yeah, I think what we call our model evaluation dashboards have been very instrumental in getting buy-in and explaining our models.
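To make the cohort-level metrics concrete, here is a minimal sketch of how sensitivity and precision could be computed per cohort. The cohort names, the record layout, and the function name are all hypothetical and are not taken from the team's actual dashboards.

```python
from collections import defaultdict

# Hypothetical records: (cohort, actual_outcome, model_flagged)
records = [
    ("ED", 1, 1), ("ED", 1, 0), ("ED", 0, 1), ("ED", 0, 0), ("ED", 1, 1),
    ("inpatient", 1, 1), ("inpatient", 0, 0), ("inpatient", 0, 1),
    ("inpatient", 0, 1), ("inpatient", 1, 1),
]

def metrics_by_cohort(rows):
    """Return {cohort: {"sensitivity": ..., "precision": ...}}."""
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0, "tn": 0})
    for cohort, actual, flagged in rows:
        if actual and flagged:
            counts[cohort]["tp"] += 1
        elif flagged:
            counts[cohort]["fp"] += 1
        elif actual:
            counts[cohort]["fn"] += 1
        else:
            counts[cohort]["tn"] += 1

    result = {}
    for cohort, c in counts.items():
        positives = c["tp"] + c["fn"]   # patients who had the outcome
        alerts = c["tp"] + c["fp"]      # patients the model flagged
        result[cohort] = {
            # Sensitivity: share of true cases the model caught.
            "sensitivity": c["tp"] / positives if positives else None,
            # Precision: share of flagged patients who were true cases.
            "precision": c["tp"] / alerts if alerts else None,
        }
    return result

cohort_metrics = metrics_by_cohort(records)
for name, m in cohort_metrics.items():
    print(name, m)
```

Splitting the metrics by cohort matters because a model can look acceptable overall while producing many false alerts in one unit.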

Favorite and most impactful projects

I love data visualization. So, I'd say one of my favorites was that behavioral mental health clinical effectiveness project and creating that dashboard that I was describing, where we can do this clinical effectiveness analysis quickly. Yeah, just because I had a lot of fun visualizing our cohort. There were a lot of assumptions there, because clinical data is complex, or hospital and healthcare visit data is complex. So, we had to create some custom visuals to basically show: this is how we define our index visit for this cohort, this is what all their other visits look like, this is how they meet or don't meet the definitions. So, creating those custom plots was fun for me.

As far as other projects, I think I'm excited for some of the new things we're doing with LLMs. So, we currently have a project where we're building a chart review tool. We're using an LLM, and people can bring their data, bring their cohort. We really underuse the data in notes. So far we've said NLP is out of reach, NLP is too hard, there are quicker things we can do with structured data. But a lot of times there's a lot of rich information in notes that people try to get out in a structured way by doing manual chart reviews and putting things in data collection forms. So, we've been working on a tool that can basically ease that data abstraction burden and give people a first pass: you can conversationally ask for information from a note and extract some structured data.

So, the reason why I'm saying this is one of my favorite projects is that people get pretty excited about it. No one likes chart reviews, so making people's lives easier has been satisfying.

There's another tool that we built. Again, this is more of a Shiny application. In healthcare quality, a lot of times you want to show whether your process is stable. And then if it's not, if there's some kind of special cause variation or some noise in some metric that you're looking at over time, you want to investigate why that's happening. And also, if you do an intervention and you're tracking your outcome measures, you want to be able to show that a change occurred. So looking at that kind of data over time in statistical process control charts is something that's done a lot in healthcare quality. For those of you who are statisticians, I don't know, SPC charts are not really something that we focus on a lot. They're widely used in manufacturing, for example, but they're also used in healthcare quality improvement. So we created an application that helps people derive those charts.

And it's more than just simple data visualization. There are a lot of rules about when a process is considered to be out of control. When you're measuring your process, you derive what's called a center line, which is the average value for, for example, time to getting antibiotics. If that average goes up or down, there are different accepted rules for what is considered special cause variation. So yeah, Shiny was a really great way to create an application to make these charts and enable people to do their own measurement for improvement for these quality improvement projects. I'd say that's probably one of our most used dashboards.
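The center line and special cause rules described above can be sketched roughly as follows. This is an illustrative individuals (XmR) chart calculation with two commonly cited rules, a point outside the control limits and a run of eight consecutive points on one side of the center line. The data are made up, and the rule set is an assumption; the team's Shiny application may use different charts or rules.

```python
from statistics import mean

def xmr_limits(values):
    """Center line and control limits for an individuals (XmR) chart."""
    center = mean(values)
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    mr_bar = mean(moving_ranges)
    # 2.66 = 3 / 1.128, the standard constant for moving ranges of size 2.
    return center, center - 2.66 * mr_bar, center + 2.66 * mr_bar

def points_beyond_limits(values, lower, upper):
    """Indices of points outside the control limits."""
    return [i for i, v in enumerate(values) if v < lower or v > upper]

def shift_runs(values, center, run_length=8):
    """Indices where a run of `run_length` or more consecutive points
    on the same side of the center line is in progress."""
    flagged = []
    run_start, side = 0, 0
    for i, v in enumerate(values):
        current = (v > center) - (v < center)  # +1 above, -1 below, 0 on the line
        if current == 0 or current != side:
            run_start, side = i, current       # the run is broken, start over
        if side != 0 and i - run_start + 1 >= run_length:
            flagged.append(i)
    return flagged

# Made-up minutes to first antibiotic dose: baseline, then post-intervention.
baseline = [60, 62, 58, 61, 59, 63, 60, 57, 62, 58, 61, 59]
follow_up = [55, 54, 56, 53, 55, 52, 54, 50]

center, lower, upper = xmr_limits(baseline)
print("center line:", center, "limits:", round(lower, 2), round(upper, 2))
print("points beyond limits:", points_beyond_limits(follow_up, lower, upper))
print("shift signals:", shift_runs(follow_up, center))
```

Here the limits are computed from the baseline period and then applied to the follow-up points, which is one common way to show that an intervention changed the process.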

Interpretability and black-box models

Yeah, I mean, I think explainability is very important when you're making critical decisions with these models. That being said, there are ways to go beyond logistic regressions and still make your models explainable. There are the more traditional ways of adding explainability to ML models, with SHAP plots and things like this. So there are techniques to kind of un-black-box some of the more complex models, even deep learning models. But I think it's also just a culture shift. My suspicion is that with the advancements in AI, there might be more willingness to embrace a black-box approach, because even the people who build the LLMs aren't always able to trace what the billions of parameters add up to, right?
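SHAP plots are built on Shapley values, which split a prediction into per-feature contributions. As a rough illustration of the idea, the sketch below computes exact Shapley values for a tiny made-up risk function by brute force. It does not use the `shap` library, the model and feature meanings are hypothetical, and replacing absent features with a single baseline value is a simplifying assumption.

```python
from itertools import combinations
from math import factorial

def shapley_values(predict, x, baseline):
    """Exact Shapley values for one prediction.

    Features outside a coalition are set to their baseline value.
    Cost grows exponentially with the number of features.
    """
    n = len(x)
    values = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                with_i = [x[j] if (j in subset or j == i) else baseline[j]
                          for j in range(n)]
                without_i = [x[j] if j in subset else baseline[j]
                             for j in range(n)]
                values[i] += weight * (predict(with_i) - predict(without_i))
    return values

def toy_risk(v):
    """Hypothetical risk score with one interaction term."""
    a, b, c = v
    return 2 * a + 3 * b + a * b - c

patient = [1, 2, 3]
reference = [0, 0, 0]
contributions = shapley_values(toy_risk, patient, reference)
print(contributions)
```

The contributions sum to the difference between the patient's score and the baseline score, which is the property that makes this kind of breakdown useful for explaining a single prediction.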

So I think what was true a few years ago with explainability and opting for simpler models is going to change a little bit. But I think the other part is having the kind of governance in place where people can trust that if there is a tool going in, it is being evaluated, and that you have considered the workflow that people are using. So you're not just thinking that something is useful to predict because you've thought about it in isolation, but you've actually sat down with a clinician and seen what they're trying to accomplish and how this algorithm is going to be helpful to them. And then also having sufficient guardrails around evaluation and monitoring for something once it goes live. For any clinical model, we don't really deploy anything until it's been through this phase of silent evaluation. So something would just have to run silently in the background, and then we would see whether it actually works in production, because things can be tricky to deploy. You might think that it works well when you're training and testing your model, and then, yeah. So moving a little bit more carefully and having these processes in place is critical for buy-in.
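One way to picture silent evaluation is a scorer that records what the model would have done without surfacing anything to clinicians, and is later checked against observed outcomes. The sketch below is purely illustrative: the class name, threshold, and encounter IDs are hypothetical, and it is not a description of how Children's implements this in their electronic health record.

```python
from dataclasses import dataclass, field

@dataclass
class SilentModeLog:
    """Records model scores in the background; never raises an alert."""
    threshold: float
    records: dict = field(default_factory=dict)

    def score(self, encounter_id, risk):
        # Store what the model would have done, but return nothing,
        # so no alert reaches the clinical workflow.
        self.records[encounter_id] = {
            "risk": risk,
            "would_alert": risk >= self.threshold,
            "outcome": None,
        }
        return None

    def record_outcome(self, encounter_id, outcome):
        self.records[encounter_id]["outcome"] = bool(outcome)

    def summary(self):
        """Performance on encounters whose outcome is already known."""
        resolved = [r for r in self.records.values() if r["outcome"] is not None]
        alerts = sum(r["would_alert"] for r in resolved)
        events = sum(r["outcome"] for r in resolved)
        true_alerts = sum(r["would_alert"] and r["outcome"] for r in resolved)
        return {
            "resolved": len(resolved),
            "sensitivity": true_alerts / events if events else None,
            "precision": true_alerts / alerts if alerts else None,
        }

log = SilentModeLog(threshold=0.5)
for encounter, risk in [("e1", 0.9), ("e2", 0.2), ("e3", 0.7), ("e4", 0.4), ("e5", 0.8)]:
    log.score(encounter, risk)
for encounter, outcome in [("e1", True), ("e2", False), ("e3", False), ("e4", True)]:
    log.record_outcome(encounter, outcome)

silent_summary = log.summary()
print(silent_summary)
```

Comparing a summary like this with the numbers seen during training and testing is one way to decide whether the model is ready to go live.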
