The Client & the Challenge
One of the most trusted names in the development of healthcare IT systems, this US-based healthcare RPA company wanted us to extract pertinent information from insurance cards.
The global healthcare IT market size was valued at USD 135.6 billion in 2021 and is projected to exhibit a CAGR of 29.3% in the forecast period. A growing trend of digitalization in healthcare, rising demand for preventive care solutions, and the emergence of various entrepreneurship ventures are propelling the market's growth. With the support of an increasingly sophisticated IT infrastructure, key participants are focusing on innovative product development strategies and making constant technological advancements. To enhance health & medical processes, market players are integrating big data, the Internet of Things (IoT), artificial intelligence (AI), and machine learning (ML) algorithms into their existing solutions.
In the IT healthcare industry, one of the key operations is to extract relevant information from different types of documents. The customer was facing challenges with the traditional method to extract insurance cards from images and retrieve only the relevant information from those extracted insurance cards. These challenges were mainly due to the following reasons: Primarily, the conventional method was rule-based and could not work with newer or different formats (more than 8000) of insurance cards. Moreover, with very skewed labelled data the rule based approach functioned only approximately 50% of the time. Secondly, the process was very slow as OCR for high-resolution images consumes a lot of time. The client wanted us to automate this extraction process with low latency and high accuracy along with the task to identify the type of insurance.
With very skewed labelled data the rule based approach functioned only approximately 50% of the time.
We used a blend of vision AI and Deep Learning to solve the customer's challenge. Here is a breakdown of the steps we used:
Step 1: Adjusted working process to adhere to HIPAA compliance
Before beginning to solve the problem, Health Insurance Portability and Accountability Act (HIPPA) compliance was to be respected. We then had to work remotely using virtual machines as per the client's environment. To adjust to this working process, a few labeling tools were built to maintain HIPAA compliance
Step 2: Utilized object detection-based models to extract the insurance card from the image
We iterated with several segmentation and object detection-based algorithms to crop insurance cards from the given images, and then extract pertinent information from the insurance cards. Additionally, this insurance card extraction from the provided image was completed with a very low latency rate of 20 ms per image. Following that, LayoutLMV2, a transformer-based model for labelling the information such as Ids, Names, etc., received the extracted card image and text as input. Eventually, we improved the process' speed, dynamic nature, and accuracy by feeding the model additional relevant data. Finally, the classification model was used to this extracted insurance card image to determine the payer type.
Step 3: Model deployment for object detection and information retrieval
The customized model was deployed with Rest API using FastAPI in the client’s AZURE environment using VMSS that automatically scales up and down the virtual machines as per the requirement. The whole process of insurance card extraction and information retrieval is now automatic, more accurate, and faster.
- Reduced the time taken by pipeline by 65%
- 8000 plan types are automated.
- Automatically pull data from relevant sources
- Saves time and cost
- Improves data-driven decisions