The company offers ARE4, an AI-backed natural language processing (NLP) and information extraction (IE) technology for the healthcare industry that creates structured data from raw, unstructured text files of the kind found in electronic medical records, pathology reports, PubMed, and many other medical data warehouses. ARE4 can process unstructured data such as patient progress notes and radiology reports and transform it into a usable data format. It can also tackle problems such as extracting information from handwritten physician prescriptions, which are often incomplete and hard to read, or retrieving the history of the patient population responding to various therapies. ARE4 handles the many irregularities contained in raw data, such as headers, incomplete and run-on sentences, misspellings, and punctuation issues. Furthermore, the AI-backed platform can run a typed query, for example a search for tumor size, quickly and effectively; it goes beyond simple search to surface granular details that enable researchers, physicians, and scientists to gain insights that enhance precision medicine.
The company has invested time and effort in training its solution on millions of medical records, which helps the AI understand the complexity inherent in unstructured medical text. Once the data is fed into the algorithm, it resolves abbreviations, issues with named entities such as genes, and much else that is not found in typical lay text. For example, to predict who will be the next President based on tweets, one would run a Twitter sentiment analysis. The first step is to remove all the special characters (@, #, &, $) from each tweet so that the data is clean and the machine can interpret the text. ARE4 does the equivalent for medical text: without any manual labor, it cleans and interprets the text. The next step, after the text is processed, is to connect the dots, that is, to establish relationships between the data. Here, instead of using a typical relational SQL system, MST harnesses a graph database. A typical relational database stores data in tables, which is fine, but as the number of tables multiplies, the data becomes harder to handle: the growing number of keys and joins (which are expensive) prolongs the time taken to process queries. A graph database, on the other hand, stores the relationships along with the data. Each node is linked directly to its related nodes in the database, so following a relationship does not require a join, which keeps relationship queries fast. Moreover, these connected relationships, expressed using the Resource Description Framework (RDF), are stored in a triplestore, a specialized database for storing “triples” (data entities composed of a subject, a predicate, and an object), to enable lightning-fast and accurate search results.
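To make the idea of a triple concrete, here is a minimal sketch in Python of an in-memory store that holds subject, predicate, object statements and answers pattern queries. It is an illustration only, not ARE4's implementation or a real RDF library: the class name `TinyTripleStore`, the identifiers such as `patient:001`, and the predicates `has_finding` and `tumor_size_cm` are all hypothetical, and a production triplestore would add indexing, persistence, and a query language.

```python
from typing import List, Optional, Tuple

# A triple is (subject, predicate, object), as described in the text.
Triple = Tuple[str, str, str]


class TinyTripleStore:
    """Hypothetical in-memory store used only to illustrate triples."""

    def __init__(self) -> None:
        self._triples = set()

    def add(self, subject: str, predicate: str, obj: str) -> None:
        """Store one statement; duplicates are ignored because a set is used."""
        self._triples.add((subject, predicate, obj))

    def match(
        self,
        subject: Optional[str] = None,
        predicate: Optional[str] = None,
        obj: Optional[str] = None,
    ) -> List[Triple]:
        """Return all triples matching the pattern; None acts as a wildcard."""
        pattern = (subject, predicate, obj)
        return sorted(
            triple
            for triple in self._triples
            if all(want is None or want == got for want, got in zip(pattern, triple))
        )


# Made-up example data, not taken from any real record.
store = TinyTripleStore()
store.add("patient:001", "has_finding", "tumor")
store.add("patient:001", "tumor_size_cm", "2.3")
store.add("patient:002", "has_finding", "tumor")
store.add("patient:002", "responded_to", "therapy:A")

# Which patients have a tumor finding?
patients_with_tumor = [s for s, _, _ in store.match(predicate="has_finding", obj="tumor")]
print(patients_with_tumor)
```

Because each relationship is stored as its own statement, a question such as "which patients have a tumor finding" becomes a single pattern match rather than a join across several tables.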
The icing on the cake is the Confidence Scoring Engine (CSE) layer in the ARE4 solution. The CSE uses a machine-learning algorithm to analyze the physician’s level of confidence in their report, examining each aspect of the report for the confidence level of the physician who wrote it. For example, a phrase like ‘for sure’ in a report will be treated as a confirmed yes with a high confidence score, whereas words like ‘maybe’ or ‘most likely’ will be scored low by the algorithm. Going the extra mile with the underlying technology gives users a robust system that enables them to unlock the potential of unstructured data.
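The scoring behavior in that example can be sketched with a small cue-phrase lookup in Python. This is a deliberately simplified, rule-based stand-in and not the machine-learning algorithm the CSE uses; the function name `confidence_score`, the numeric scores, and the neutral default are assumptions made purely for illustration. Only the cue phrases themselves come from the example above.

```python
import re

# Cue phrases from the example in the text; the numeric scores are hypothetical.
CUE_SCORES = {
    "for sure": 0.95,     # treated as a confirmed yes, high confidence
    "most likely": 0.40,  # hedged wording, scored low
    "maybe": 0.30,        # hedged wording, scored low
}

# Hypothetical neutral score used when no cue phrase is present.
DEFAULT_SCORE = 0.50


def confidence_score(sentence: str) -> float:
    """Return a confidence score for one report sentence.

    If several cues appear, the most cautious (lowest) score wins, on the
    assumption that any hedge weakens the overall statement.
    """
    text = sentence.lower()
    found = [
        score
        for cue, score in CUE_SCORES.items()
        if re.search(r"\b" + re.escape(cue) + r"\b", text)
    ]
    return min(found) if found else DEFAULT_SCORE


# Made-up report sentences for illustration.
for sentence in (
    "This is for sure a malignant lesion.",
    "Maybe a small nodule in the left lobe.",
    "Lesion measures 2 cm.",
):
    print(f"{confidence_score(sentence):.2f}  {sentence}")
```

A learned model would replace the fixed lookup table with weights estimated from annotated reports, but the input and output have the same shape: a sentence goes in and a confidence score comes out.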
What really sets us apart is that our solution is a combination of traditional NLP and machine learning versus just pure machine learning, which helps us deal with the messy sentence structures.
At a time when technological advancement is necessary to stay ahead of the pack, MST continually works on enhancements, be it to its data ingestion or to its analytics solution. Among the several things in the workshop, one the company is keenly working on is Adverse Event Reporting, which receives, tracks, and mines a patient’s prescribed medication and analytically arrives at a conclusion as to what needs to be reported and what is irrelevant. Although the big picture revolves around sophisticated search, the company, with its various add-ons, is set to disrupt many systems and processes currently used in the healthcare industry.