Dominic Tarn •December 19, 2022
Guide to Open-Source Annotation Tools For Medical Imaging
Viewing and annotating medical data, images and videos is a crucial, and frequent, task for many practitioners in the healthcare industry.
A starting point for many when evaluating how to go about this task, will be to start with open-source medical imaging annotation tools – these tools are a popular choice in the medical sector, and can be a smart way to save money when getting started on an image or video dataset annotation project.
In this article we will cover the handful of key data annotation tools that our team often discuss with leaders from Data Operations and Machine Learning teams (as well as radiologists, clinicians and the broader annotation community) as they are getting started in their ‘data annotation’ journey.
We will mainly cover a handful of tools designed to solve specific medical image annotation pain points and problems, such as MITK Workbench, ITK-Snap, 3D Slicer, and several others, rather than broader ranging computer vision annotation tools.
As we all know, there are pros and cons to using open-source tools for medical image annotation projects. When conducting your own evaluation, it’s worth comparing what is on the market with what your own requirements are – based on your specific use cases and forward looking plans.
In this article, we will cover several of the most popular open-source tools for medical image annotation, including the key use cases, benefits and downsides of these tools – we will also look ahead towards what’s next after getting started with these tools, and considerations that teams make as they go forward in their data annotation journey.
What are Open-Source Annotation Tools?
Open-source annotation tools are software programs whose source code is freely available for anyone to use. When we think of annotation platforms, what we mean by open-source annotation tools is tools that help teams with the broad annotation and labeling process (including use cases like image classification, image segmentation, data labeling and object detection). They will be aimed at supporting almost any image or video annotation purpose (unless the license specifically prohibits a certain type of use).
Open-source tools are usually built collaboratively, with numerous — sometimes hundreds or thousands — of developers contributing to the source code. Tools are often tested using publicly available medical imaging datasets, and are usually financially supported by a charitable foundation, public/users donations, or one or more tech company sponsors.
What are The Advantages of Using Open-Source Annotation Tools for Medical Imaging?
The key advantages of open-source annotation tools for medical imaging are:
- They are free to use
- They are available for commercial use and can be built upon and customized
- They typically support community and academic use cases alike
- In most cases, they support an array of medical image file formats (including DICOM and NIfTI medical image file formats).
Now let’s take a closer look at some of the most popular open-source annotation tools on the market.
What are Some of the Most Popular Medical Imaging Open-Source Annotation Tools?
A wide range of open-source tools support, and were specifically created to, manage annotation projects for medical image datasets. In this article, we will focus on several of the most popular, including MITK Workbench, ITK-Snap, 3D Slicer, HOROS, OsiriX, and the OHIF viewer.
MITK Workbench is a free open-source software for medical image processing, annotation, and segmentation. It’s based on The Medical Imaging Interaction Toolkit (MITK), “open-source software for the development of interactive medical image processing software.”
The source code is stored in GitHub and there is MTIK Workbench software that anyone can download and use for Windows, Linux, and Mac (macOS). Here’s more information about how you can use the MTIK Workbench for medical imaging annotation and segmentation projects.
MTIK, and the subsequent open-source workbench tool, were originally developed for and by PhD students and researchers in the Division of Medical and Biological Informatics (MBI) of the German Cancer Research Center.
ITK-Snap is another open-source medical imaging annotation tool – unlike some of the others we will cover in this article, it is focused exclusively on one step of the broader data annotation process: the segmentation task.
It was created as the result of a long-term collaboration between researchers at PICSL at the University of Pennsylvania, and the Scientific Computing and Imaging Institute (SCI) at the University of Utah, and as a result has a heavy academic following.
ITK-Snap’s main offering is manual segmentation tools (eg. brush and paint); it also provides a basic set of semi-automatic tools (mainly the ‘Snake Interaction Mode’) and complementary tools to the segmentation process (mainly the interpolation feature). It is a perfect fit and very popular option for practitioner teams just starting out, and it supports DICOM and NIfTI medical image file formats.
3D Slicer was created for the “visualization, processing, segmentation, registration, and analysis of medical, biomedical, and other 3D images and meshes.”
It comes with downloadable desktop software, can be used commercially, and enables access to an extensive development platform, and an active network of users. 3D Slicer helps medical imaging operations and data teams implement segmentation on multi-layered medical images, including 2D and 3D segmentations – tools available for segmentation include manual ones (eg. brush, drawing tool, and eraser) as well as a larger set of semi-automatic ones compared to ITK-Snap (eg. thresholding and level tracking).
3D Slicer also allows for basic tasks complementary to segmentation, including basic interpolation between slices, and filters. The main downside that teams often lament when using 3D Slicer for annotating images and files, are the functionality set (which is often reported as being quite convoluted) and the steeper learning curve compared to other tools mentioned in this article.
For over 10 years, the US National Institutes of Health (NIH) has been a key contributor and supporter, and 3D Slicer has had over 1 million downloads since it was launched.
HOROS is another free open-source medical imaging viewer and annotation software project. It is often a preferred tool when annotating with Apple computers and, not by coincidence, its stated goal is “to develop a fully functional, 64-bit medical image viewer for MacOS”.
Annotation and ML teams can use the HOROS viewer and annotation tools to annotate medical images and videos, store them in the cloud, and create reports to document an annotation project collaboratively.
HOROS works with and is supported by technology partners in the healthcare sector as well as by donations from users.
Interconnected to, and a supporter of HOROS, OsiriX is another option that many teams opt for when looking for a labeling tool to get started with.
Initially fully open-sourced, OsiriX now offers either a ‘Lite’ version, which is available for free as a demo application, or OsiriX MD, which is a commercial version that you can use from $69.99 per month.
Similar to most open-source tools, OsiriX Lite is often leveraged by early-stage startups, proof-of-concept (POC) projects and research work. Based on our many conversations with teams, a few key features are worth digging into when evaluating a tool like OsiriX Lite against others; specifically, its capabilities in regards to 3D rendering, as well as DICOM and collaboration features (which, depending on the use case, teams often cite as being limited).
On the other hand, one of the main benefits of OsiriX MD is to make up for the issue of security, which is one of the main challenges with open-source annotation tools (and which we’ll go through in a second more in depth).
OsiriX MD is an FDA-cleared and CE II labeled tool, and this increased level of security and safety makes it a better option for teams annotating professionally (or undergoing or eyeing the FDA approval).
The OHIF Viewer was developed by and is supported by the Open Health Imaging Foundation (OHIF), at the Massachusetts General Hospital (MGH), and is open-source software under an MIT license.
The OHIF Viewer is a tool for creating “custom workflows with user-friendly interfaces. Review cases and report results quickly, zero installation required.” It includes advanced visualization tools, an easy-to-use annotation suite, and is compliant with DICOMWeb and OpenID Connect standards.
OHIF is an open-source annotation tool that comes close to commercial options, as it supports multi-modal image fusion, multiplanar reformatting, and more. It also comes with a cloud-based interface, making it easier to manage collaborative annotation projects.
Despite the various benefits of using open-source annotation tools for medical imaging projects, there are several downsides too.
What are the Downsides of Using Open-Source Tools for Medical Imaging Annotation?
As with any image annotation tool, the ultimate goal of the labeling process is to provide high-quality data to the next step in the process; annotation is simply one stage, albeit a crucial one – in the case of building machine learning applications, once the datasets are labeled and annotated, you will be putting them to test by feeding them into a model (often a broader machine learning (ML) or computer vision (CV) model), then training and iterating on the training, until you are ready to launch a production-ready model and finally solve the critical objective you set out to achieve.
Open-source tools are often a great starting point when going from 0 to 1 with the annotation of medical imaging and video datasets, but, as we all know, they are inherently limited in their ability to achieve some of the more critical and powerful outcomes that teams face as they progress through the journey.
Throughout conversations with thousands of practitioners and leaders across the Medical AI community, we recurrently hear about numerous key downsides of open source annotation software – these are often useful to consider ahead of time, in order for you to effectively plan out what your next steps in your data journey will be.
Below, we’ll dig deeper into the three main ones – scale, security and collaboration – which you can also read more about in our blog here. These are:
- Scaling annotation activity with open-source tools is a big challenge. Open-source tools often come with a basic set of features (and hence are a perfect fit for many teams as they get started), but lack the wider set of needs that companies start to require at scale. For example, whereas it’s often a sensible setup for teams to collaborate on the annotation process over an open-source tool and back and forth emails, as the number of annotators and volume of data increases, in-app and real-time collaboration capabilities like tagging become key. Teams we work with often start to feel this pain point as they start to scale, and that’s when a more solid commercial option can help save resources, speed up the process, and also avoid errors and inaccuracy.
- Open-source tools inherently fall behind on security requirements. In most cases, open-source tools don’t benefit from the rigorous compliance standards of commercial tools, and by nature don’t include features like auditing, which more established companies require in order to achieve milestones like FDA approval. Many don’t have auditable data trails that can be monitored, tracked and reported on, making it more difficult to achieve compliance with the FDA and HIPAA, or GDPR and CE certification in Europe.
- Free doesn’t always mean cost-effective. As the volume of annotations and annotators increases, the hidden costs of managing the process starts to grow exponentially. At this stage, project leaders often find themselves needing to start being able to quantify, manage and measure their process and work; they need to be able to monitor and gain clear insight into the process, and streamline operations on multiple spectrums. This is where limitations of open-source tools start to heavily affect their ability to achieve their objectives; two examples we hear often at this stage are frustrations around needing to write off a large percentage of time on non-value add tasks, as well as not being able to track how each annotator is performing, leading to poor process and output.
After getting started with open-source annotation software for their medical imaging dataset annotation projects, most teams tend to graduate to a commercial, proprietary tool that’s purpose-built to help take their project from 1 onwards. Tools like these are often easy to collaborate across, have best-practice security standards, and allow data operations and machine learning teams to scale their project cost-effectively (as well as help attain milestones like FDA clearance).
Medical image annotation in Encord
Encord is the leading annotation platform in medical AI, with a platform built to easily manage the annotation process, while allowing for the most complex of annotation tasks. At Encord, we have developed our medical imaging dataset annotation software in collaboration with data operations, machine learning and AI leaders across the medical industry – this has enabled us to build a powerful, automated image annotation suite, allowing for fully auditable data, and powerful labeling protocols.
A few of the successes achieved by the medical teams we work with:
- Stanford Medicine cut experiment duration time from 21 to 4 days, while processing 3x the number of images in 1 platform rather than 3
- King’s College London achieved a 6.4x average increase in labeling efficiency for GI videos, automating 97% of the labels and allowing their annotators to spend time on value-add tasks
- Memorial Sloan Kettering Cancer Center built 1000, 100% auditable custom label configurations for its pulmonary thrombosis projects
Experience Encord in action. Try it for Free Today.
Dominic has over 10 years' experience writing content for high growth AI and SaaS startups. His writing covers a wide range of topics, including machine learning, artificial intelligence and computer vision. Dominic is the founder & CEO of Inbound Sales Content (ISC), an SEO growth-focused B2B content marketing agency. He has a History BA from UCL, has lived in three countries in the last decade, and is now happily settled with a family and cat in the North East of England. https://www.linkedin.com/in/dominicntarn-inboundsalescontent/ https://www.inboundsalescontent.com/
February 1, 2023
12 min read
January 31, 2023
7 min read