Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can be used to prepare raw data or improve existing training data to get more accurate ML models.
You can run the latest Label Studio version locally without installing the package with pip.
# Install all package dependencies
pip install -e .
# Run database migrations
python label_studio/manage.py migrate
# Start the server in development mode at http://localhost:8080
python label_studio/manage.py runserver
Deploy in a cloud instance
You can deploy Label Studio with one click in Heroku, Microsoft Azure, or Google Cloud Platform:
Apply frontend changes
The frontend part of Label Studio app lies in the frontend/ folder and written in React JSX. In case you’ve made some changes there, the following commands should be run before building / starting the instance:
cd frontend/
npm ci
npx webpack
cd ..
python label_studio/manage.py collectstatic --no-input
Troubleshoot installation
If you see any errors during installation, try to rerun the installation
pip install --ignore-installed label-studio
Install dependencies on Windows
To run Label Studio on Windows, download and install the following wheel packages from Gohlke builds to ensure you’re using the correct version of Python:
# Upgrade pip
pip install -U pip
# If you're running Win64 with Python 3.8, install the packages downloaded from Gohlke:
pip install lxml‑4.5.0‑cp38‑cp38‑win_amd64.whl
# Install label studio
pip install label-studio
What you get from Label Studio
Multi-user labeling sign up and login, when you create an annotation it’s tied to your account.
Multiple projects to work on all your datasets in one instance.
Streamlined design helps you focus on your task, not how to use the software.
Configurable label formats let you customize the visual interface to meet your specific labeling needs.
Support for multiple data types including images, audio, text, HTML, time-series, and video.
Import from files or from cloud storage in Amazon AWS S3, Google Cloud Storage, or JSON, CSV, TSV, RAR, and ZIP archives.
Integration with machine learning models so that you can visualize and compare predictions from different models and perform pre-labeling.
Embed it in your data pipeline REST API makes it easy to make it a part of your pipeline
Included templates for labeling data in Label Studio
Label Studio includes a variety of templates to help you label your data, or you can create your own using specifically designed configuration language. The most common templates and use cases for labeling include the following cases:
Set up machine learning models with Label Studio
Connect your favorite machine learning model using the Label Studio Machine Learning SDK. Follow these steps:
React and JavaScript frontend for managing data. Includes the Label Studio Frontend. Relies on the label-studio server or a custom backend with the expected API methods.
Transformers library connected and configured for use with Label Studio
Roadmap
Want to use The Coolest Feature X but Label Studio doesn’t support it? Check out our public roadmap!
Citation
@misc{Label Studio,
title={{Label Studio}: Data labeling software},
url={https://github.com/heartexlabs/label-studio},
note={Open source software available from https://github.com/heartexlabs/label-studio},
author={
Maxim Tkachenko and
Mikhail Malyuk and
Nikita Shevchenko and
Andrey Holmanyuk and
Nikolai Liubimov},
year={2020-2021},
}
Website • Docs • Twitter • Join Slack Community
What is Label Studio?
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can be used to prepare raw data or improve existing training data to get more accurate ML models.
Have a custom dataset? You can customize Label Studio to fit your needs. Read an introductory blog post to learn more.
Try out Label Studio
Try out Label Studio in a running app, install it locally, or deploy it in a cloud instance.
Install locally with Docker
Run Label Studio in a Docker container and access it at
http://localhost:8080
.You can find all the generated assets, including SQLite3 database storage
label_studio.sqlite3
and uploaded files, in the./mydata
directory.Override default Docker install
You can override the default launch command by appending the new arguments:
Build a local image with Docker
If you want to build a local image, run:
Run with Docker Compose
Docker compose script provides production-ready stack consisting of the following components:
To start using the app from
http://localhost
run this command:Install locally with pip
Install locally with Anaconda
Install for local development
You can run the latest Label Studio version locally without installing the package with pip.
Deploy in a cloud instance
You can deploy Label Studio with one click in Heroku, Microsoft Azure, or Google Cloud Platform:
Apply frontend changes
The frontend part of Label Studio app lies in the
frontend/
folder and written in React JSX. In case you’ve made some changes there, the following commands should be run before building / starting the instance:Troubleshoot installation
If you see any errors during installation, try to rerun the installation
Install dependencies on Windows
To run Label Studio on Windows, download and install the following wheel packages from Gohlke builds to ensure you’re using the correct version of Python:
What you get from Label Studio
Included templates for labeling data in Label Studio
Label Studio includes a variety of templates to help you label your data, or you can create your own using specifically designed configuration language. The most common templates and use cases for labeling include the following cases:
Set up machine learning models with Label Studio
Connect your favorite machine learning model using the Label Studio Machine Learning SDK. Follow these steps:
This lets you:
Integrate Label Studio with your existing tools
You can use Label Studio as an independent part of your machine learning workflow or integrate the frontend or backend into your existing tools.
Ecosystem
Roadmap
Want to use The Coolest Feature X but Label Studio doesn’t support it? Check out our public roadmap!
Citation
License
This software is licensed under the Apache 2.0 LICENSE © Heartex. 2020-2021