Skip to content

Imageomics: Bringing machine learning to life.

The Imageomics Institute GitHub organization hosts the development and distribution of a collection of open-source ML tools used to study the biological information encoded in images and videos integrated with structured biological knowledge.

What is the Imageomics Institute?

The Imageomics Institute is funded by the US National Science Foundation's Harnessing the Data Revolution (HDR) program under Award #2118240 (Imageomics: A New Frontier of Biological Information Powered by Knowledge-Guided Machine Learning). It started in Oct 2021.

You can find a full mission, vision, and abstract under the Imageomics website's About page. In short, the vision of the Institute is to "establish a new scientific field called imageomics that harnesses revolutions in data science and computing, as well as the rapidly expanding collections of biological image data, in order to accelerate biological understanding of phenotypic traits extracted from images of organisms."

History

The inception and research of the Imageomics Institute builds heavily on the "Biology-Guided Neural Networks for Discovering Phenotypic Traits" (BGNN) project, also funded by the US National Science Foundation. BGNN itself built in part on the Phenoscape project (funded by NSF multiple times), which started in 2007 and was incubated at the NSF-funded National Evolutionary Synthesis Center (NESCent).

Code repositories overview

Due to the history (see above) and highly collaborative and cross-disciplinary nature of the Institute, important software products and other code repositories are distributed over several organizations in GitHub, in addition to the ones found here. The following gives an overview and useful links.

Imageomics Institute

Institute collaborators


Disclaimer: Any opinions, findings and conclusions or recommendations expressed in the materials here are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Pinned Loading

  1. bioclip-2 bioclip-2 Public

    Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight]

    Python 44 8

  2. Collaborative-distributed-science-guide Collaborative-distributed-science-guide Public template

    Template guide to collaborative work, including GitHub and Hugging Face workflows. Co-developed with the Imageomics Guide.

    Python 1

  3. emb-explorer emb-explorer Public

    An interactive tool for classifying images with a pretrained model and exploring clustering results in 2D space.

    Python

  4. pybioclip pybioclip Public

    Python package that simplifies using the BioCLIP foundation model.

    Python 57 10

  5. SST SST Public

    Official repo for paper "Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation"

    Jupyter Notebook 10 3

  6. TaxonoPy TaxonoPy Public

    A Python package for efficiently aligning organismal taxonomic hierarchies using the Global Names Verifier

    Python 7

Repositories

Showing 10 of 101 repositories
  • TreeOfLife-toolbox Public

    Source-specific tools for processing data (images) downloaded using distributed downloader and relies on MPI.

    Imageomics/TreeOfLife-toolbox’s past year of commit activity
    Jupyter Notebook 3 MIT 1 9 24 Updated Jan 9, 2026
  • mmla Public

    Scripts for MMLA dataset

    Imageomics/mmla’s past year of commit activity
    Jupyter Notebook 6 MIT 0 0 11 Updated Jan 8, 2026
  • dna-trait-analysis Public

    Goal: to find associations between dna data and visual traits.

    Imageomics/dna-trait-analysis’s past year of commit activity
    Jupyter Notebook 4 MIT 0 7 (3 issues need help) 1 Updated Jan 8, 2026
  • repo-exporter Public

    A Python script that collects key info (Name, description, creation date, last updated date, contributors, etc.) from all repos in a GitHub organization and saves it to a color-coded Google Sheet file of choice.

    Imageomics/repo-exporter’s past year of commit activity
    Python 2 MIT 0 4 0 Updated Jan 8, 2026
  • HDR-SMood-Challenge-sample Public

    Sample Submission Repository for the 2025 HDR Scientific Mood Challenge (Modeling out of distribution).

    Imageomics/HDR-SMood-Challenge-sample’s past year of commit activity
    Jupyter Notebook 1 MIT 1 0 4 Updated Jan 8, 2026
  • pygbif Public Forked from gbif/pygbif

    GBIF Python client

    Imageomics/pygbif’s past year of commit activity
    Python 0 MIT 36 0 2 Updated Jan 7, 2026
  • bioclip-2 Public

    Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight]

    Imageomics/bioclip-2’s past year of commit activity
    Python 44 8 2 0 Updated Jan 7, 2026
  • bioclip-vector-db Public

    Vector database training data retrieval of BioCLIP-embedded nearest neighbors.

    Imageomics/bioclip-vector-db’s past year of commit activity
    Python 0 MIT 1 10 4 Updated Jan 7, 2026
  • cautious-robot Public

    Simple images from CSV downloader that runs and records checksums on downloaded image folder.

    Imageomics/cautious-robot’s past year of commit activity
    Python 5 MIT 1 8 (1 issue needs help) 1 Updated Jan 6, 2026
  • SST Public

    Official repo for paper "Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation"

    Imageomics/SST’s past year of commit activity
    Jupyter Notebook 10 MIT 3 0 6 Updated Jan 5, 2026