Projects Registry
The Projects Registry tracks notable open source projects at Stanford.
The breadth of the registry highlights the expertise of the research community, potential touchpoints for further discussion and collaboration, and the sheer amount of open source development happening on campus. It spans many disciplines that cover a wide range of experience levels and interests.
Currently, there are 74 projects, platforms, & data tools and 13 labs & groups listed.
If you would like to add your project to the registry or amend an existing project, please contact the Technical Community Manager (Francesca Vera) with relevant information.
Projects
The projects listed within this section make up the nominees for our Open Source Software Prize.
yt project
yt is an open source Python package for analyzing and visualizing volumetric data, and it has been applied in domains like astrophysics, seismology, nuclear engineering, molecular dynamics, and oceanography.
View ProjectHolistic Evaluation of Language Models (HELM)
Holistic Evaluation of Language Models (HELM) is a framework to increase the transparency of language models by the Stanford Center for Research on Foundation Models (CRFM).
View ProjectARNIE
ARNIE is a Python package that is widely used to compute RNA energetics and for RNA structure prediction. The work has led to many peer-reviewed publications, including some with important applications to current RNA medicine (e.g. mRNA vaccine stabilization co-authored with Pfizer).
View ProjectOpenCap
OpenCap is an open source platform that allows computing kinematics and dynamics of human movement using videos from two or more smartphones.
View ProjectPalladio
Palladio is an open source toolbox that helps humanities researchers visualize complex historical data with ease.
View Projectdada2
dada2 is an R package for high-resolution sample inference from high-throughput amplicon sequencing data.
View ProjectCausal Inference Methods
Software for causal inference methods, in particular for matching estimators and regression discontinuity.
View ProjectDynamax
Dynamax is a library for probabilistic state space models (SSMs) written in JAX. It is a JAX-based version of SSM, resulting from a collaboration with Google researchers and developers.
View ProjectNiPreps
NiPreps (NeuroImaging PREProcessing toolS) is a community project that provides researchers with applications that allow them to prepare data for modeling and statistical analysis, and perform quality control smoothly.
View ProjectFlashAttention
FlashAttention is an algorithm to speed up attention and reduce its memory footprint. It is now upstreamed into almost all popular libraries for AI (from PyTorch to OpenAI's system).
View ProjectHummingbird
Hummingbird is a Python framework for predicting performance of computing instances with varying memory and CPU on multiple cloud platforms. It gives a variety of configurations to run genomics pipelines on cloud platforms.
View ProjectVistasoft
Vistasoft is a Matlab toolbox for analyzing Magnetic Resonance Imaging (MRI) data. Vistasoft tools span anotomical, functional, and diffusion analyses. It also includes some sample data.
View Projecttorch-choice
torch-choice is an open source PyTorch package for flexible, fast, large-scale choice modeling with Python. It is a toolkit that can be used for modeling consumer behaviors.
View ProjectZMap
The ZMap Project is a collection of open source tools for performing large-scale studies of the hosts and services that compose the public Internet.
View ProjectStanford Spezi
Oliver Aalami, Paul Schmiedmayer, Vishnu Ravi
Stanford Spezi is an open source framework that is designed for digital health applications and makes it easy to build an app with modules that include questionnaires, data collection from wearable devices, and integration with electronic record systems.
View ProjectPyGeoprocessing
PyGeoprocessing is a Python/Cython-based library that provides a set of commonly used raster, vector, and hydrological operations for GIS processing. PyGeoprocessing is developed at the Natural Capital Project to create a programmable, open source, and free Python based GIS processing library to support the InVEST toolset.
View ProjectBioPortal
BioPortal is an online repository of all publicly available biomedical ontologies and controlled terminologies in the world, with more than 1000 resources. BioPortal allows users to search for ontologies, to browse them, and to review information about their provenance and use. BioPortal content is available through a Web-based user interface and an API. Additional components determine how terms in one ontology map to terms in other ontologies in the system, and which ontologies might be most useful for describing the content in particular situations. The BioPortal software forms the basis of a system known as OntoPortal, which is used by an international community of investigators to host ontologies in different scientific disciplines.
View ProjectPENSA
Ron Dror, Martin Voegele, Sang Truong
PENSA is a collection of Python methods for exploratory analysis and comparison of biomolecular conformational ensembles.
View ProjectSOMATA
State-space Oscillator Modeling And Time-series Analysis (SOMATA) is a Python library for state-space neural signal processing algorithms developed in the Purdon Lab.
View ProjectHapkit
Hapkit is an open-hardware haptic device that allows users to input motions and feel programmed forces in one degree of freedom. Stanford University has used the Hapkit in online courses, on-campus courses, and educational research.
View ProjectLabs & Groups
Stanford Trustworthy AI Research (STAIR)
The Stanford Trustworthy AI Research group develops principles and practice of trustworthy machine learning, including robust federated machine learning and metric elicitation, and selecting more effective machine learning metrics via human interaction. Some research applications are neuroimaging, healthcare, and biomedical imaging.
View GroupLobell Lab
The Lobell Lab uses a range of modern tools to study the interactions between food production, food security, and the environment.
View GroupHazy Research
Hazy Research is a Computer Science research group interested in building foundations for the next generation of machine learning systems. The group produces open source projects that are widely used in AI and data science.
View GroupStanford Biodesign Digital Health Group
Oliver Aalami, Paul Schmiedmayer, Vishnu Ravi
The Stanford Biodesign Digital Health Group advances digital health research and applications by fostering an accessible health ecosystem. The group develops, implements, and investigates digital health solutions that improve health journeys, including its flagship project, Stanford Spezi.
View GroupData
While we recognize that there are code aspects to these projects, the following are primarily open data projects.
MetaLab
MetaLab is a database that contains almost 3,000 effect sizes across early language and cognitive development domains, based on data from 747 papers and 48,529 subjects.
View ProjectVascular Model Repository
The Vascular Model Repository is an open source database of normal and diseased cardiovascular models available for academic, government, and industry researchers. The models can be used to verify computational methods for fluid and solid mechanics. It received a DataWorks Achievement Award in 2023.
View ProjectOpenNeuro
OpenNeuro is an open source platform for the archiving and sharing of neuroimaging data. The platform validates and shares Brain Imaging Data Structure (BIDS) compliant MRI, PET, MEG, EEG, and iEEG data.
View ProjectPlatforms & Utilities
Open Graph Benchmark (OGB)
The Open Graph Benchmark (OGB) provides datasets, data loaders, and evaluators for benchmarking graph machine learning methods. It has become the gold standard for research in machine learning for graphs.
View ProjectDatapages
Tools and templates for sharing data along with interactive visualizations, rich documentation, and user-friendly access functionality.
View Project