Artificial Intelligence model trained on NVIDIA GPUs identifies over a million species

Tanya Berger-Wolf’s BioCLIP 2 is an Artificial Intelligence foundation model trained on NVIDIA GPUs and a 214 million-image dataset that maps traits across more than 925,000 taxonomic classes. The model is available open source on Hugging Face and will be presented at NeurIPS.

Tanya Berger-Wolf, director of the Translational Data Analytics Institute and a professor at The Ohio State University, has developed BioCLIP 2, an Artificial Intelligence foundation model trained on the largest and most diverse organism dataset to date. The project grew from Berger-Wolf’s early work identifying individual zebras and now targets broad biodiversity tasks. The underlying dataset, TREEOFLIFE-200M, contains 214 million images spanning more than 925,000 taxonomic classes and was curated with partners including the Imageomics Institute and the Smithsonian Institution.

BioCLIP 2 goes beyond image recognition to learn species traits, taxonomic hierarchies and intra-species variation without explicit labels. In experiments the model arranged Darwin’s finches by beak size, distinguished adult from juvenile and male from female animals, and separated healthy plant leaves from diseased ones while identifying different disease types. The team trained the model for 10 days on 32 NVIDIA H100 GPUs and used a cluster of 64 NVIDIA Tensor Core GPUs to accelerate training, with individual Tensor Core GPUs employed for inference. The model is available under an open-source license on Hugging Face and was downloaded more than 45,000 times in the last month. BioCLIP 2 builds on an earlier version that was also trained on NVIDIA GPUs and received the Best Student Paper award at CVPR.

The researchers plan to extend the work into interactive wildlife digital twins that visualize and simulate ecological interactions and species perspectives, enabling low-impact experimentation and public-facing experiences such as zoo installations. Berger-Wolf and her team emphasize that these tools can help address data deficiencies in conservation biology by serving as a biological encyclopedia and inference platform to fill gaps for under-documented species. The BioCLIP 2 paper will be presented at NeurIPS, with sessions scheduled Nov. 30 to Dec. 5 in Mexico City and Dec. 2 to 7 in San Diego.

68

Impact Score

Nvidia targets Windows PC chips with RTX Spark

Nvidia’s RTX Spark combines its Blackwell RTX GPU and Grace CPU for Windows PCs, pushing the company deeper into markets led by Intel, AMD, and Qualcomm. The move could intensify competition around Arm-based processors and local Artificial Intelligence workloads.

Devin Desktop turns Windsurf into an agent command center

Cognition has renamed Windsurf as Devin Desktop, positioning the IDE as a unified surface for managing coding agents. The product keeps the existing editor experience while adding multi-agent workflows, shared context, and cloud handoff features.

NVIDIA advances U.K. sovereign Artificial Intelligence push

NVIDIA is positioning the U.K.’s sovereign Artificial Intelligence effort as a shift from policy to deployment, with new compute plans, startup funding and enterprise projects. The push spans cloud infrastructure, life sciences, coding, inference and developer training.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.