Supported formats: jpg, png, webp.
This demo uses the BioCLIP backbone (imageomics/bioclip via open_clip) with a frozen encoder and a small linear head trained on the BioTrove Reptilia subset (189 species). Inputs are normalized with the BioCLIP validation transform, fed through the encoder, and scored with a softmax over the reptile classes.
How inference works here:
/predict_frame for scoring.Only the BioCLIP model is used for both uploads and live feed; no additional detectors or models are involved.