CLOP: Omics Pretraining
We present CLOP, a model trained on FASTA, BED, and GFF files to embed DNA sequences by species and biotype for retrieval, classification, and generation.
CLOP is an adaptation of OpenAI's CLIP for omics; this demo covers genomics. The model is trained on FASTA, BED, and GFF files (representing the genomes of different species and their annotations) to learn meaningful representations for downstream retrieval, classification, and generation. It embeds DNA sequences according to species and biotype (e.g. exon, long non-coding RNA, pseudogene).
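For readers unfamiliar with CLIP, the sketch below shows the symmetric contrastive objective that CLIP-style models generally use, applied here to paired sequence and annotation embeddings. It is only an illustration under assumptions: the function names, the temperature value, and the idea that each DNA sequence is paired with one species/biotype annotation embedding are hypothetical, and the talk does not state CLOP's actual loss or architecture.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=-1, keepdims=True), eps)

def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))

def clip_style_loss(seq_emb, label_emb, temperature=0.07):
    """Symmetric contrastive loss over a batch of matched pairs.

    seq_emb[i] and label_emb[i] are assumed to be a matching pair
    (hypothetical setup: a DNA sequence and its species/biotype annotation).
    """
    seq = l2_normalize(seq_emb)
    lab = l2_normalize(label_emb)
    logits = seq @ lab.T / temperature        # (batch, batch) similarity matrix
    idx = np.arange(logits.shape[0])          # matching pairs lie on the diagonal
    loss_seq_to_label = -log_softmax(logits, axis=1)[idx, idx].mean()
    loss_label_to_seq = -log_softmax(logits, axis=0)[idx, idx].mean()
    return 0.5 * (loss_seq_to_label + loss_label_to_seq)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    seqs = rng.normal(size=(8, 16))
    labels = seqs + 0.01 * rng.normal(size=(8, 16))  # nearly aligned pairs
    print(clip_style_loss(seqs, labels))
```

The loss is low when each sequence embedding is closest to its own annotation embedding and high when the pairs are mismatched, which is what pulls sequences of the same species and biotype together in the embedding space.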
The demo visualizes CLOP's biological embeddings with UMAP and t-SNE and performs zero-shot classification of DNA sequences using ONNX.
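Zero-shot classification with a CLIP-style model usually reduces to a nearest-label lookup in the shared embedding space. The sketch below assumes the sequence and label embeddings have already been computed (for example by an ONNX-exported encoder, which is not shown); the function name, the toy vectors, and the label list are hypothetical and only loosely based on the biotypes named above.

```python
import numpy as np

def zero_shot_classify(seq_emb, label_embs, label_names):
    """Return the label whose embedding has the highest cosine similarity.

    seq_emb:     (dim,) embedding of one DNA sequence (assumed precomputed)
    label_embs:  (n_labels, dim) embeddings of candidate labels
    label_names: list of n_labels strings
    """
    seq = seq_emb / np.linalg.norm(seq_emb)
    labs = label_embs / np.linalg.norm(label_embs, axis=1, keepdims=True)
    sims = labs @ seq                      # cosine similarity per label
    best = int(np.argmax(sims))
    return label_names[best], sims

if __name__ == "__main__":
    # Toy 3-dimensional embeddings, purely illustrative.
    names = ["exon", "long non-coding RNA", "pseudogene"]
    label_embs = np.eye(3)
    seq_emb = np.array([0.1, 0.9, 0.2])
    label, sims = zero_shot_classify(seq_emb, label_embs, names)
    print(label, sims)
```

Because no classifier head is trained, new species or biotype labels can be added at inference time simply by embedding them and appending rows to `label_embs`.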