You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Highlights
Two standalone Streamlit apps — Split the monolithic app.py into apps/embed_explore/ (embed & cluster your own images) and apps/precalculated/ (explore precomputed embeddings), each with dedicated entry points.
Shared module architecture — Common code extracted into shared/{components,services,utils,lib}/ to keep both apps DRY.
GPU-to-CPU fallback — Clustering auto-falls back through cuML → FAISS → sklearn on OOM or CUDA errors.
cuML UMAP stability — UMAP runs in an isolated subprocess with L2-normalized embeddings to prevent SIGFPE crashes.
Lazy-loaded heavy libraries — Deferred imports of torch, cuML, and FAISS cut startup time drastically
Improved visualization — Zoom/pan support and heatmap option for clustering charts; chart interactions no longer trigger full page reruns.