Are you a researcher working with sensitive datasets—like whole genome sequences, clinical records, or phenotype data? You want to collaborate with other scientists, but you can't just hand over your raw data.That's where BioVault comes in.
- Create your private BioVault — set up a secure workspace for your sensitive data (e.g., patient genomes or clinical records) that only you control.$ bv init my_patient_data✓ BioVault "my_patient_data" created locally
- Add your data — place sequencing files (FASTQ, BAM/CRAM, VCF, phenotype data, etc.) into the vault. The files stay on your machine and never leave your control.$ bv add ./data/patient1.vcf --vault my_patient_data✓ Added 1 file to BioVault "my_patient_data"
- Collaborator prepares code — another researcher builds and tests a workflow or arbitrary code against your vault (e.g., GWAS, splicing analysis, or custom scripts).$ bv vault list✓ Listing Vaults$ bv run ./variant_discovery_analysis syft://research@my_patient_data#participants --test✓ Testing workflow on target vault
- Workflow is submitted for your review — the collaborator sends their code to your inbox. You see exactly what will run before deciding whether to approve.$ bv submit ./variant_discovery_analysis syft://research@my_patient_data#participants⚡ Analysis Submitted$ bv inbox1 pending request: Collaborator@University → pipeline: variant_discovery_analysis.nf
- Approve and run locally — once approved, the workflow or arbitrary code runs inside your BioVault. The raw data never leaves your machine; only results are shared back.$ bv approve variant_discovery_analysis✓ Approved. Running NextFlow pipeline securely...$ bv inbox
Project Status Path variant_discovery_analysis ✅ Success ./results/variant_discovery_out.csv
