{% extends "base.html" %} {% set active_page = 'manifest' %} {% block title %}Workset Manifest Generator - Ursa Customer Portal{% endblock %} {% block content %}

Run Configuration

Unique identifier for this analysis run

Analysis Inputs

Each analysis input represents a sequencing library to be processed. This corresponds to a row in stage_samples.tsv.

Manifest Preview (TSV)

# Add analysis inputs to generate the manifest
Required Columns: RUN_ID, SAMPLE_ID, EXPERIMENTID, SAMPLE_TYPE, LIB_PREP, SEQ_VENDOR, SEQ_PLATFORM, LANE, SEQBC_ID, PATH_TO_CONCORDANCE_DATA_DIR, R1_FQ, R2_FQ, STAGE_DIRECTIVE, STAGE_TARGET, SUBSAMPLE_PCT, IS_POS_CTRL, IS_NEG_CTRL, N_X, N_Y, EXTERNAL_SAMPLE_ID
Terminology Guide (GA4GH Standards)
Analysis Input (SAMPLE_ID)
Pipeline input identifier - uniquely identifies a sequencing library for analysis
Subject/Individual (EXTERNAL_SAMPLE_ID)
Source organism identifier (e.g., patient ID, HG002). Multiple libraries may come from one subject.
Biosample
Physical specimen (tissue type, collection info). Captured in SAMPLE_TYPE field.
Sequencing Library
Prepared DNA/RNA with specific protocol. Captured in LIB_PREP, SEQ_VENDOR, SEQ_PLATFORM.
FASTQ Files
Raw sequencing output files (R1_FQ, R2_FQ). Include full S3 paths.
Column Mapping Reference
Column Name Description Example
SAMPLE_ID Unique identifier for this analysis input LIB001
EXTERNAL_SAMPLE_ID Subject/individual ID (e.g., patient ID) SUBJ001
SAMPLE_TYPE Type of biological sample Blood, Saliva, Tissue
LIB_PREP Library preparation method PCR-free, Nextera
SEQ_VENDOR Sequencing vendor/platform Illumina, PacBio
SEQ_PLATFORM Specific sequencing platform NovaSeq, HiSeq
R1_FQ S3 path to forward read FASTQ file s3://bucket/sample_R1.fastq.gz
R2_FQ S3 path to reverse read FASTQ file (optional for single-end) s3://bucket/sample_R2.fastq.gz
{% endblock %} {% block extra_js %} {% endblock %}