IBS Summary File¶
This document describes the structure of the IBS Summary File, a key output used in comparative genomics analyses to record Identical By State (IBS) segments across genomic intervals for a given sample. Along with this summary file, the findIBS command can also produce a BED file format output, which is useful for visualization in genome browsers.
File Format¶
The IBS summary file is a tab-delimited text file where each row represents a genomic region (blocks) for a given sample. Each column is described in detail below.
Column Definitions¶
| Column | Data Type | Description |
|---|---|---|
Block |
int |
Unique identifier for the genomic block (can be sequential or tool-specific). |
Sample |
string |
Sample identifier. It is recommended to use standardized identifiers (e.g., SAM001, SAM002) instead of internal IDs like LK067. |
Chromosome |
string |
Name of the chromosome or scaffold. Standard naming conventions such as chr1, chr2, ..., chrUn (for unplaced scaffolds) should be used. |
Start |
int |
Start position (0-based, inclusive) of the block in base pairs. |
End |
int |
End position (0-based, inclusive) of the block in base pairs. |
Length |
int |
Length of the block in base pairs. Calculated as End - Start + 1. |
TotalBlocks |
int |
Total number of sub-units (or smaller analysis windows) within this block. |
IBSBlocks |
int |
Number of sub-units within the block that were classified as IBS. |
IBSProportion |
float |
Fraction of sub-blocks that are IBS. Computed as IBSBlocks / TotalBlocks. Ranges from 0.0 to 1.0. |
MeanScore |
float |
Mean similarity score across the block (can reflect allele sharing, IBS strength, or other scoring mechanisms depending on the tool used). |
Example¶
Block Sample Chromosome Start End Length TotalBlocks IBSBlocks IBSProportion MeanScore
3495 SAM067 chrUn_01 0 15194 15194 4 4 1.00 0.03
3475 SAM067 chrUn_02 0 59659 59659 12 9 0.75 74.59
3148 SAM067 chr5 1421134 1426134 5000 1 1 1.00 92.11
Note
- A high IBSProportion (
1.00) indicates that the entire region shows complete similarity across all tested windows. - A low MeanScore may indicate sparse or weak similarity, even if all windows are IBS.
- A block with IBSProportion < 1.0 can be indicative of genomic heterogeneity or recombination signals in that region.
- Short blocks with high IBS might be less informative than longer blocks with consistent similarity across a large region.