Home
Alamut-HT 1.0 User Manual
Contents
1. Splice site intronic range Intronic variants located within the specified range lt n gt from the nearest splice site are annotated as splice site in the varLocation annotation field ssIntronicRange lt n gt Genes of interest file List of genes of interest A file of HGNC gene symbols 1 per line If this is specified only variants mapped to the listed genes are annotated glist lt gene list file name gt Regions of interest file List of regions of interest ROIs A tabulated file where ROls are described as lt chromosome start end gt BED format Only variants located in ROIs are annotated roilist lt ROI list BED file name gt External annotation file Output parameters Output annotated variants List of external variant annotations to be reported in output format described below Comment By default variants that cannot be annotated are now v 1 1 also extAnnFile lt external annotation file name gt Command line an reported in the annotation output outputannonly Y file This option cancels this behavior Output VCF QUAL field applies to VCF _ SSES VCF input files only outputVCFQuality Output VCF FILTER field applies to SEKR VCF input files only outputVCFFilter VCF information Output VCF INFO fields specified by a list of IDs e g DP AF AA applies to VCF input files only outputVCFInfo ID ID VCF genotype dat
2. Updating the alamut_db database To update the alamut db database just download the latest snapshot from http downloads interactive biosoftware com and edit the alamut batch ini file to change the Database File field appropriately Interactive Biosoftware 28 28 February 2015
3. strand 1 1 Variants strand must be explicitly 1 forward strand Strand specified either for the entire A reverse strand input file or on a per variant basis as specified in column 6 of input 0 per variant file default 1 Each variant will be annotated on Annotate variants on all all available transcripts if this oe de alltrans transcripts option is specified Otherwise only the longest transcript is used Not applicable to VCF input G Each variant will be annotated on Annotate on specified Si e the transcript specified on a per spectrans transcript only BEN variant basis as specified in column 7 of input file Annotate variants on File format described below preferred transcripts listed translist in specified file Compute missense Perform Align GVGD MAPP and nomispred predictions SIFT predictions cancels default behavior Compute NNSPLICE predictions Perform NNSPLICE predictions nonnsplice cancels default behavior Interactive Biosoftware 14 28 February 2015 Compute GeneSplicer predictions Alamut Batch 1 4 User Manual Perform GeneSplicer predictions nogenesplicer cancels default behavior Ignore input errors Proceed even if input has invalid entries ignoreInputErrors Exon numbering Simple sequential or custom if available exon numbering exonnums simple custom default simple
4. What is new in version 1 4 E 7 What Was E EE 7 What was new inversion 1 2 cea odone ako kk ka tib kt a te a a a kk e kk ok ak ke Ak 7 What WAS new in version D 8 E e EE 9 Client Server GUI frontend Windows on 9 Client Server command line program Windows and Linux 9 Standalone command line program LINUX only 10 Matten RE EE 11 Using Alamut Batch ccccccccccsssssscccecececsssesseaeeeceesesceeseeeseeeceescsesesssaeaeeeeeesseeseeaseseeeesessseseaaeas 12 GUI frontend Windows only RE 12 Command line program Windows and Linux 13 Software parameters visu kab dacheaxacnssaged n nanan 14 Transcript file fOCMAL E 17 Exter alannotationTileSu E 17 Using a local HGMD Professional database installation 17 OUTPUE eg tie alan kte nal la te ake rd a kap dy a be pak ki ta bag kle sen saba ele kd oka ke ba ae AM e one dok pase aki net ena bay 18 Viewing annotated variants in Alamut Visual 25 Local splicing effect predictions se iii ouka si kod ok sa af ik lk a n fk a ta ai ia rai a piki kai 26 Installing Alamut Batch Standalone ewa on den ak py kit kt kk l a e l kk ka kk e a en 27 Alamut Batch Standalone compone
5. activation is reported as weak If it is greater than 10 then it is reported as strong If at position p there is a significant prediction set on the mutated sequence but not on the wild type sequence then Alamut Batch predicts a new splice site creation Interactive Biosoftware 26 28 February 2015 Alamut Batch 1 4 User Manual Installing Alamut Batch Standalone Alamut Batch Standalone components Alamut Batch Standalone includes the following components 1 The Alamut database It stores all gene related information used by the software 2 The alamut batch program It computes variant annotations based on data provided by the database and results computed by ancillary programs 3 Ancillary programs These are external software tools specialized in computing missense and splicing predictions e g SIFT NNSPLICE The Alamut Database As of version 1 1 11 the Alamut database is supplied as a single compressed file to be used as is by the alamut batch program MySQL is no longer required This file is a snapshot of the live database used by Alamut Visual and the Alamut Batch Client Server version Since the live Alamut database is frequently updated bi monthly snapshots are provided for Alamut Batch Standalone and can be downloaded from the Alamut website The Alamut database includes encrypted gene related information and must be queried by the alamut batch program only The current size of the database is 3 5 GBytes estim
6. maternal etc separated list Possible values clinical testing research literature only etc separated list separated list Number of stars 0 4 separated list DP DFP FP FTV DM DM see HGMD Documentation website separated list separated list separated list separated list transition transversion February 2015 All coding substitutions WT AA 1 letter WT AA 3 letters WT codon WT codon frequency Variant AA 1 letter Variant AA 3 letters Variant codon Variant codon frequency AA Position Missense only Number of orthologues in alignment Number of conserved residues in alignment Most distant species in which AA is conserved BLOSUM45 BLOSUM62 BLOSUM80 WT AA composition Variant AA composition WT AA polarity Variant AA polarity WT AA volume Variant AA volume Grantham distance AlignGVGD class AlignGVGD variation GV AlignGVGD deviation GD SIFT prediction SIFT weight SIFT median Interactive Biosoftware Alamut Batch 1 4 User Manual wtAA_1 wtAA_3 wtCodon wtCodonFreq varAA_1 varAA_3 varCodon varCodonFreq posAA nOrthos conservedOrthos conservedDistSpecies BLOSUM45 BLOSUM62 BLOSUM80 wtAAcomposition varAAcomposition wtAApolarity varAApolarity wtAAvolume varAAvolume granthamDist AGVGDclass AGVGDgv AGVGDgd SIFTprediction SIFTweight SIFTmedian 23 28 Februa
7. 4 User Manual Local splicing effect predictions Alamut Batch interprets raw splice site signal recognition by MaxEntScan NNSPLICE and Human Splicing Finder HSF in the variation vicinity to provide predictions about the creation of new splice sites or the activation of existing cryptic sites Note that this is different from predictions at the nearest splice site where only raw prediction scores are provided but not interpreted by Alamut Batch This section describes how local splicing effect predictions are computed Only the MaxEntScan NNSPLICE and HSF splice site predictors are used in the interpretation algorithm The following thresholds are used to consider or discard raw predictions e A MaxEntScan score is deemed significant if gt 0 e An NNSPLICE score is deemed significant if gt 0 4 e AnHSF score is deemed significant if gt 60 Let s define a raw prediction set as a set of raw predictions at the same position for the same signal A raw prediction set is deemed significant if at least two of MaxEntScan NNSPLICE or HSF predictions are significant If at position p excluding natural splice site positions there is a significant prediction set both on the wild type sequence and on the mutated sequence and if the mutated prediction set is significantly higher than the wild type then Alamut Batch predicts a cryptic splice site activation If the change is less than 3 it is not reported If it is less than 10 then the
8. Alamut Server name in field Network IBS Server is a ht interactive Interactive Biosoftware 9 28 February 2015 Alamut Batch 1 4 User Manual biosoftware com by default If you are based in North America please change the server name to a ht na interactive biosoftware com Standalone command line program Linux only See Installing Alamut Batch Standalone at the end of this document Interactive Biosoftware 10 28 February 2015 Alamut Batch 1 4 User Manual Variant Input file The software takes on input a list of genomic variations and outputs a list of annotations for each variant when it is located on a gene available in the Alamut database Alamut Batch supports VCF files and tab delimited files on input VCF files This is the most common format for variant description Alamut Batch supports VCF v4 0 and later Note that variants are implicitly processed on the forward strand and that monomorphic references i e entries with no alternate alleles are not supported Tab delimited files A specific tab delimited text format can also be used for variant input In this format each line should contain the following fields separated by tab characters 1 Variant id anything Chromosome 1 22 X Y Genomic position Reference nucleotide s ACGT or for insertions Mutated nucleotide s ACGT or for deletions Optional strand 1 or 1 used if strand parameter is set to 01 Optional transc
9. be restricted to a list of preferred transcripts specified in a gene transcripts file translist option e Annotation can also be restricted to a range of variants of the input file from and to options not available in the Windows GUI Two other new features are specific to the Standalone version e Multi process support Annotation jobs can now be split among multiple processes on the same computer processes option e Access to local HGMD Professional database installations has been changed since BIOBASE no longer provides a query API see Using a local HGMD Professional database installation Interactive Biosoftware 7 28 February 2015 Alamut Batch 1 4 User Manual What was new in version 1 1 Here are the new features introduced in version 1 1 Integration of HGMD the Human Gene Mutation Database data available to HGMD Professional subscribers Integration of NHLBI GO Exome Sequencing Project ESP data Unannotated variants are now reported in the annotation output file and in the failed variants output file as well unless the outputannonly option is specified If the new option ssIntronicRange n is used intronic variants located within the specified range n from the nearest splice site are annotated as splice site in the varLocation annotation field Variants can now be filtered by regions of interest defined in a BED format file roilist lt ROI list BED file name gt External annotatio
10. Ki INTEractive biosoftware alanqut A BATCH 14 user manual Alamut Batch 1 4 User Manual This document and its contents are proprietary to Interactive Biosoftware They are intended solely for the contractual use of its customer in connection with the use of the product s described herein and for no other purpose This document and its contents shall not be used or distributed for any other purpose and or otherwise communicated disclosed or reproduced in anyway whatsoever without the prior written consent of Interactive Biosoftware Interactive Biosoftware does not convey any license under its patent trademark copyright or common law rights nor similar rights of any third parties by this document The instructions in this document must be strictly and explicitly followed by qualified and properly trained personnel in order to ensure the proper and safe use of the product s described herein All of the contents of this document must be fully read and understood prior to using such product s 2015 Interactive Biosoftware All rights reserved Interactive Biosoftware 2 28 February 2015 Alamut Batch 1 4 User Manual Contents EEN 5 Product descriptio EE 5 Standalone VEN E ON kq aa bn kdk naj asa k en pous ko k kt ken kaa a tas a bite la ra anna eaae ritear 5 Client Server version 5 System regue En EE 6 Standalone VENOM bois pa bs vs pk asasen e kk pak ke eka k a a n vi RA kak eks a ka ka AA 6 Client Server version 6
11. Python MySQLdb package Interactive Biosoftware 17 28 February 2015 Alamut Batch 1 4 User Manual Output The output of Alamut Batch is a tab separated file of annotations 1 line per variant or multiple lines per variant if annotation is performed on multiple transcripts Annotations produced are listed below User defined input fields are reported as is in the last output columns NOTE SA The Chromosome field chrom was previously the fifth output field As of v1 2 it now comes as the second field followed by the new Variant position field pos that replicates the original variant position given in the input file Annotation Name Comment Id Id Variant id as supplied in input file Chromosome chrom Variant position pos As supplied in input file Failed annotation reason unnnotatedReason Field not available if option outputannonly is used Gene symbol gene HUGO Gene Nomenclature Committee HGNC symbol Gene id HGNC geneld HGNC id Transcript transcript e g NM_000249 3 Transcript strand strand Transcript length transLen Full cDNA length Protein protein e g NP_000240 1 Uniprot Uniprot Uniprot accession e g P40692 Variant Type varType substitution deletion insertion duplication delins Variant coding effect codingEffect synonymous missense nonsense in frame frameshift start loss stop loss Variant location varLocation upstream 5 UTR exon intron 3 UTR downstream splice site see s
12. a Output VCF genotype fields specified by a list of IDs e g GT AC GQ applies to VCF input files only outputVCFGenotypeData ID ID Empty values HGMD parameters Interactive Biosoftware Empty output fields are populated with specified value e g NULL Comment 15 28 outputEmptyValuesAs lt value gt Command line February 2015 HGMD Professional login Proxy parameters Internet proxy options Alamut Batch 1 4 User Manual Comment hgmdUser lt HGMD Pro user name gt hgmdPasswd lt HGMD Pro password gt Command line proxyserver lt proxy server name gt proxyport lt proxy server port number gt proxyuser lt proxy user login gt proxypasswd lt proxy password gt Interactive Biosoftware 16 28 February 2015 Alamut Batch 1 4 User Manual Transcript file format The input file for preferred transcripts is tab delimited and requires at least two columns gene name and transcript name Multiple transcripts per gene can be specified in additional columns as in the following example BRCAI gt NM 007294 3 MLH1 gt NM_000249 3 gt NM_001167618 1 External annotation files External variant annotations e g variant pathogenicity status as previously established in the lab can be integrated in the annotation output Variants are described using the chromosome name and genomic level nomenclature Variants and annotations
13. ated growth 3 GBytes year Software Programs All the required programs are either Linux executables or Python 2 6 scripts They must all be installed on the same Linux computer Ancillary programs include missense and splicing prediction tools that are either provided with the Alamut Batch Standalone package or can be installed separately see below System Requirements See above Installing Installing Alamut Batch Standalone requires two steps e Installing the alamut_db database e Installing software components Alamut Batch and ancillary programs Installing the alamut_db database Go to the Alamut Batch Standalone section of http downloads interactive biosoftware com and download the latest database snapshot Place the donwload file anywhere in the local filesystem of the computer running Alamut Batch Installing Alamut Batch Go to the Alamut Batch Standalone section of http downloads interactive biosoftware com and download the latest tarball Edit the alamut batch ini file and supply Interactive Biosoftware 27 28 February 2015 Alamut Batch 1 4 User Manual e Your Institution ID in the Institution field e Your license key in the Licence Key field e User initials as appropriate in the User field _ NOTE v The Alamut Server name in field Network IBS Server is a ht interactive biosoftware com by default If you are based in North America please change the server name to a ht na intera
14. ctive biosoftware com Set the Database File field to the full path of the downloaded database file Installing ancillary programs All ancillary software programs must be installed in the alamut batch standalone ancillary directory gt cd alamut batch standalone ancillary SIFT Download and uncompress gt wget http sift jcvi org www sift4 0 3b tar gz gt tar zxf sift4 0 3b tar gz MAPP optional Download file MAPP zip from http downloads interactive biosoftware com Linux Section Alamut Batch Standalone gt Other Downloads Unzip this file inside the ancillary sub directory NNSPLICE optional Obtain package NNSPLICEO 9 from Martin Reese mreese omicia com and unpack in the ancillary directory Note that NNSPLICE requires glibc i686 GNU 32 bit libc library Other prediction tools Other tools are either provided with the Alamut Batch distribution GeneSplicer and MaxEnt or are embedded inside al amut batch Align GVGD SSF HSF Python proxy programs Two Python proxy programs are needed to ease the communication between Alamut Batch and the ancillary programs mispred_ht py and nnsplice_ht py Both are provided in the Alamut Batch distribution and must reside in the ancillary directory The getHGMD py program also provided in the Alamut Batch distribution serves as a proxy to connect to a local HGMD Professional database if any This program requires the MySQLdb Python package
15. g shows where Alamut Batch and Alamut Visual take place in a typical NGS analysis pipeline DNA Reads Variants Mutations m_ S SO SO Sequencer Alignment venient Annotation Filterin Iinternictation cd 8 Calling 8 Reporting alamut alamut d batch d visual lt Alamut Batch can be used independently from Alamut Visual However results from Alamut Batch can be easily injected into Alamut Visual so as to benefit from its rich feature set including graphical visualization Alamut Batch annotates variants by querying a database storing information about human genes the Alamut database Technically Alamut Batch comes in two versions depending on where the gene database is located e The Standalone version uses a locally installed database e The Client Server version connects over the internet to our hosted database Standalone version The Standalone version of Alamut Batch provides best performance by including in a local installation all software components and the Alamut database required by the annotation process It is most appropriate for intensive variant annotation needs such those of whole exome analyses Alamut Batch Standalone is a Linux command line program Client Server version The Client Server version of Alamut Batch connects remotely to the central Alamut database Due to internet latency the Client Server version is slower than the Standalone Interactive Biosoftware 5 28 Februar
16. he nth variant assbly NCBI36 GRCh37 GRCh38 default GRCh37 strand 1 1 default 1 per variant not applicable to VCF input alltrans annotate variants on all transcripts spectrans annotate variants only on specified per variant Transcripts not applicable to VCF input 1 2 translist transcript file name gt annotate variants only on listed preferred transcripts glist lt gene list file name gt list of genes of interest roilist lt ROI list BED file name gt list of regions of interest nomispred no missense predictions faster nonnsplice no NNSPLICE predictions faster nogenesplicer no GeneSplicer predictions faster ignoreInputErrors proceed even if input has incorrect entries exonnums simple custom default simple ssIntronicRange lt n gt set varLocation as splice site if variant is intronic and within this range extAnnFile lt external annotation file name gt include additional annotations from external file outputannonly output only annotated variants in annotation output outputVCFQuality outputVCFFilter outputVCFInfo ID ID outputVCFGenotypeData ID ID outputEmptyValuesAs lt value gt e g NULL hgmdUser lt HGMD Pro user name gt hgmdPasswd HGMD Pro password gt proxyserver lt proxy server name gt proxyport proxy server port number gt proxyuser lt proxy user login gt p
17. n American population Alternate allele frequency in African American population Alternate allele frequency in all populations Interactive Biosoftware Alamut Batch 1 4 User Manual exacAFRHmz exacAMRHmz exacEASHmz exacSASHmz exacNFEHmz exacFINHmz exacOTHHmz exacFilter exacReadDepth espRefEACount espRefAACount espRefAllCount espAltEACount espAltAACount espAltAllCount espEAMAF espAAMAF espAllMAF espEAAAF espAAAAF espAllAAF 21 28 February 2015 Average sample read depth ClinVar ids ClinVar origins ClinVar methods ClinVar clinical significances ClinVar review status ClinVar phenotypes HGMD mutation id HGMD phenotype HGMD PubMed id HGMD sub category COSMIC ids COSMIC tissues COSMIC frequencies COSMIC sample counts Indels Inserted nucleotides Deleted nucleotides Substitutions Type WT nucleotide Variant nucleotide Nucleotide change PhastCons score phyloP Interactive Biosoftware Alamut Batch 1 4 User Manual espAvgReadDepth clinVarlds clinVarOrigins clinVarMethods clinVarClinSignifs clinVarReviewStatus clinVarPhenotypes hgmdld hgmdPhenotype hgmdPubMedld hgmdSubCategory cosmiclds cosmicTissues cosmicFreqs cosmicSampleCounts insNucs delNucs substType wtNuc varNuc nucChange phastCons phyloP 22 28 separated list separated list Possible values germline somatic de novo
18. ns supplied in variant annotation files can now be integrated in the output extAnnFile lt external annotation file name gt Version 1 1 3 adds three output fields reporting validation details of dbSNP entries Version 1 1 3 also adds three output fields reporting frequencies of ESP alternate alleles alternate alleles not always being minor alleles Version 1 1 4 adds an option to allow processing even if the input file has invalid entries Version 1 1 5 fixes a bug where variants affecting multiple genes where not processed on all genes Version 1 1 6 adds the HGMD variant sub category output field and fixes a network proxy bug for HTTPS Version 1 1 7 brings improvements to the GUI version all command line options are now also available in the graphical interface With version 1 1 7 it is now possible to input variant alleles that are the same as the transcript allele e g when the genome reference sequence has the minor allele of a SNP and the transcript has the major allele Version 1 1 7 can query a local HGMD database installation Version 1 1 8 fixes a bug occurring when a gene cannot be loaded Version 1 1 9 features performance improvements and support for mitochondrial variants Version 1 1 10 fixes a bug causing software crashes on transcripts where the STOP codon is isolated in a 3 UTR exon Version 1 1 11 supports a wider range of VCF variant descriptions i e descriptions that don t strictly comply with the format specificati
19. nts 27 The Alamut KEE E 27 Software Drograms ENER 27 System Requirements viiesa aana a aaa a aaa aaa taa Uaes iaa iaia 27 Interactive Biosoftware 3 28 February 2015 Alamut Batch 1 4 User Manual ee EE ER Installing the alamut db databasE vvrttttttetteteeeeesoooooooessesesseoooooooossasensaoonosooooensonn 27 Installing Alamut Batch 27 Installing ancillary progra MS EE 28 Oth r prediction tools EE 28 Python Proxy Drograms ccccsessssscececeeecsessnseceeeceeecseseseaeeeeeeeesseeseaeeeseceesseeseaeaeeeeeesensees 28 Updating the alamut_ Databases Genee Geheit 28 Interactive Biosoftware 4 28 February 2015 Alamut Batch 1 4 User Manual A propos This user manual describes how to install and use Alamut Batch version 1 4 0 Feb 2015 _ NOTE s Alamut Batch was previously named Alamut HT Product description Alamut Batch is a high throughput annotation engine for NGS analysis Designed for intensive variant analysis workflows this software enriches raw NGS variants with dozens of annotations including effects on human genes detailed SNP information and missense and splicing predictions Annotations provided by Alamut Batch are similar to those available in the Alamut Visual mutation interpretation software Alamut Batch is able to annotate tens of thousands variants per hour This schematic drawin
20. on ExAC allele frequency in other populations Interactive Biosoftware Alamut Batch 1 4 User Manual rsValidations e g Cluster Frequency 1000G rsValidationNumber rsAncestralAllele rsHeterozygosity rsClinicalSignificance rsMAF rsMAFAllele rsMAFCount 1000g_AF 1000g_AFR_AF 1000g_SAS_AF 1000g_EAS AF 1000g_EUR_AF 1000g AMR AF exacAllFreq exacAFRFreq exacAMRFreq exacEASFreq exacSASFreq exacNFEFreq exacFINFreq exacOTHFreq 20 28 February 2015 ExAC homozygosity ratio in African population ExAC homozygosity ratio in Latino population ExAC homozygosity ratio in East Asian population ExAC homozygosity ratio in in South Asian population ExAC homozygosity ratio in Non Finnish European population ExAC homozygosity ratio in in Finnish European population ExAC homozygosity ratio in other populations ExAC VCF filter value ExAC read depth ESP reference allele counts in European American population ESP reference allele count in African American population ESP reference allele count in all populations ESP alternate allele count in European American population ESP alternate allele count in African American population ESP alternate allele count in all populations Minor allele frequency in European American population Minor allele frequency in African American population Minor allele frequency in all populations Alternate allele frequency in Europea
21. on and can now output VCF genotype fields of all input samples Interactive Biosoftware 8 28 February 2015 Alamut Batch 1 4 User Manual Installation Download the software from http downloads interactive biosoftware com The downloaded file is a self extractable archive on Windows and a tarball on Linux Extract the contents Client Server GUI frontend Windows only Launch the program Alamut Batch Ul exe Open the Option panel and supply e Your Institution ID in the Institution field e Your license key in the Licence Key field e User initials as appropriate in the User initials field If your internet access is behind a proxy you will also need to supply appropriate proxy settings NOTE ta The Alamut Server name is a ht interactive biosoftware com by default If you are based in North America please change the server name to a ht na interactive biosoftware com r 7 Options User Insttuson Id License Key User initials HGMD User Password Network Alamut Server a ht interactwe tiosoftware com Server Port 30 Use proxy server Proxy Server Proxy Port Alamut Batch Version 1 3 Rev 0 Le Cancel Client Server command line program Windows and Linux Edit the alamut batch ini file and supply e Your Institution ID in the Institution field e Your license key in the Licence Key field e User initials as appropriate in the User field D NOTE v The
22. otations from these datasets are output as lists where each item is separated by a character Lists in each field are ordered by dataset entries For example variant MLH1 NM_000249 2 c 793C gt T has 3 entries in ClinVar yielding the following ClinVar annotation fields RCV000022502 22 RCV000075872 1 RCVO00034802 1 clinVarlds clinVarOrigins germline germline germline clinVarMethods literature only research research Pathogenic Pathogenic VUS 11311 clinVarClinSignifs clinVarReviewStatus clinVarPhenotypes Lynch syndrome ii Lynch syndrome Not provided What was new in version 1 3 Version 1 3 adds support for the GRCh38 hg38 human genome assembly and includes 1000 genomes Phase 3 version 5 variant frequencies for five sub populations African East Asian South Asian European American What was new in version 1 2 Version 1 2 introduces the following new features e Support for non protein coding genes now available in the Alamut gene database e Output annotation lines now include the original variant position provided in the input variant file This helps in reconciling variants between the output annotation file and other variant files which could previously show problematic in case of variant position changes due to application of HGVS rules e VCF quality filter information and genotype fields are now reported in the output even for not annotated variants e Annotation can now
23. p Leu123Leu Nearest exon if intronic variant 5 3 Predictions at nearest splice site ditto ditto ditto ditto ditto ditto ditto ditto ditto Average change predicted by MaxEntScan NNSPLICE and HSF New Donor Site New Acceptor Site Cryptic Donor Strongly Activated Cryptic Donor Weakly Activated Cryptic Acceptor Strongly Activated Cryptic Acceptor Weakly Activated see Section Local splicing effect predictions yes no yes no Variant flagged as suspect by dbSNP February 2015 dbSNP validation labels dbSNP number of validation categories dbSNP ancestral allele dbSNP variation average heterozygosity dbSNP variation clinical significance dbSNP variation global Minor Allele Frequency dbSNP variation global minor allele dbSNP variation sample size 1000 genomes global allele frequency 1000 genomes allele frequency in African population 1000 genomes allele frequency in South Asian population 1000 genomes allele frequency in East Asian population 1000 genomes allele frequency in European population 1000 genomes allele frequency in American population ExAC global allele frequency ExAC allele frequency in African population ExAC allele frequency in Latino population ExAC allele frequency in East Asian population ExAC allele frequency in South Asian population ExAC allele frequency in Non Finnish European population ExAC allele frequency in Finnish European populati
24. ript id used if spectrans parameter is specified Optional user defined fields e g heterozygosity number of reads etc These fields are not processed but merely reported as is in the output file Sot Se ae Empty lines and lines starting with a character are ignored Example 1d00011 1 23456 T A 42 T gt A substitution id00022 9 876543 TGA 84 TGA insertion 1d00032 5 613720 AC 2 AC deletion 1 Strand is related to the variant itself not to the transcript orientation Interactive Biosoftware 11 28 February 2015 Alamut Batch 1 4 User Manual Using Alamut Batch GUI frontend Windows only Launch the program Alamut Batch Ul exe Program options are spread over three different tabs FI Output annotated variants orty Outpt empty valves as vo FI Output VEF QUAUty field F Outpt VCF FILTER fek Output YO IN Ormaton fiekds DP AF AA Output VCF genotype fields e g GT AC GQ Options are described in section Software Parameters below Interactive Biosoftware 12 28 February 2015 Alamut Batch 1 4 User Manual Command line program Windows and Linux Synopsis alamut batch help listgenes lt output file name NCBI36 GRCh37 GRCh38 in lt variant file name gt ann lt annotation file name gt unann lt unannotated log file name gt from lt n gt start annotating from the nth variant to lt n gt annotate up to t
25. roxypasswd proxy password gt processes lt processes gt Standalone version only k ka N N Options flagged as 1 2 were new in version 1 2 Using the listgenes option puts the program in a special mode making it ouput the unsorted list of genes available in the Alamut database for the given genome assembly Options are described in section Software Parameters below Interactive Biosoftware 13 28 February 2015 Software parameters Input Output files Variant file Alamut Batch 1 4 User Manual Comment Variant input file full path name refer to Section Input file for details of the file format Command line in lt variant file name gt Annotation file Annotation output file full path name refer to Section Output file for details of the file format ann lt annotation file name gt Failed variants file Annotation parameters Output log file name This file lists the variants that could not be annotated Comment Not available in the Windows GUI unann lt unannotated log file name gt Command line from lt n gt start annotating from the nth variant R azi to n annotate up to the nth variant NCBI36 hg18 or GRCh37 hg19 The NCBI36 hg18 genome Sg assembly is still supported but assbly NCBI36 GRCh37 Y youare strongly encouraged to default GRCh37 provide the software with GRCh37 hg19 variations Not applicable to VCF input
26. ry 2015 Alamut Batch 1 4 User Manual MAPP prediction MAPPprediction MAPP p value MAPPpValue MAPP p value median MAPPpValueMedian Interactive Biosoftware 24 28 February 2015 Alamut Batch 1 4 User Manual Viewing annotated variants in Alamut Visual The genomic level and cDNA level HGVS descriptions generated by Alamut Batch annotations gNomen and cNomen can be easily copied and pasted into Alamut Visual Copy a list of HGVS descriptions L o mg o o s gDNAend gNomen cDNAstart cDNAend cNomen 37059009 Chr3 GRCh37 g 37059009A gt G 803 803 NM_000249 3 c 803A gt G 37059031 Chr3 GRCh37 g 37059027_37059030dup 821 824 NM 000249 3 c 821 824dup 37059078 Chr3 GRCh37 g 37059078del 872 872 NM 000249 3 c 872del 41276113 Chr17 GRCh37 g 41276113T gt G 1 1 NM_007300 3 c 1A gt C 41276082 Chr17 GRCh37 g 41276081 41276082insG 32 33 NM 007300 3 c 32 33insC 41276046 Chr17 GRCh37 g 41276045 41276046del 68 69 NM 007300 3 c 68 69del Ctrez sS Chr17 GRCh37 g 41276045_41276046del E Ctrl Y Ctrl X Paste Ctrl V Delete Select All Ctrl A Variants then show up in a variant list Double click on an entry to jump from a variant to another i chr3 GRCh37 g 37059009A gt G chr3 GRCh37 g 37059027_37059030 chr3 GRCh37 g 37059078del chr17 GRCh37 g 41276113T gt G chr17 GRCh37 g 41276081_4127608 chr17 GRCh37 g 41276045_4127604 Interactive Biosoftware 25 28 February 2015 Alamut Batch 1
27. sIntronicRange option Genome assembly assembly gDNA start gDNAstart gDNA end gDNAend HGVS genomic level nomenclature gNomen e g Chr3 GRCh37 g 37059009A gt G cDNA start cDNAstart cDNA end cDNAend HGVS cDNA level nomenclature cNomen e g NM_000249 3 c 803A gt G Interactive Biosoftware 18 28 February 2015 HGVS protein level nomenclature Alt Protein level nomenclature Exon Intron OMIM id Distance to nearest splice site Nearest splice site type WT seq SpliceSiteFinder score WT seq MaxEntScan score WT seq NNSPLICE score WT seq GeneSplicer score WT seq HSF score Variant seq SpliceSiteFinder score Variant seq MaxEntScan score Variant seq NNSPLICE score Variant seq GeneSplicer score Variant seq HSF score Nearest splice site change Splicing effect in variation vicinity Protein domain 1 Protein domain 2 Protein domain 3 Protein domain 4 dbSNP variation dbSNP validated variation dbSNP suspect variation Interactive Biosoftware Alamut Batch 1 4 User Manual pNomen alt_pNomen exon intron omimld distNearestSS nearestSSType wtSSFScore wtMaxEntScore wtNNSScore wtGSScore wtHSFScore varSSFScore varMaxEntScore varNNSScore varGSScore varHSFScore nearestSSChange localSpliceEffect proteinDomain1 proteinDomain2 proteinDomain3 proteinDomain4 rsid rsValidated rsSuspect 19 28 e g p Glu268Gly Like pNomen except for synonymous variants e g
28. should be supplied in tab delimited text files using the following format e First line Tab separated list of annotation labels preceded by chrom and gNomen for clarity For example chrom gt gNomen gt Class gt Freq where gt denotes tabulation characters and Class and Freq are annotation labels e Other lines Tab separated variant description and annotation values in the same order as specified in line 1 For example chr1 gt g 45800167G gt A gt Likely pathogenic gt Q 09Q1 chr13 gt g 32929387T gt C gt Unknown gt 0 005 Annotation labels as supplied in first line are reported in the first line of the output file When input variants and externally annotated variants match the annotation output contains corresponding annotation values Note that multiple external variant annotation files can be supplied using option extAnnFile multiple times Using a local HGMD Professional database installation If you have a downloaded version of HGMD Professional you can let Alamut Batch query it locally rather than over the internet To achieve this you will need to edit the alamut batch ini file and add an HGMD section to specify how to connect to the local server as shown in the following example HGMD host 192 168 0 1 user my_hgmd_user password my_hgmd_passwd database hgmd_pro NOTE v Querying a local H6MD Pro database is available on Linux only and requires Python and the
29. y 2015 Alamut Batch 1 4 User Manual version but is very easy to install It is however an efficient solution for moderate variant annotation needs such those of gene panels sequencing analyses Alamut Batch Client Server is available as a command line program on Windows and Linux operating systems The software is also available with a GUI frontend on Windows System requirements Standalone version Alamut Batch Standalone requires the following system specifications e 64 bit CentOS 6 4 distribution or other compatible Linux distribution e Python 2 6 or 2 7 e Python MySQLdb package if access to a local HGMD Professional database installation is intended e OpenSSL client libraries e g RPM package openssl x86_ 64 e 4GBRAM minimum e 5 GB hard drive space e Internet connection required for license control Client Server version Alamut Batch Client Server requires the following system specifications e 64 bit CentOS 6 4 distribution or other compatible Linux distribution e 4GBRAM minimum e 100 MB hard drive space e Internet connection required e Windows XP 7 or 8 32 bit or 64 bit e 2GBRAM minimum e 50 MB hard drive space e Internet connection required Interactive Biosoftware 6 28 February 2015 Alamut Batch 1 4 User Manual What is new in version 1 4 Version 1 4 adds annotations from ClinVar COSMIC and ExAC Note that since a given genomic variant can match multiple ClinVar or COSMIC records ann
Download Pdf Manuals
Related Search
Related Contents
Valueline VLEP11040W30 power cable Apple iPod 4th Generation MP3 Player manuale istruzioni operating instructions manuel d'instructions ' Service Manual - Technics DJ Store さあ、 以上で終了です。 ー0個のテーマに従って説明すれば、 自然とー ARRETE DU 2 OCTOBRE 1997 Garmin Forerunner 15 Quick Start Manual Philips SHE3622 Manual do Usuário Nokia 6101 ThePortablePaperV2N3_48pagesMay Copyright © All rights reserved.
Failed to retrieve file