Home
Geneious User Manual - Division of Physical & Biological Sciences
Contents
1. iw f Graphical Tree View Sequence View Text View b General b Layout w Formatting v Transform branches Transform cladogram HJ 7 Order branches Ordering increasing Y Show Root Branch Line Weight 21 gt mM Show Tip Labels v mM Show Node Labels Display Node Heights HJ a Font Size 11k Significant Digits 4k mE 1 Figure 3 6 A view of a phylogenetic tree in Geneious Transform branches allows the branches to be equal like a cladogram or proportional Leaving it unselected leaves the tree in its original form Ordering orders branches in increasing or decreasing order of length but within each clade or cluster Show root branch displays the position of the root of the tree Line weight can be increased or decreased to change the thickness of the lines representing the branches If you are unfamiliar with tree structures please refer to Figure 3 7 for the following options Show tip labels This refers to labels on the tips of the branches of the tree Show node labels This refers to labels on the internal nodes of the tree Show branch labels This refers to the branches of the tree Show scale bar This displays a scale bar at the bottom of the tree view to indicate the length of the branches of the tree It has three options Scale range font size and line weight Depending on the option Display can be branch lengths taxon names or node heigh
2. ks A E scd mm O 250 260 Colors LALFISFTGL AFLGGRDRRR WKCISCMYVV MEDSFLTEAL ESDADWEOKI 549172 QNRNFCWENY SFFEIMENPE WCHFXTYTEG NEHPSNATWL FZYBEANSET AMEFOMCSES RWEMGSYMMT Hydropho 549172 product Transmembrane protein HWLF3foom Options b Statistics 342 US20 gene Transmembrane region Hydropho Malla Vd zzi inhi NN ER 549172 EMMEFFCPPN QCXRHAMCEY EBDSNAXFBTEH BEMBSGPRWE SESDGYPEDN GTBTAASTTG KS g 488 gt Hydropho AA umi A Eifion Mi oe mo I Cursor before residue 0 Figure 3 3 The annotations options in the sequence viewer Compress annotations This option reduces the vertical height of the annotations on display This reduces the space occupied by annotations by allowing them to overlap and increases the amount of the sequence displayed on the screen Hide all Show all These buttons can be used to turn off and on all annotations on the sequence The sequence viewer toolbar The top of the sequence viewer panel shows a toolbar containing several actions Some of them operate on a part of a sequence or alignment There are several ways to make such a selection e Mouse dragging Click and hold down the left mouse button at the start position and drag to the end position e Select from annotations When annotations are available click on any annotation to select the annotated residues 56 CHAPTER 3 ANALYSING DATA e Click on sequence name This will select
3. A missing annotation E env alignment AB234101 1 AB This is your Local Documents service D You can drag and drop articles and sequences that you find in external databases for example NCBI into this folder for future reference Help Panel Right click on this folder and select Share Folder to allow your contacts to search it over the Collaboration service If you share a folder its sub folders are always also shared Any folders you share can be searched by all your contacts R m al Sequence View Text View Obras Reverse C LY df Edt p Annotation is Q i 10 20 Ed l consensus GBGBRAGBEN GANGEAGEGA BGBSGBGECA cH AB234107 1 fe senn ss Document Viewer E I a Panel E CRRA ie v ssidue 85 ungapped residue 85 J Figure 1 2 The main window in Geneious 8 CHAPTER 1 GETIING STARTED 1 2 1 The Services Panel The Services Panel contains the services Geneious offers These include your local documents including sample documents EMBL and NCBI links and Collaboration All these services will be described in detail later in the manual For more information see section 2 1 1 1 2 2 The Document Table The Document Table displays summaries of downloaded data such as DNA sequences protein sequences journal articles sequence alignments and trees By clicking on the search icon you can search data for text or by sequence similarity BLAST Data can also
4. The Entrez Structure database This is NCBI s structure database and is also called MMDB Molecular Modeling Database It contains three dimensional biomolecular experimentally or programmatically determined structures obtained from the Protein Data Bank The PubMed database This is a service of the U S National Library of Medicine that includes over 16 million citations from MEDLINE and other life science journals This archive of biomed ical articles dates back to the 1950s PubMed includes links to full text articles and other related resources with the exception of those journals that need licenses to access their most recent issues Entrez Taxonomy This database contains the names of all organisms that are represented in the NCBI genetic database Each organism must be represented by at least one nucleotide or protein sequence UniProt This database is a comprehensive catalogue of protein data It includes protein se quences and functions from Swiss Prot TrEMBL and PIR It has three main components each optimized for a particular purpose The scope and depth of these databases make them critical information sources for molecu lar biologists and bioinformaticians alike However a library is only as good as its librarian Geneious is your librarian allowing you to search for filter and store only the data that you care about 32 CHAPTER 2 RETRIEVING AND STORING DATA 2 4 2 Accessing NCBI BLAST through Geneious BLAST
5. ALTZ 97 CTACAGCAG CAACAGGAAC AGGAGCAGTA A SN 96 D ACACCAGCAG CAAAAGGAGT AGGAGCAGTA ALUG 99 CCAGCAG CAGAAGGAGT AGGAGCAGCA A1 KE 00 K CTGCACCAG CACCAGGAGT AGGAGCAGTA Y Similarity MA NL NN MA 15n Alt left click on a sequence position or annotation or select a region to zoom in Alt right click to zoom out Figure 3 2 The similarity graph for an alignment of nucleotide sequences Hydrophobicity This is available with protein sequences It displays the Hydrophobicity of the residue at every position or the average Hydrophobicity when there are multiple sequences pl pI stands for Isoelectric point and refers to the pH at which a molecule carries no net elec trical charge The pl plot displays the pl of the protein at every position along the sequence or 3 1 DOCUMENT VIEWERS IN GENEIOUS 53 the average pI when multiple sequences are being viewed Sliding window size This calculates the value of the graph at each position by averaging across a number of surrounding positions When the value is 1 no averaging is performed When the value is 3 the value of the graph is the average of the residue value at that position and the values on either side Chromatogram This is available with chromatogram traces It displays the four traces above the sequence where the peak as detected by the base calling program is at the middle of the base letter When viewing more than one chromatogram or an alignment made from chro ma
6. This option is visible when viewing protein sequences chromatogram traces multiple se quences or sequence alignments Turn this option on by clicking the Graph checkbox and the graph s will be displayed below the sequence s A number of graphs are available Similarity This is available for sequence alignments and when more than one nucleotide protein sequence are displayed It displays the similarity across all sequences for every position Green means that the residue at the position is the same across all sequences Yellow is for less than complete similarity and red refers to very low similarity for the given position Figure 3 2 x n os 7 Extract Reverse Complement 9 Translate Edit Y Annotation Ill save p id 40 GTAGGATGGC CTGAGGTTAG 60 GGAAAGAATG gt Zoom Level A1 KE 00 K ALTZ 97 GTGGGATGGC CTGAGATTAG GGAAAGAATG v MGraphs A SN 96 D GCGGGATGGC CTAAGGTTAG GGATAGGATG M similarity A1 UG 99 GTGGGATGGC CTAGGGTTAG GGAAAGAATA Sliding window size 1 A1 KE 00 K AAGGGATGGC CTGAAGTTAG GGAGAGAATG A Similarity ma EN GN NN RE gt Consensus ALKE 00 K AGACGTGCTC GTGCTCCTGC AGCAGCACCT C Annotations KIEL AGACGAACT umm mee sarna FM Layout ASHNOSD AGACARACTO remos mo mra ones CTCCA gt M Colors ALUGOS AGACAM CIC mss cT SSS A1 KE 00 K EAT siet m A gt Statistics 636 85 Similarity Ea p s ALKE 00 K ACTGCCGCAG CACCTGGAGT AGGAGCAGTT
7. View Constraints Sex View Constraints Height View Constraints Create Note Type Delete Note Type Edit Note Type Add Note Done Figure 2 12 Viewing constraints for a field Using Note Types The main purpose of Notes is to add user defined information to Geneious documents How ever Notes and Note Types can be searched for and filtered as well Also columns can be ordered according to the values of an added Note Searching Once a Note Type is defined and a Note of that type added it is automatically added to the standard search fields These are listed under the Advanced Search options in the Document Table From then on you can use them to search your Local Documents If you have more than one Field Type for a Note Type they will both appear as searchable fields in the search criteria Filtering Note values can be used to filter the documents being viewed To do so type a value of your Note Type into the Filter Box in the right hand side of the Toolbar Only matching documents will be shown Ordering columns The fields and values of an added Note Type will appear as columns in the 44 CHAPTER 2 RETRIEVING AND STORING DATA Document Table These new columns can be used to order the table Take the example of pro tein size A click on the column heading will order the documents in increasing or decreasing order according to their protein size Clicking the column heading again
8. Your documents are stored at the location specified by the Data Storage Location field see Figure 2 8 You can change this location by clicking the Browse button and selecting a new location Geneious will remember this new location when you exit eoe Preferences General Plugins General settings Data Storage Location Users alexei Geneious Browse Figure 2 8 Setting the location of your local documents 2 6 Agents Geneious offers a simple way for you to continuously receive the latest information on genomes sequences and protein structures This feature is called an agent Each agent is a user defined automated search You can instruct an agent to search any Geneious accessible database at reg ular intervals eg weekly including your contacts on Collaboration This simple but powerful feature ensures that you never miss that critical article or DNA sequence To manage agents click on the agent icon in the toolbar An agent has to be set up before it can be used 2 6 1 Creating agents To set up an Agent click the Agents icon and the create button You now need to specify a set of search criteria in the exact same way as you do for search the database to search search frequency and the folder you wish the agent to deliver its results to The search frequency may be specified in minutes hours days or weeks You can only use whole numbers Selecting Only get documents created after today will cau
9. GSS and CoreNucleotide The core nucleotide database brings together information from three other databases GenBank EMBI and DDBJ These are part of the International collaboration of Sequence Databases This database also contains Ref Seq records which are NCBI curated non redundant sets of sequences The Entrez Popset database This database contains sets of aligned sequences that are the result of population phylogenetic or mutation studies These alignments usually describe evolution and population variation The PopSet database contains both nucleotide and protein sequence data and can be used to analyze the evolutionary relatedness of a population 2 4 PUBLIC DATABASES 31 Table 2 2 Acronyms Acronym Full name DDBJ DNA Data Bank of Japan EMBL European Molecular Biology Laboratory cDNA Complementary DNA GSS Genomic survey sequences PDB Protein Data Bank EST Expressed Sequence Tags UniProt the Universal Protein Resource RefSeq Reference Sequence PIR Protein Information Resource PRE Protein Research Foundation SIS Sequence Tagged Site HTGS High Throughput Genomic Sequence The Entrez Protein database This database contains sequence data from the translated coding regions from DNA sequences in GenBank EMBL and DDBJ as well as protein sequences sub mitted to the Protein Information Resource PIR SWISS PROT Protein Research Foundation PRE and Protein Data Bank PDB sequences from solved structures
10. Uncheck the color checkbox to turn off all coloring without viewing further options Coloring schemes differ depending on the type of sequence For example the Polarity and Hydrophobicity coloring schemes are available only for Protein sequences Layout Layout has various options controlling the layout of the sequence e Show tree This toggles the display of the phylogenetic tree when viewing the alignment of a phylogeny document e Show residue positions This toggles the display of the residue position number above the sequence residues e Show original sequence positions This toggles the display of the residue position numbers for the original sequence on a per sequence basis It is only available for alignment docu ments and sequences that were extracted from other sequences e Show space every 10 residues If you are zoomed in far enough to be able to see individual residues then an extra white space can be seen every 10 residues when this option is selected e Wrap sequence This wraps the sequences in the viewing area A shortcut is to click the layout check box without expanding it e Wrap on 10 residue boundaries This is automatically turned on if the wrap sequence option is on and will force the sequence wrapping to occur in multiples of 10 nucleotides or amino acids 52 CHAPTER 3 ANALYSING DATA e Show sequence and graph names Show or hide sequence and graph names inside the se quence viewer panel Graphs
11. 5 The Toolbar The toolbar contains several icons that provide shortcuts to common functions in Geneious You can alter the contents of the toolbar to suit your own needs The icons can be displayed small or large and with or without their labels The Help icon is always available The Back and Forward options help you move between previous views in Geneious and are analogous to the back and forward buttons in a web browser The V option shows a list of previous views The other features that can be accessed from the toolbar are described in later sections The toolbar can be customized by right clicking Ctrl Click on MacOS on it This gives a popup menu with the following options e Show Labels Turn the text labels on or off e Large Icons Switch between large and small icons e A list of all available toolbar buttons Selecting deselecting buttons will show hide the buttons in the toolbar 20 CHAPTER 2 RETRIEVING AND STORING DATA 2 1 6 Status bar Below the Toolbar there is a grey status bar This bar displays the status of the currently selected service For example when you are running a search it displays the number of matches and the time remaining for the search to finish 2 17 The Menu bar File Menu This contains some standard File menu items including printing and Exit on Windows On Geneious pro it contains New Sequence which will create a new nucleotide or protein sequence from residues that you can p
12. DDBJ PDB sequences no EST STS GSS or HTGS sequences genome Genomic entries from NCBI s Reference Sequence project est Database of GenBank EMBL DDBJ sequences from EST Divisions est human Human subset of est est mouse Mouse subset of est est others Non Human non mouse subset of est gss Genome Survey Sequence includes single pass genomic data exon trapped sequences and Alu PCR sequences htgs Unfinished High Throughput Genomic Sequences phases 0 1 and 2 finished phase 3 HTG sequences are in nr pat Nucleotide sequences derived from the Patent division of GenBank PDB Sequences derived from the 3D structures of proteins from PDB month All new updated GenBank EMBL DDBJ PDB sequences released in the last 30 days RefSeq NCBlI curated non redundant sets of sequences dbsts Database of GenBank EMBL DDBJ sequences from STS Divisions chromosome A database with complete genomes and chromosomes from the NCBI Reference Sequence project wgs A database for whole genome shotgun sequence entries env nt This contains DNA sequences from the environment i e all organisms put together Geneious can perform five different kinds of BLAST search e blastp Compares an amino acid query sequence against a protein sequence database e blastn Compares a nucleotide query sequence against a nucleotide sequence database 2 5 STORING DATA YOUR LOCAL DOCUMENTS 33 Table 2 4 Protein sequence searches in the BLAST databases Database
13. Geneious also needs Java 1 5 to run If you do not have this on your system already please h h h Li Li 7 L 63 1 3 TROUBLESHOOTING 11 download it from http www geneious com On the download page select the Includes Java 1 5 option This involves downloading a larger file If you are a Mac user and have OS X 10 4 or later you will have to download Java 1 5 from http www apple com support downloads java2se50release3 html 1 3 2 I get a connection error when trying to search using NCBI or EMBL If the message reads Check your connection settings there is a problem with your Internet connection Make sure you are still connected to the Internet Both Dial up and Broadband can disconnect If you are connected then the error message indicates you are behind a proxy server and Geneious has been unable to detect you proxy settings automatically You can fix this problem 1 Check the browser you are using These instructions are for Explorer Safari and Firefox 2 Open your default browser 3 Use the steps in Figure 1 6 for each browser to find the connection settings 4 Now go into Geneious and select Preferences There are two ways to do this e Shortcut keys Ctrl Shift P Windows Linux Command Shift P Mac OS X e Tools Menu Preferences 5 This opens the Preferences Click on the General tab There are five options in the drop down options under Connection settings Figure 1 7 e Use dire
14. Panel The Services Panel shows a tree that concisely displays sources of data and your stored docu ments The plus symbol indicates that a folder contains sub folders A minus indicates that the folder has been expanded completely and has no sub folders Click these symbols to expand or contract folders 15 16 File Edit Tools Help Menu Bar CHAPTER 2 RETRIEVING AND STORING DATA E o Ze o m a gt Pu D Tool Bar a Back Forward Search Agents Note Sor Alignment Tree Help Completed Found 2 documents Sources Help E 3 M S 9 Local 0 T Alignment v Include Subfolders More Options x se gt acna 30 m i e ebgC 58 Name Summary gt Envelope sequences 108 A missing annotation house mouse geri Folder Alig nments B f Genes 0 gt Downloaded 336 0 Extracted 52 New ios 27 unread 3D Structures 2 i Alignments 7 5 Nucleotide Documents 3 1 Protein Documents 2 gt Tree Documents 2 e Temporary 3 2 unread amp Collaboration 8 strabo richard amp talk geneious com amp 9 Emer E Uniprot amp 9 NCBI 5 8 BLAST E blastn E blastp blastx i 2 tblastn tblastx B Genome B Nucleotide E PopSet 3 Protein A PubMed S Structure E Taxonomy A cine ECHTE AB234101 1 AB Sequence View Text View En Wy Annotation Ch Ext
15. Reducing the required num ber of matches for a given window size will increase the sensitivity of the dotplot but will also lead to more false positives regions that match due to chance alone If the stringency is set to 3 and the window size to 5 then a pair of 5 base windows that contain 3 or more matching nucleotides will be classified as a match A dot corresponding to the center of the two windows will then be displayed in the dotplot Anything less will be classified as a mismatch and no dot will be drawn 3 4 1 Viewing Dotplots To view a dotplot in Geneious select two nucleotide or protein sequences in the Document Table and select Dotplot Viewer in the Document Viewer Panel see Figure 3 4 For more details on the dotplot viewer and the available options refer to section 3 1 2 64 CHAPTER 3 ANALYSING DATA 3 4 2 Interpreting a Dotplot e Each axis of the plot represents a sequence e A long largely continuous diagonal indicates that the sequences are related along their entire length e Sequences with some limited regions of similarity will display short stretches of diagonal lines e Diagonals on either side of the main diagonal indicate repeat regions caused by duplica tion e A random scattering of dots reflects a lack of significant similarity These dots are caused by short sub sequences that match by chance alone For more information on dotplots refer to the paper by Maizel amp Lenk 15 3 5 Pairwi
16. can be dragged to another location if desired Chapter 4 Smart Folders pro only Smart folders are a new feature of Geneious that allow you to separate relevant data from extraneous search results retrieved by an agent Smart Folders are created from within the Create Agent dialog To open the Create Agent dialog choose the Agents button from the toolbar and then select Create from the agents dialog Choose a folder for the agent or create a new one and make sure that the Make destination folder a smart folder checkbox is checked When a folder is turned into a smart folder it is given a subfolder called reject At first all the documents delivered by the agent will be put in this folder Drag the documents that you want to keep into the main folder and future documents delivered by the agent will be compared to the accepted and the reject documents and stored in one or other of the two folders appropriately Make sure that you leave documents in the reject folder as smart folders need negative examples to build an accurate comparison model Note that unread documents in the main folder will not be compared while all documents in the reject folder will be 79 80 CHAPTER 4 SMART FOLDERS PRO ONLY Chapter 5 Geneious Education pro only This feature allows a teacher to create interactive tutorials and exercises for their students A tutorial consists of a number of HTML pages and Geneious documents The student
17. can be imported into Geneious 2 23 Where does my imported data go The above formats can be all imported into Geneious from files on your hard drive Geneious also enables you to download certain types of documents directly from public databases such as NCBI and EMBL The method used to retrieve a particular piece of data will determine where in Geneious it is stored Data imported from your hard drive This is imported directly into the currently selected local folder within Geneious If no folder is selected Geneious open a dialog which lets you specify a folder Data from an NCBI EMBL Contacts search Data downloaded from public databases within Geneious will appear in the Document Table and can be dragged from there into a local folder of your choice 26 CHAPTER 2 RETRIEVING AND STORING DATA Important if you don t drag the documents from a database search into your local folders the results will be lost when Geneious is closed 2 234 Data output formats Each data type has several export options Any set of documents may be exported in Geneious native format Data type Export format options DNA sequence FASTA Geneious Amino acid sequence FASTA Geneious Protein 3D structure PDB FASTA Geneious Multiple sequence alignment FASTA NEXUS 12 Geneious Phylogenetic tree Phylip phy FASTA NEXUS 12 MEGAS 11 Geneious PDF document PDF Geneious 2 2 5 Export to comma separated values CSV file The
18. decline the chat invitation If you click the Accept button a chat window will open and you can begin chatting with your contact and anyone else they have invited If you want to decline you can optionally enter a reason that the contact and anyone else in the chat will see explaining why you do not want to chat then click the Decline button to inform those in the chat that you will not be joining them 90 CHAPTER 6 COLLABORATION PRO ONLY 6 5 2 The Chat Window Once you have started or joined a chat the Chat Window will appear In the main section of the Chat Window you will see the messages that the other contacts in the chat have sent as well as information as to who has joined or left the chat At the bottom of the Chat Window is a text box where you can write messages you want to send to the chat To send messages click the Send button or press the Enter key To leave the chat close the Chat Window Bibliography 1 SF Altschul W Gish W Miller EW Myers and DJ Lipman Basic local alignment search tool J Mol Biol 215 1990 no 3 403 410 17 18 22 32 2 MO Dayhoff ed Atlas of protein sequence and structure vol 5 National biomedical re search foundation Washington DC 1978 65 66 3 R Durbin S Eddy A Krogh and G Mitchison Biological sequence analysis Cambridge University Press 1998 67 4 J Felsenstein Confidence limits on phylogenies An approach using the bootstrap Evolution 39
19. different rate to the transversions Tamura Nei This model also assumes different equilibrium base frequencies In addition to distinguishing between transitions and transversions it also allows the two types of transitions A gt G and C e T to have different rates 21 3 8 5 Resampling Bootstrapping and jackknifing Resampling is a statistical technique where a procedure such as phylogenetic tree building is repeated on a series of data sets generated by sampling from an original data set The results of analyzing the sampled data sets are then combined to generate summary information about the original data set In the context of tree building resampling involves generating a series of sequence alignments by sampling columns from the original sequence alignment Each of these alignments known as pseudoreplicates is then used to build an individual phylogenetic tree A consensus tree is constructed from by combining information from the set of generated trees providing an estimate for the level of support for each clade in the final consensus tree 4 Bootstrapping is the statistical method of resampling with replacement To apply bootstrapping in the context of tree building each pseudo replicate is constructed by randomly sampling columns of the original alignment with replacement until an alignment of the same size is obtained 4 72 CHAPTER 3 ANALYSING DATA Jackknifing is a statistical method of numerical resampling
20. document when creating a bibliography Alternatively a set of articles in Geneious can be directly exported to an EndNote 8 0 compatible format This is usually done when creating a bibliography for Microsoft Word documents 3 3 Sequence data Basic techniques such as dotplots and pairwise alignments can be used to study the relation ships between two sequences However as the number of sequences increases methods for determining the evolutionary relationships between them become more complicated When analyzing more than two sequences there are some common steps to determine the ancestral relationships between them The following sections outline the basic tools for prelim inary sequence analysis dot plots sequence alignment and phylogenetic tree building 3 4 Dotplots A dotplot compares two sequences against each other and helps identify similar regions 13 Using this tool it can be determined whether a similarity between the two sequences is global present from start to end or local present in patches The dotplot uses a window of comparison to determine the level of similarity between every pair of sub sequences of length window size in the two sequences being compared The dots in a dotplot are determined by two factors stringency and window size The strin gency is the number of matches required in the given window size for a dot to be plotted and it acts as a threshold that determines the sensitivity of the dot plot
21. edits the pages and documents to answer the tutorial questions and then exports the tutorial to submit for marking 5 1 Creating a tutorial The backbone of Geneious Tutorials are the HTML documents Simply create your documents and place them together in a folder If you make a page called index html it will be treated as the main page Geneious will follow all hyperlinks between the pages and external hyperlinks beginning with http will be opened in the user s browser If you want to include figures and diagrams in the pages just put the image files in the folder and reference them with lt img gt tags like a normal HTML document supported image formats are GIF JPG and PNG If you want to include Geneious documents in your tutorial simply place them in the folder as above and they will automatically be imported into Geneious with the tutorial If you want to link to them from the tutorial pages create a hyperlink pointing to the file in the HTML document For example to create a link to the file sequence fasta in your tutorial folder use the HTML lt a href sequence fasta gt click here lt a gt To open more than one document from a link separate the filenames with the pipe character for ex ample lt a href sequence fasta sequence2 fasta click here lt a gt Note that geneious files must contain only one document to be imported automatically with the tutorial You can add a short one line summary by
22. in the original location In external databases such as NCBI the documents will be copied leaving one in its original location Drag and copy While dragging a document over to your folder hold the Ctrl key Alt key on Maje s down This places a copy of the document in the target folder while leaving a copy in the original location This is useful if you want copies in different folders The Edit menu Select the document and then open the Edit menu on the menu bar Click on Cut Ctrl X Command X or Copy Ctrl C Command C Select the destination folder and Paste Ctrl V Command V the document into it 2 5 2 Searching your Local folders The Services Panel allows you to browse your Local folder hierarchy Next to each folder name in the hierarchy is the number of documents it contains in brackets When the Local folder or a sub folder is collapsed minimized the brackets next to the folder shows how many files are contained in that folder as well as all of its sub folders In addition if some of the documents in a folder are unread the number of unread documents will also appear in the brackets You can search the Local folder and sub folders the same way you search the public databases by clicking on the Search icon If you have defined a new type of note in Geneious and a Note has been added it will also be added to the Advanced Search criteria Look at an example of a new Note type called Protein size which ta
23. model of nucleotide substitution The parameters of the substitution model define a rate matrix that can be used to calculate the probability of evolving from one base to another in a given period of time This section briefly discusses some of the substitution models available in Geneious Most models are variations of two sets of parameters the equilibrium frequencies and relative substitution rates Equilibrium frequencies refer to the background probability of each of the four bases A C G T in the DNA sequences This is represented as a vector of four probabilities 74 nc TG mr that sum to 1 3 8 BUILDING PHYLOGENETIC TREES 71 Relative substitution rates define the rate at which each of the transitions 4 gt G C T and transversions A gt C A e T C e G G e T occur in an evolving sequence It is represented as a 4x4 matrix with rates for substitutions from every base to every other base Jukes Cantor This is the simplest substitution model 10 It assumes that all bases have the same equilibrium base frequency i e each nucleotide base occurs with a frequency of 25 in DNA sequences and each amino acid occurs with a frequency of 5 in protein sequences This model also assumes that all nucleotide substitutions occur at equal rates and all amino acid replacements occur at equal rates HKY The HKY model 8 assumes every base has a different equilibrium base frequency and also assumes that transitions evolve at a
24. over an annotation The information will be presented in a popup box Alternatively double clicking on an annotation will display its details in the annotation editing dialog The best way to save a primer or dna probe for further testing or use is to select the annotation for that primer and click the Extract button in the sequence viewer This will generate a separate short sequence document which just contains the primer sequence and the annotation so it retains all the information on the primer In the case of the reverse primer it should be reverse complemented When the Extract button is chosen for the reverse primer it will offer to reverse complement because the annotation runs in the reverse direction choose Yes When no primers can be found If no primers or DNA probes that match the specified criteria can be found any of the sequences then a dialog is shown which describes how many failed and for what reasons To see why no primers or DNA probes were found for particular sequences click the Details button at the bottom of the dialog The dialog will then open out to display a list of all the sequences for which no primers or DNA probes were found For each of the sequences the following information is listed e Which of Forward Primer Reverse Primer Primer Pair and or DNA Probe could not be found in the sequence 76 CHAPTER 3 ANALYSING DATA e 1st left primer ype primer bind esidue 8 740 8 759 Pair Primer Dim
25. text you are searching for and Geneious will display all the documents that match this text and hide all other documents in the Document Table To view all the documents in a folder clear the Filter box of text or click the button The Sort button in the toolbar provides two actions in a popup menu Sort by similarity is available when a single sequence document is selected in the Document Table It will rank all other sequences by their similarity to the selected sequence The most similar sequence is placed at the top and the least similar sequence at the bottom This also produces an E value column describing how similar the sequences are to the selected one The Remove Sort by Similarity action will remove the E value column and return the table to its previous sorting 2 7 1 Filtering on the fly Filtering can be used while searching for documents via public databases filtering data as it is being downloaded Type in the appropriate text in the Filter Box and only those documents that match both the original criteria as specified by the search terms and the Filter text will be displayed This is an effective way of filtering within your search results 2 8 Notes Add Note is a unique feature offered by Geneious Any Notes that you add can be treated as user defined fields for use in sorting searching and filtering your documents 2 8 NOTES 41 Where can I add Notes The Add Note function can be used to add note
26. that Geneious Tutorial File is selected as the filetype and then give it a name and click Export Chapter 6 Collaboration pro only Collaboration allows Geneious users to share the products of their research and work with each other It uses an open internet protocol called Jabber to move documents across the internet It can work with any existing Jabber service such as Google Talk but we recommend using the Geneious default talk geneious com If you have the knowledge to run and manage your own Jabber server we recommend us ing Wildfire from Jive Software http www jivesoftware org wildfire which is available for free under the GNU General Public License http www gnu org copyleft gpl html Collaboration is only available in Geneious pro This chapter shows you how to e Create a new collaboration account e Search for and add contacts to your account e Share local folders with your contacts e Search your contacts as you would an online database 6 1 Managing Your Accounts When you start Geneious you will see the empty Collaboration service in the Services Panel and the Collaboration menu at the top You can open the Add New Account dialog by either right clicking Ctrl click on MacOS on Collaboration in the Services Panel and clicking Add New Account in the popup menu or by selecting the same option from menu at the top 83 84 CHAPTER 6 COLLABORATION PRO ONLY 6 1 1 Add New Account In
27. the two sequences are removed In subsequent steps one of three things can happen e Another pair of sequences is aligned e A sequence is aligned with one of the intermediate alignments e A pair of intermediate alignments is aligned This process is repeated until a single alignment containing all of the sequences remains Feng amp Doolittle were the first to describe progressive pairwise alignment 5 Their algorithm used a guide tree to choose which pair of sequences alignments to align at each step Many variations of the progressive pairwise alignment algorithm exist including the one used in the popular alignment software ClustalX 22 3 6 2 Multiple sequence alignment in Geneious Multiple sequence alignment in Geneious is done using progressive pairwise alignment The neighbor joining method of tree building is used to create the guide tree As progressive pairwise alignment proceeds via a series of pairwise alignments this function in Geneious has all the standard pairwise alignment options In addition Geneious also has the option of refining the multiple sequence alignment once it is done Refining an alignment in volves removing sequences from the alignment one at a time and then realigning the removed 68 CHAPTER 3 ANALYSING DATA sequence to a profile of the remaining sequences The number of times each sequence is re aligned is determined by the refinement iterations option in the multiple alignment window The res
28. this dialog you are given the options of creating a new account on the server or entering the details for an existing account In most cases you will be creating a new account but there may be occasions when you need to re enter the details of an existing account If you choose to create a new account Geneious will attempt to automatically register your account on the server at the end of this process Add New Account Create a new account on the server This account already exists just connect Username Password Confirm password Email address optional C Connect every time I run Geneious 2 More Options Figure 6 1 Add New Account dialog box Choose a username and password now Enter your password twice for a new account You can also optionally add an email address Biomatters will need this if you require support regarding eg reset of password or deletion of accounts If you want this account to connect to the server every time you start Geneious check the checkbox labelled Connect every time I run Geneious More Options You can change some of the defaults for new and exiting accounts e Account Name is the name displayed in the Services Panel for this account It defaults to your username if nothing is entered e Server is the server your account connects to e Jabber Service Name is required by some other Jabber service providers such as Google Talk 6 1 MANAGING YOU
29. value displayed in the document table can be exported to csv file which can be loaded by most spread sheet programs When choosing to export in csv format Geneious will also present a list of the available columns in the table including hidden ones so you can choose which to export Please note this format cannot be imported currently 23 Searching Searching is designed to be as user friendly as possible and the process is the same if you are searching your local documents or a public database such as NCBI To search the selected database or folder click the Search button from the toolbar For non local folders search will be on by default and cannot be closed This applies to NCBI and EMBL databases For local folders search is off by default When search is first activated the document table will be emptied to indicate no results have been found To return to browsing click the Search button again or press the Escape key while the cursor is in the search text field To initiate a search enter the desired search term s in the text field and press enter or click the adjacent Search button Once a search starts the results will appear in the document table as they are found The Search button changes to a Cancel button while a search is in progress 2 3 SEARCHING 27 and this may be clicked at any time to terminate the search Feedback on a search progress is presented in the status bar directly below the toolbar immunodefi
30. writing your summary in a file called summary txt case sensitive and putting it in the tutorial folder Make sure that the entire summary is on the first line of the file as all other lines will be ignored Once you have all your files together put the contents of the folder in a zip file with the exten 81 82 CHAPTER 5 GENEIOUS EDUCATION PRO ONLY sion tutorial zip Be careful not to put subfolders in your zip file as these are not supported 5 2 Answering a tutorial Import the tutorial document into Geneious use File Import From file The tu torial document and any associated geneious documents will be imported into the currently selected folder The tutorial itself will be displayed in the help pane on the right hand side of the Geneious window If you accidentaly close the help pane you can display it by choosing Help from the Help menu If the tutorial requires you to enter answers click the edit button at the top of the tutorial window and type your answer in to the space provided Click the save button when you are done If the tutorial has a link to a Geneious document when you click the link the document will be opened in the document viewer Any changes you make to this document will be preserved when you export the tutorial When you have finished the tutorial export it by selecting the tutorial document and choosing File Export Selected Docuemnts from the main menu Make sure
31. 1 stands for Basic Local Alignment Search Tool It allows you to query the NCBI sequence databases with a sequence in order to find entries in the public database that contain similar sequences When BLAST ing you are able to specify either nucleotide or protein sequences and nucleotide sequences can be either DNA or RNA sequences The result of a BLAST query is a table of hits Each hit refers to a GenBank accession number and the gene or protein name of the sequence Each hit also has a Bit score which provides information about how similar the hit is to the query sequence The bigger the bit score the better the match Finally there is also an E value or Expect value which represents the number of hits with at least this score that you would expect purely by chance given the size of the database and query sequence The lower the E value the more likely that the hit is real Geneious is able to run NCBI BLAST on many different databases Some of these databases are non redundant in order to reduce duplicate hits You can submit either a raw sequence or Genbank accession number into NCBI BLAST and receive a summary of results for each hit This summary contains the bit score e value identity and the stretch of the query sequence and hit sequence that match The databases that can be searched are Table 2 3 Nucleotide sequence searches in the BLAST databases Database Nucleotide searches nr All non redundant GenBank EMBL
32. 1985 no 4 783 791 71 5 DF Feng and RF Doolittle Progressive sequence alignment as a prerequisite to correct phyloge netic trees J Mol Evol 25 1987 no 4 351 60 67 6 O Gotoh An improved algorithm for matching biological sequences J Mol Biol 162 1982 705 708 65 7 M Vingron HA Schmidt K Strimmer and A von Haeseler Tree puzzle maximum likelihood phylogenetic analysis using quartets and parallel computing Bioinformatics 18 2002 no 3 502 504 23 8 M Hasegawa H Kishino and T Yano Dating of the human ape splitting by a molecular clock of mitochondrial dna J Mol Evol 22 1985 no 2 160 174 71 9 S Henikoff and JG Henikoff Amino acid substitution matrices from protein blocks Proc Natl Acad Sci U S A 89 1992 no 22 10915 10919 65 66 10 T Jukes and C Cantor Evolution of protein molecules pp 21 32 Academic Press New York 1969 71 11 S Kumar K Tamura and M Nei Mega3 Integrated software for molecular evolutionary ge netics analysis and sequence alignment Brief Bioinform 5 2004 no 2 150 163 26 12 DR Maddison DL Swofford and WP Maddison Nexus an extensible file format for sys tematic information Syst Biol 46 1997 no 4 590 621 23 26 70 91 92 BIBLIOGRAPHY 13 JV Maizel and RP Lenk Enhanced graphic matrix analysis of nucleic acid and protein se quences Proc Natl Acad Sci U S A 78 1981 no 12 7665 9 63 64 14 C Michene
33. 2 kh source M mRNA 1 O _LL LI o oI 1 Cys peroxiredoxin protein exonn3 4 source 1 MI 660 693 126 59 92 825 858 zn 924 E Show AN Hide AID exon 3 gt i v M Colors 390 1 023 1 056 1 089 1 122 1 155 1 188 1 221 1 254 ot SS OS Gs EG DEDI NGA IDE ID NND D DIG DHL ee dT ee SUN te SS RUN ACCR Hd gt V Zoom Options 1 Cys peroxiredoxin protein CDS v Statistics 147 1596 1 Cys peroxiredoxin protein mR I Cys peroxiredoxin protein mRNA eng 147 E BEEN trequencies p nis sie TEA ra ai te ed rire pal igi risen a ta m A 40 27 2 C 35 23 8 AAA G 38 25 9 T 34 23 1 GC 73 49 7 Selected 147 residues from residue 525 to 671 Figure 3 1 A view of an annotated nucleotide sequence in Geneious 3 1 DOCUMENT VIEWERS IN GENEIOUS 51 Zooming can also be quickly achieved by holding down the zoom modifier key which is the Ctrl key on Windows Linux or the option key on Mac OS and clicking When the zoom key is pressed a magnifying glass mouse cursor will be displayed e Hold the zoom key and left click on the sequence to zoom in e Hold the zoom key and shift key to zoom out e Hold the zoom key and turn the scroll wheel on your mouse if you have one to zoom in and out e Hold the zoom key and click on an annotation to zoom to that annotation Colors The colors option controls the coloring of the sequence nucleotides or amino acids
34. 5 M New Sequence New sequence documents can be made by pasting or typing residues Biomatters 1 3 5 NM Newick importer Provides Newick importing facilities Biomatters 1 3 5 M Nexus importer Provides Nexus importing exporting facilities Biomatters 1 3 5 V Open reading frame plugin ORF analysis tools Biomatters 1 3 5 mM PDF importer Imports PDF documents Biomatters 1 3 5 M PHYLIP exporter Provides Phylip export Biomatters 1 3 5 M PileUp importer Provides PileUp import Biomatters 1 3 5 M RSF importer Provides RSF import Biomatters 1 3 5 M Sequence Alignment Provides pairwise and multiple sequence alignment algorithms Biomatters 1 3 5 M Sequence Concatenation Provides the ability to concatenate sequences or sequence alignments Biomatters 1 3 5 M Text Viewer A viewer which shows a simply text view of any document type Biomatters 1 3 5 M Tree builder Provides methods for building trees from alignments Biomatters 1 3 5 ok gt Cancel C Apply Restore defaults Figure 2 15 The plugins preferences in Geneious 2 9 3 Appearance and Behaviour Here you can change the way Geneious looks and the way it interacts with you Appearance options allow you to change the way the main toolbar and the document table look Behaviour options allow you to change the way newly created documents are handled Such as whether they are selected straight away and where they should be saved to 2 10 PRINTING AND SAVING IMAGES 47 2 10 Printi
35. 5 1997 no 24 4876 4882 22 23 67 68 23 JD Thompson DG Higgins and TJ Gibson Clustal w improving the sensitivity of progres sive multiple sequence alignment through sequence weighting position specific gap penalties and weight matrix choice Nucleic Acids Res 22 1994 no 22 4673 4680 22 23 68
36. 58 residues from residue 21 to 278 o a Nucleotide sequence b Journal Article c Phylogenetic tree Figure 1 3 Three document viewers 1 2 4 The Help Panel The Help Panel includes a tutorial If you are new to Geneious we recommend working through the tutorial first The help panel displays a short description of the currently selected 1 2 USING GENEIOUS FOR THE FIRST TIME 9 service This panel can be closed at any time by clicking the X symbol in the top left hand corner e Help Tutorial Help Previous 7 of 21 Sequence alignment Geneious allows you to align a set of sequences If the sequences you just imported are not selected select them all now Click on the Alignment button in the toolbar Click OK to accept the default alignment settings When the alignment has finished a new alignment document will be added to the current folder and selected This will display the alignment in the sequence viewer If you are using Geneious pro there is also an option to perform the alignment using ClustalW an independent and widely used alignment program Figure 1 4 The Help Panel 1 2 5 The Toolbar The toolbar gives quick access to commonly used features in Geneious including Search Agents Note Sort and Alignment For more information on the toolbar see section 21 5 1 2 6 The Menu Bar The Menu Bar has five main menus File Edit Tools Collaboratio
37. A Enter Contact ID Figure 6 4 Add New Contact dialog box in searching mode Your new contact will appear immediately in your contact list however you will not be able to tell whether your new contact is online until they accept you as a contact Similarly you will occasionally see a dialog box pop up asking you Allow user name talk geneious com as contact This is another Geneious user attempting to add you as a contact in this manner 6 2 2 Rename Contact You can rename a contact in your contact list by right clicking Ctrl click on MacOS the contact in the Services panel and selecting Rename Contact then entering a new name for the contact Rename Contact 9 Mame For blah blah Geneious User Figure 6 5 Rename Contact dialog box This only changes the name displayed for the contact in the Services Panel 88 CHAPTER 6 COLLABORATION PRO ONLY 6 2 3 Remove Contact If you no longer wish to share documents with a contact you can remove that contact by right clicking Ctrl click on MacOS the contact in the Services panel and selecting Remove Contact This deletes you from their contact list as well If you find that a contact has disappeared from your list this may be the reason 6 3 Sharing Documents Select one of your local folders Select Share Folder from the File menu Alternatively right click Ctrl click on MacOS on a local folder and select the same option e If you share a folder all docu
38. CE ALIGNMENTS 67 the presence of ancestral relationships between the sequences It should be noted that protein sequences that are structurally very similar can be evolutionarily distant This is referred to as distant homology While handling protein sequences it is important to be able to tell what a multiple sequence alignment means both structurally and evolutionarily It is not always possible to clearly identify structurally or evolutionarily homologous positions and create a single correct multiple sequence alignment 5 Multiple sequence alignments can be done by hand but this requires expert knowledge of molecular sequence evolution and experience in the field Hence the need for automatic mul tiple sequence alignments based on objective criteria One way to score such an alignment would be to use a probabilistic model of sequence evolution and select the alignment that is most probable given the model of evolution While this is an attractive option there are no efficient algorithms for doing this currently available However a number of useful heuristic algorithms for multiple sequence alignment do exist 3 6 1 Progressive pairwise alignment methods The most popular and time efficient method of multiple sequence alignment is progressive pairwise alignment The idea is very simple At each step a pairwise alignment is performed In the first step two sequences are selected and aligned The pairwise alignment is added to the mix and
39. Deliver To Local 32 iZ O New Folder Search every 1 Days Hd f Only get documents created after today Cancel Create CreateandRun Figure 2 9 The Create Agent Dialog 2 6 AGENTS 39 2 6 2 Checking agents Once you have created one or more agents Geneious allows you to quickly view their status in the agents window which is accessible from the toolbar Your agents details are presented in several columns Enable Action Status and Deliver To Enable This column contains a check box showing whether the agent is enabled Action This summarizes the user defined search criteria It contains 1 Details of the database accessed For example Nucleotide and Genome under NCBI 2 The search type the Agent performed i e keyword 3 The words the user entered in the search field for the Agent to match against Status This indicates what the Agent is currently doing The status will be one of the following e Next search in x time eg 18 hours The agent is waiting until its next scheduled search and it will search when this time is reached e Searching These are shown in bold The agent is currently searching e Disabled The agent will not perform any searches e Service unavailable The agent cannot find the database it is scheduled to search This will happen if the database plugin has been uninstalled or if for example the Collabora tion contact is offline currently e No search schedu
40. Geneious 2 5 4 Biomatters Ltd February 26 2007 Contents 1 Getting Started 5 1 1 Downloading amp Installing Genelotis 26 24 24 4S go R x R 8 5 1 2 Using Geneious forthe first ime ens cua xS xe a BS 7 e eec ee Hebe i E e Coe oe AR e ee i 10 2 Retrieving and Storing data 15 2l Thenon SUN a AAA e Sel 15 22 importing ond exporting data 2 2445 e o nb a eed eR aw E eS 20 29 ARCA AE TT 26 Z4 DoDBEGabDEEDS ba Sot POH ok Re o AAA d Ha ed 29 25 Storing data Your Local Documents s s 24 2444 85 6 eww R d 99 L5 ROGER sodio BS EROR AAA A BERS edo ded 37 27 Pilterng and Similarity sorting eae orn KR ROS d X N 40 2 rre 40 29 PEREDES ooh sce Qe TE Irae dene ga 44 EIU Dunt and Saving RES uuo uses oe weh ae Eu Rx Rm x oko 47 3 Analysing Data 49 3 1 Document Viewers in Geneious cu 24 dr RR Rex ee ee A Ox 9e ws 49 S EEIE ak eh ee Sat Soe BS ee SO oS eh So he ok Bog 62 des SO QUIEN ux sx es BRS CHEESES BP a BES a e 63 Pairwise sequence alignments Multiple sequence alignments Sequence alignment using ClustalW pro only Building Phylogenetic trees PCR Primers pro only 3 10 Results of analysis Smart Folders pro only Geneious Education pro only 5 1 Creating a tutorial 5 2 Answering a tutorial Collaboration pro only CONTENTS Chapter 1 Getting Started By the end if this chapter you should e Be able to download install and upgrade Geneious e Be able to import Notes and Documents from earlie
41. Protein searches env nr Translations of sequences in env nt month All new updated GenBank coding region CDS translations PDB SwissProt PIR released in last 30 days nr All non redundant GenBank coding region CDS translations PDB SwissProt PIR PRE pat Protein sequences derived from the Patent division of GenBank PDB Sequences derived from 3D structure Brookhaven PDB RefSeq RefSeq protein sequences from NCBI s Reference Sequence Project SwissProt Curated protein sequences information from EMBL e blastx Compares a nucleotide query sequence translated in all reading frames against a protein sequence database You could use this option to find potential translation prod ucts of an unknown nucleotide sequence e tblastn Compares a protein query sequence against a nucleotide sequence database dy namically translated in all reading frames e tblastx Compares the six frame translations of a nucleotide query sequence against the six frame translations of a nucleotide sequence database Please note that the tblastx program cannot be used with the nr database on the BLAST Web page because it is too computationally intensive Geneious also allows you to specify most of the advanced options that are available in BLAST To access the advanced options click the More Options button which is by the Search but ton in all NCBI BLAST services Geneious will now display a large text box labelled Search For in which you can enter your quer
42. R ACCOUNTS 85 e Port Number for Jabber servers running on a non standard port Add New Account Create a new account on the server C This account already exists just connect Username Password Confirm password Email address optional C Connect every time I run Geneious Account Name Server talk geneious com Jabber Service Name Port Number 5222 A Fewer Options Figure 6 2 Add New Account dialog box with More Options 6 1 2 Edit Account Details Select your account in the Services Panel and this option from the menu Or right click Ctrl click on MacOS on your account and select it from the popup menu This option is only available when your account is not connected This dialog has nearly all the same fields as the Add New Account dialog Remember that if you change details such as your username or password you may not be able to connect to your account 6 1 3 Connect Disconnect Connect to or disconnect from the server by right clicking Ctrl click on MacOS on your account or selecting your account in the Services Panel and choosing that option from the menu 86 CHAPTER 6 COLLABORATION PRO ONLY at the top 6 1 4 Delete Account You can delete an account from Geneious by right clicking Ctrl click on MacOS on your account or selecting your account in the Services Panel an choosing that option from the menu at the top This does not de
43. Structure Nucleotide sequence and PDF see Figure 2 5 Match all HJ of the following Document type HJ is m 3D structure m d M Include Subfolders Fewer Options 24 Name Summary FR ai Alpha Helix Alpha Helix ai Cyclopropane Cyclopropane ai VirusVIRAL PROTEIN VirusVIRAL PROTEIN 1G5G Figure 2 5 Document type search options And Or searches The advanced options lets you search using multiple criteria By clicking the button on right of the search term you can add another search criteria You can remove search criteria by 2 4 PUBLIC DATABASES 29 clicking on the appropriate button The Match all any of the following option at the top of the search terms determines how these criteria are combined Match Any requires a match of one or more of your search criteria This is a broad search and results in more matches Match All requires a match all of your search criteria This is a narrow search and results in fewer matches Match all HJ of the following Author HH contains Hj Drummond AJ d Date published EA isbetween RoLjanzoo03 and F 31 Dec 2005 Create Agent Search Fewer Options Name Summary qu Choosing appropriate substitu Choosing appropriate substitution models for the phylogenetic analysis of protein coding np Beth Shapiro Andrew Rambaut Alexei J Drummond 2005 Mol Biol Evol 23 7 9 Tree measure
44. The tree viewer provides a graphical view of a phylogenetic tree Figure 3 6 When viewing a tree a number of other view tabs may be available depending on the information at hand 3 1 DOCUMENT VIEWERS IN GENEIOUS 59 f 3 D structure Sequence View Text View Options M Show atoms Size 50 v Show bonds v Show ribbons v Show hydrogens L Spin Atom symbols Hold down the mouse button and drag to rotate Hold down the shift key and mouse button and drag to zoom Figure 3 5 A view of a 3D protein structure in Geneious The Sequence View tab will be visible if the tree was built from a sequence alignment using Geneious The Text View shows the tree in text format Newick The Notes tab will be present if you have added notes to the tree document There are a number of options for the tree viewer General General has 3 buttons showing the different possible tree views rooted circular and un rooted The Zoom slider controls the zoom level of the tree while the Expansion slider expands the tree vertically Layout This has two options root length and curvature Both Root Length and Curvature can be increased by dragging the bar towards the right Root length is not available for unrooted trees Checking the Align taxon labels box will align the taxa labels to provide easier viewer for large trees Formatting There are a range of formatting options 60 CHAPTER 3 ANALYSING DATA
45. VWKALTCSD seq4 SORHYKD DGGNYFOLREDWWTANRHTVWEAITCSA seq5 NVAALKTRYEK DGONFYOLREDWWTANRATIWEAITCSA seq6 FSKNIX OIEELODEWLLEARYKD TDNYYELREHWWTENRHTVWEALTCEA seq7 KELWEALTCSR seql GGGKYFRNTCDG GQNPTETONNCRCIG ATVPTYFDYVPOYLRWSDE seq2 P GDASYFHATCDSGDGRGGAQAPHKCRCDG ANVVPTYFDYVPOFLRWPEE seq3 KLSNASYFRATC SDGQSGAQANNYCRCNGDKPDDDKP NTDPPTYFDYVPQYLRWSEE seq4 DKGNA YFRRTCNSADGKSQSQOARNQCRC KDENGKN ADQVPTYFDYVPOYLRWSEE seq5 DKGNA YFRATCNSADGKSQSQARNQCRC KDENGXN ADQVPTYFDYVPOYLRWSEE seq6 P GNAQYFRNACS EGKTATKGKCRCISGDP PTYFDYVPOYLRWSEE seg P KGANYFVYKLD RPKFSSDRCGHNYNGDP LTNLDYVPOYLRWSDE EndNote 8 0 XML format EndNote is a popular reference and bibliography manager EndNote lets you search for journal articles online import citations perform searches on your own notes and insert references into documents It also generates a bibliography in different styles Geneious can interoperate with EndNote using Endnote s XML Extensible Markup Language file format to export and import its files FASTA format The FASTA file format is commonly used by many programs and tools including BLAST 1 T Coffee 16 and ClustalX 22 Each sequence in a FASTA file has a header line beginning with a gt followed by a number of lines containing the raw protein or DNA sequence data The sequence data may span multiple lines and these sequence may contain gap characters An empty line may or may not sep
46. and then click on the column which you want to edit Enter the appropriate new information and press enter Certain columns cannot be edited however eg the NCBI accession number Copying Column values can be copied This is a quick method of extracting searchable infor mation such as an accession number To copy a value right click Ctrl click on MacOS on it and choose the Copy name option where name is the column name Sorting All columns can be alphabetically numerically or chronologically sorted depending on the data type To sort by a given column click on its header If you have different types of documents in the same folder click on the Icon column to sort then according to their type Rearranging You can reorder the columns to suit Click on the column header and drag it to the desired horizontal position Choose columns to show The visible columns can be changed by right clicking Ctrl Click on MacOS on any column header or click the small header button in the top right corner of the table This gives a popup menu with a list of all the available columns Clicking on a column will show hide it Your preference is remembered so if you hide a column it will remain hidden in all areas of the program until you show it again Note If a Note is added to a document refer to the section on Adding Notes for more in formation a Note column is added to the end of the existing Document table Also when accessing BLAST 1 in G
47. arate consecutive sequences Here is an example of three sequences in FASTA format DNA Protein Aligned DNA 2 2 IMPORTING AND EXPORTING DATA 23 gt Orangutan ATGGCTTGTGGTCTGGTCGCCAGCAACCTGAATCTCAAACCTGGAGAGTGCCTTCGAGTG gt gi 532319 pir TVFV2E TVFV2E envelope protein ELRLRYCAPAGFALLKCNDADYDGFKTNCSNVSVVHCTNLMNTTVTTGLLLNGSYSENRT OIWOK Chicken CTACCCCCCTAAAACACTTTGAAGCCTGATCCTCACTA CTGT CATCTTAA Geneious format The Geneious format can be used to store all your local documents note types and program preferences A file in Geneious format will usually have a geneious extension or a xml extension This format is useful for sharing documents with other Geneious users and backing up your Geneious data Newick format The Newick format is commonly used to represent phylogenetic trees such as those inferred from multiple sequence alignments Newick trees use pairs of parentheses to group related taxa separated by a comma Some trees include numbers branch lengths that indicate the distance on the evolutionary tree from that taxa to its most recent ancestor If these branch lengths are present they are prefixed with a colon The Newick format is produced by pro grams such as PHYLIP PAUP ClustalW 23 ClustalX 22 Tree Puzzle 7 and PROTML Geneious is also able to read trees in Newick format and display them in the visualization win dow It also gives you a number of display options including tree typ
48. arranges the docu ments in the opposite order An arrow next to the heading indicates if it is in increasing or decreasing V order Viewing Notes Notes are added into columns defined by their Types and Fields Notes are also shown as an extra tab in the Document Viewer Panel Click on the Notes tab to view your notes Notes can be edited or deleted from here too 2 13 Sequence View Text View Source XML Notes xs Source Source Hame NCBI Protein Date Acquired 01 Aug 2006 Modified from original false Figure 2 13 Viewing Notes Editing and Deleting Notes and Note Types Notes can be edited or deleted from the Documents Viewer Panel and from the Add Note screen Click on the Edit Note Type or Delete Note Type button 2 14 2 9 Preferences You can access the preferences screen in two ways 1 Shortcut keys Ctrl Shift P Windows Linux Command Shift P Mac OS X 2 Select the Tools Menu and click Preferences There are several sections in the preferences window which are presented as tabs The most important of these are described below 2 9 PREFERENCES 45 Existing Note Types Note Person gt Create Note Type Delete Note Type Edit Note Type Figure 2 14 Viewing Notes 2 9 1 General This contains connection settings data storage details for your Local documents Automatic new version checking and Search History Check for new version of Geneious E
49. aste or type in It also contains options to create rename delete or share folders and Import Export options Edit Menu Here you will find common editing functions including Cut Copy Paste Delete and Select All These are useful when transferring information from within doc uments to other locations or exporting them This menu also contains Find in Document Find Next and Find Previous options Find can be used to find text or numbers in a se lected document This is useful when looking for annotated regions or a stretch of bases in a sequence This opens a Find Dialog The shortcut to this is Ctrl F Next finds the next match for the text specified in the Find dialog The shortcut keys are F3 or Ctrl G Geneious then allows you to choose another document and continue searching for the same search word Prev finds the previous match The shortcut keys for this are Ctrl Shift G or Shift F3 Tools Menu This contains a list of features such as Note Alignment Tree Viewers and Preferences Click on Viewers to get a list of all the viewers available in Geneious Collaboration Menu pro only This contains actions that can be performed with Collaboration accounts which allow you to share you work with other Geneious users Help Menu This consists of the standard Help options offered by Geneious 22 Importing and exporting data Geneious is able to import raw data from different applications and export the
50. ate it If you would like to change the zoom level of the molecule just hold the shift key and the mouse button and then drag up and down to zoom out and zoom in Another way is to use the scroll button on your mouse Scrolling down enlarges the image and scrolling 58 CHAPTER 3 ANALYSING DATA Chicken 1737 nt Y Zoom Level o 1000 sj 19 x EJIRE wN y Preferences Stringency 18 y b Window size 21 1000 k Australobius 1815 nt E d Fs Figure 3 4 A view of dotplot of two sequences in Geneious upwards shrinks it Under Windows and Linux you can also move the molecule parallel to the viewing plane while pressing Ctrl Alt left mouse button This is currently not possible under Mac OS There are a several viewing options available for 3D structures Show atoms This shows the atoms on the molecule You can use the Size option to increase or decrease the size of the atoms Show bonds This shows the bonds between the atoms of the molecule Show ribbons This option displays the secondary structure of the molecule in the form of rib bons This can be viewed only if the necessary information is available Show hydrogens This option displays the hydrogens attached to the molecule They are shown in white Spin This allows the molecule to spin on a vertical axis for viewing Atom symbols This displays single letter codes for the atoms adjacent to each atom 3 1 4 Tree viewer
51. based on deleting a portion of the original observations for each pseudo replicate A 50 jackknife randomly deletes half of the columns from the alignment to create each pseudo replicate 3 8 6 Tree building in Geneious Geneious can build a phylogenetic tree for a set of sequences using pairwise genetic distances To build a tree select an alignment or a set of related sequences all DNA or all protein in the Document table and click the Build Tree icon or choose this option from the Tools menu Tree from non aligned sequences x Tree Alignment Options Cost matrix 51 similarity 5 0 3 0 yl Gap open penalty Ce Gap extension penalty MO Alignment type Global alignment ex Tree builder options Genetic Distance Model Duescato x Tree build Method Jupama v Outgroup Cancel Figure 3 11 Tree building options in Geneious If you are building a tree from an alignment the following options are seen in the tree window Genetic distance model This lets the user choose the kind of substitution model used to estimate branch lengths If you are building a tree from DNA sequences you have the choices Jukes Cantor HKY and Tamura Nei If you are building a tree from amino acid sequences you only have the option of Jukes Cantor distance correction Tree building method There are two methods under this option Neighbor joining 18 and UPGMA 14 Consensus method via resampling Chec
52. be filtered using the Filter box located at the right side of the toolbar For more information see section 2 1 2 1 2 3 The Document Viewer Panel The Document Viewer Panel is where sequences alignments trees and journal article abstracts can be shown graphically or as plain text This panel also offers various options while visu alizing protein and nucleotide sequences These options include zooming color and layout selection and annotations When viewing trees there are additional options for branch and leaf labeling and controlling tree layout When viewing journal articles this panel includes a direct link to Google Scholar All these options are displayed on the right hand side of the panel Figure 1 3 For more information see section 2 1 3 BibTex save a 7 Estimating mutation parameters population history and genealogy simultaneously PON O EGCATCGGATC GACTAGCTAG CTAGCTAGCI from temporally spaced sequence data ACGATCGATC GATCGATCGA TCGACTGACT GACTAGCTAC GATCGATCGA Alex Drummond Geoff K Nicholis Alen G Rodrigo amp Wiremu Solomon EGEGEEXGE ACATCCATCO RTCCATCCAT CCATCCACT ACTOACTOAGI Jrivesity of Auckland 1001 Auckland New Zeeland rack 0122 gt M Show tice soquoncos gathore at diferent fly discuss the methodological states of prior knowledge We imation of substitution model a goncalogy of KIV envelope PMID 12136032 Selected 2
53. ciency Search More Options Name Summary FR Ed NC_001802 Human immunodeficiency virus 1 complete genome iat NC_004455 Simian immunodeficiency virus 2 complete genome Ed NC_001549 Simian immunodeficiency virus complete genome Ed NC_003074 Arabidopsis thaliana chromosome 3 complete sequence Ed NC_002305 Salmonella typhi plasmid R27 complete sequence A NC 001870 Simian Human immunodeficiency virus complete genome Ed NC_001722 Human immunodeficiency virus 2 complete genome Ed NC_001664 Human herpesvirus 6 complete genome 1 Figure 2 4 The Search tab of the Document Table 2 3 1 Advanced Search options To access advanced search click the More Options button inside the basic search panel To return to basic search click the Fewer Options button Switching between advanced and basic will not clear the search results table This feature provides more search options to select from Geneious allows you to search with a range of criteria however these depend on the database being searched All the fields in the NCBI public databases can be searched in any combination Each database has a specific list of fields and it is important to familiarize yourself with these fields to make full use of the Advanced Search The fields available for a search can be found in the left most drop down box after enabling the advanced search options Note When searching the Genome database the documents returned are only summaries To downl
54. consensus se quences does not contains any gaps Majority consensus When this box is checked only the major character is displayed in the con sensus and ambiguity codes only arise in the case of ties However when this box is unchecked the relative frequencies of the residues is ignored i e a Strict consensus is performed so that if there is any polymorphism it is recognized For example a site containing 4 G s and one A would have an R as the consensus This generally implies ignoring gaps since every position normally has at least one non gap residue Exclude variable sites When this box is checked any variable site is displayed as a gap in the consensus In other words only sites with just one base and in number at least the number of 54 CHAPTER 3 ANALYSING DATA gaps are included in the consensus Turning on No gaps in consensus as well would show only invariant sites i e sites with only one base ignoring any gaps Note that the Exclude setting takes precedence over No gaps i e when both are on invariant sites are still marked as a gap Show only different residues When this option is on any residues in the alignment that have the FF FF same residue as the consensus will be displayed as a instead Color only different residues When this option is on any residues in the alignment that have the same residue as the consensus will be displayed in black rather than the color they would otherwise use according to the c
55. ct connection Use this setting when no proxy settings are required e Use browser connection settings This allows Geneious to automatically import the proxy settings This may not work with all web browsers e Use HTTP proxy server This enables two text fields Proxy host and Proxy port This information is in your browser s connection settings Use this if your proxy server is an HTTP proxy server Please see step 3 e Use SOCKS proxy server Autodetect Type This enables two text fields Proxy host and Proxy port This information is in your browser s connection settings Use this if your proxy server is a SOCKS proxy server Please see step 3 e Use auto config file This enables one text field called Config file location These details can also be found in your browser s settings 6 Setthe proxy host and port settings under the General tab to match those in your browser 12 CHAPTER 1 GETIING STARTED 7 If your proxy server requires a username and password you can specify these by clicking the Proxy Password button directly below Note If you are using any other browser and cannot find the proxy settings please email us at support geneious com e Tools Preferences Tools Internet Advanced Options Options Proxies Connection Connections Settings Connection Lan Settings Settings Figure 1 6 Checking browser settings 1 3 3 Web links inside Geneious don t work under Linux Set your BROWSER enviro
56. e 3 1 6 The PDF document viewer This viewer is under development Currently to view a pdf document either double click on the document in the Documents Table or click on the View Document button This opens the document in an external PDF viewer such as Adobe Acrobat Reader or Preview Mac OS 62 CHAPTER 3 ANALYSING DATA X On Linux you can set an environmental variable named PDFViewer to the name of your external PDF viewer The default viewers on Linux are kpdf and evince 3 1 7 The Journal Article Viewer This viewer provides two tabs Text View and BibTex Text view displays the journal article details including the abstract The text contains a link to the original article through Google Scholar below the title and authors Figure 3 8 BibTex is the standard IATEX bibliog raphy reference and publication management data format IATEX is a common program used to create formatted documents including this one The information in the BibTex screen can be exported for use in TEX documents A virus reveals population structure and recent demographic history of its carnivore host Roman Biek Alexei J Drummond amp Mary Poss Wildlife Biology Program University of Montana Missoula MT 59812 USA rbiek emory edu Science 2006 311 538 41 Google Scholar Directly transmitted parasites often provide substantial information about the temporal and spatial characteristics of host to host contact Here we dem
57. ecause the relationship between the two primers will be taken into account when choosing the best ones If any documents were selected which either are primer sequences or contain primer annota tions then these will be made available for selection in a drop down box next to each of the check boxes in Other and Test only Selected sequences are treated as primer or probe se quences if they are 36bp in length or less Selecting one of these in a drop box will mean the selected existing primer will be used instead of designing one This can be used if for example you already have a forward primer and you wish to design a reverse primer to match it Select Design in the drop box if you wish to design instead of use an existing primer the default Options The options sections allows you to specify what part of a sequence you wish to amplify All of these are expressed in base pairs from the beginning of the sequence and are as follows e Included Region From To Specifies the region of the sequence which primers are al lowed to fall within This must surround the target region and allows you to choose a small region on either side of the target which primers must be in e Product Region From To Specifies which region of the sequence you wish to amplify and unless the advanced options allow otherwise the left and reverse primers must fall somewhere outside this region e Product Size From To Specifies the range of sizes which the product
58. eneious the Document Table has additional columns related to the BLAST search 2 1 THE MAIN WINDOW 19 2 13 The Document Viewer Panel The Document Viewer Panel shows the contents of any document clicked on in the Document Table To view large documents it is sometimes better to double click on them This opens a view in a new window In the document viewer panel there are two tabs that are common to most types of documents Text view and Notes Text view shows the document s information in text format The exception to this rule occurs with PDF documents where the user needs to either click the View Document button or double click to view it The Notes tab only appears if the user has added notes to the document Some document types such as sequences trees and structures have an options panel occupying the right of the document viewer The options in the options panel have an arrow which can be used to expand or hide a group of related options See the next section on document viewers for more information about operating the various viewers in Geneious 2 1 4 The Help Panel The Help Panel has a Help tab and a Tutorial tab The Help tab provides you information about the service you are currently using The Tutorial is aimed at first time users of Geneious and has been included to provide a feel for how Geneious works It is highly recommended that you work through the tutorial if you haven t used Geneious before 2 1
59. er 3 00 Pair Tm Diff 0 1 Figure 3 13 Primer design output e For each of these specific reasons for rejection are listed eg Tm too high or Unaccept able product size along with a percentage which expresses how many of the candidate primers or probes were rejected for this reason After examining the details you can choose take no action or continue and annotate the primer and or DNA probes on the sequences which were successfully designed for 3 9 2 Testing Primers Primers and probes can also be quickly tested against large numbers of sequences to see which ones the primers will bind to Currently this will only find sequences that match the primers exactly To test primers firstly select the primer or DNA probe sequences you wish to test or sequences which contain the desired primers as annotations Then by holding down shift or ctrl command on MacOS also select all of the sequences to test the primers against and choose the same Primers action from the menu Select the Test Primers option at the very top of the Primer Design dialog Now select which of Forward primer reverse primer and DNA probe you wish to test and choose the desired primer or probe from the drop down box next to each of these All of the options also apply to testing so if the primers are expected to bind to quite different regions of the test sequences the primer binding region may have to be extended and the target region option can be omitted Clic
60. es branch lengths and labels Nexus format The Nexus format 12 was designed to standardize the exchange of phylogenetic data in cluding sequences trees distance matrices and so on The format is composed of a number of blocks such as TAXA TREES and CHARACTERS Each block contains pre defined fields Geneious imports and exports files in Nexus format and can process the information stored in them for analysis 24 CHAPTER 2 RETRIEVING AND STORING DATA MEGA format The MEGA format is used by MEGA Molecular Evolutionary Genetics Analysis PileUp format The PileUp format is used by the pileup program a part of the Genetics Computer Group GCG Wisconsin Package Rich Sequence format RSF Rich Sequence Format files contain one or more sequences that may or may not be related In addition to the sequence data each sequence can be annotated with descriptive sequence information DNA Strider Sequence files generated by the Mac program DNA Strider containing one Nucleotide or Pro tein sequence PDB format Protein Databank files contain a list of XYZ co ordinates that describe the position of atoms in a protein These are then used to generate a 3D model which is usually viewed with Rasmol or SPDB viewer Geneious can read PDB format files and display an interactive 3D view of the protein structure including support for displaying the protein s secondary structure when the appropriate information is available PDF
61. es that are grouped under them Tips or leaves of a tree represent the sequences used to construct the tree Taxonomic units These can be species genes or individuals associated with the tips of the tree A phylogenetic tree can be rooted or unrooted A rooted tree consists of a root or the common ancestor for all the taxonomic units of the tree An unrooted tree is one that does not show the position of the root An unrooted tree can be rooted by adding an outgroup a species that is distantly related to all the taxonomic units in the tree A common format for representing phylogenetic trees is the Newick format 12 3 8 2 Neighbor joining In this method neighbors are defined as a pair of leaves with one node connecting them The principle of this method is to find pairs of leaves that minimize the total branch length at each stage of clustering starting with a star like tree The branch lengths and an unrooted tree topology can quickly be obtained by using this method without assuming a molecular clock 18 3 8 3 UPGMA This clustering method is based on the assumption of a molecular clock 14 It is appropriate only for a quick and dirty analysis when a rooted tree is needed and the rate of evolution is does not vary much across the branches of the tree 3 84 Distance models or molecular evolution models for DNA sequences The evolutionary distance between two DNA sequences can be determined under the assump tion of a particular
62. format PDF stands for Portable Document Format and is developed and distributed by Adobe Systems http www adobe com It contains the entire description of a document including text fonts graphics colors links and images The advantage of PDF files is that they look the same regardless of the software used to create them Some word processors are able to export a document into PDF format Alternatively Adobe Writer can be used Currently you can use Geneious to read store and open PDF files and future versions will have more options for storing and manipulating PDF 2 2 IMPORTING AND EXPORTING DATA 25 Sequence Chromatograms Sequence chromatogram documents contain the results of a sequencing run the trace and a guess at the sequence data base calling Informally the trace is a graph showing the concentration of each nucleotide against sequence positions Base calling software detects peaks in the four traces and assigns the most probable base at more or less even intervals Ace files Ace is the format used by the Phrap Consed package created by the University of Washington Genome Center This package is used mainly to assemble sequences DNAStar files DNAStar seq and pro files are used in Lasergene a sequence analysis tool produced by DNAStar GenBank files Records retrieved from the NCBI webiste http www ncbi nlm nih gov can be saved in a number of formats Records saved in GenBank or INSDSeq XML formats
63. g system has been chosen we need an algorithm to find the optimal alignment of two sequences This is done by inserting gaps in order to maximize the alignment score If the sequences are related along their entire sequence a global alignment is appropriate However if the relatedness of the sequences is unknown or they are expected to share only small regions of similarity such as a common domain then a local alignment is more appropriate An efficient algorithm for global alignment was described by Needleman and Wunsch 15 and their algorithms was later extended by Gotoh to model gaps more accurately 6 For local alignments the Smith Waterman algorithm 19 is the most commonly used See the references provided for further information on these algorithms 3 5 3 Pairwise alignment in Geneious A dotplot is a comparison of two sequences A pairwise alignment is another such comparison with the aim of identifying which regions of two sequences are related by common ancestry and which regions of the sequences have been subjected to insertions deletions and substitu tions 66 CHAPTER 3 ANALYSING DATA Alignment options The options available for the alignment cost matrix will depend on the kind of sequence e Protein sequences have a choice of PAM 2 and BLOSUM 9 matrices e Nucleotide sequences have choices for a pair of match mismatch costs Some scores distinguish between two types of mismatches transition and transversion Tra
64. ic trees PHYLIP Tree Puzzle PAUP ClustalX Nexus nxs nex Trees Alignments PAUP Mesquite MrBayes and MacClade MEGA meg Alignments MEGA PileUp msf Alignments pileup gcg Rich Sequence Format rsf Sequences alignments GCGs NetFetch DNA Strider str sequence DNA Strider Mac program ApE PDB pdb 3D Protein structures SP3 SP2 SPARKS Protein Data Bank PDF pdf Documents presentations Adobe Writer KTpX Miktex Raw sequence text seq Sequences Any file that contains only a sequence Sequence Chromatograms abl scf Raw Sequencing trace and sequence Sequencing machines Ace ace Contig assemblies Phrap Consed GenBank gb xml Nucleotide and protein sequences GenBank DNAStar seq pro Nucleotide and protein sequences DNAStar 22 CHAPTER 2 RETRIEVING AND STORING DATA CLUSTAL format The Clustal format is used by ClustalW 23 and ClustalX 22 two well known multiple se quence alignment programs Clustal format files are used to store multiple sequence alignments and contain the word clustal at the beginning An example Clustal file CLUSTAL W 1 74 multiple sequence alignment seql KSKERYKDENGGNYFOLREDWWDANRETVWKAITCNA seg2 YEGLTTANGXKEYYODKNGGNFFKLREDWWTANRETVWKAITCGA seq3 KRIYKKIFKEIHSGLSTKNGVKDRYON DGDNYFOLREDWWTANRST
65. ications or other types If you cannot see all of the columns in the document table you may want to close the help panel to make more room This information is presented in table form Figure 2 2 Name Summary Xidentical Journal Title First Author PMID Sequence Resi URL Fa VI A virus reveals population str poe TEYENIS POPUTSCIOTE SUTUCTUTE STRT TECETTE eNO Tam Science Roman Biek 16439664 http w history of its carnivore host 2 Population genetic estimation Population ganct Osumio of the loss of genetic diversity BMC Evol Biol Charles T T 16556318 http ww during horizontal transmission of HIV 1 Relaxed phylogenetics and dating with confidence 2 Relaxed phylogenetics and da Alexei J Drummond Simon Y W Ho Matthew J Phillips amp Andrew PLoS Biol Alexei J Dru 16683862 http bic MA modified cc cd11 M13F COS modified cc_cd11_M13F_C05_022 ab1 Length 597 GCTCACGA DA cc_cd11_M13F_C05_022 ab1 cc_cd11_M13F_C05_022 ab1 Length 597 gctsacgatgc MWA modified cc_cd12_M13F_DOS modified cc_cd12_M13F_D05_021 ab1 Length 618 GCTSCGATG A Nucleotide alignment 6 Alignment of 2 sequences cc cd11 M13F C05 022 abl 82 86 E E New nucleotide sequence New nucleotied sequence A new nucleotide sequence entered ACGATCAC K 1996YangGeneticsv144p194 1996YangGeneticsv144p1941 pdf E tree txt tree txt 4 tips B m fe tree3 tx
66. icon which appears in the status bar When this icon is present items cannot be added or removed from the table and they cannot be modified in any way To modify an item you must first move it to your local folders 2 4 1 NCBI Entrez and EMBL databases NCBI was established in 1988 as a public resource for information on molecular biology Geneious allows you to directly download information from seven important NCBI databases NCBI BLAST services and one EMBL database Table 2 1 Table 2 1 NCBI and EMBL databases accessible via Geneious Database Center Coverage Genome NCBI Whole genome sequences Nucleotide NCBI DNA sequences PopSet NCBI sets of DNA sequences from population studies Protein NCBI Protein sequences Structure NCBI 3D structural data PubMed NCBI Biomedical literature citations and abstracts Taxonomy NCBI Names and taxonomy of organisms Uniprot EMBL Protein sequences This section gives a brief explanation of the key databases that can be searched using Geneious There is also a bewildering array of acronyms If you are not already a bioinformatician please check them before you continue The Entrez Genome database This provides views of a variety of genomes complete chromo somes sequence maps with contigs contiguous sequences and integrated genetic and physi cal maps The Entrez Nucleotide database This database in GenBank contains 3 separate components that are also searchable databases EST
67. in the saved image 48 CHAPTER 2 RETRIEVING AND STORING DATA Chapter 3 Analysing Data By the end of this chapter you should be able to e Know about the main document viewers in Geneious e Understand the basic principles of bioinformatics e Perform simple bioinformatics analyses with Geneious 3 1 Document Viewers in Geneious 3 11 The Sequence and alignment Viewer The Sequence view tab in the Document Viewer panel is available for Nucleotide sequences Protein sequences Alignments and some 3D structure documents The options available are grouped under headings Zoom level Annotations Colors Layout Zoom options and Statistics The presence of these options varies with the kind of sequence data being viewed Zoom level The plus and minus buttons increase and decrease the magnification of the sequence by 50 or by 30 if the magnification is already above 50 P zooms to 100 The 100 zoom level allows for comfortable reading of the sequence na 4 zooms out so as to fit the entire sequence in the available viewing area 49 CHAPTER 3 ANALYSING DATA e Extract E Reverse Complement E Translate PA Edit b Annotation ul Save a 1 33 66 99 132 165 198 231 264 29 i i 1 l l i a Y Zoom Level Y Ee em 1 Cys peroxiredox E emis C Compress annotations 330 363 396 429 462 495 528 561 594 27 CI ALEA 3000 X Go TG 0 05 00 m cromo EEENHIHNSGSURINSSIGCION NOCH NOE V exon
68. individual options in the Primer Picking Weights dialog again hover your mouse over the option to see a short description 3 9 4 More Information The Primer feature in Geneious is based on the program Primer3 http frodo wi mit edu cgi bin primer3 primer3 www cgi Copyright c 1996 1997 1998 1999 2000 2001 2004 Whitehead Institute for Biomedical Research All rights reserved If you use the primer design feature of Geneious for publication we request that you cite primers as Steve Rozen and Helen J Skaletsky 2000 Primer3 on the WWW for general users and for biologist programmers In Krawetz S Misener S eds Bioinformatics Methods and Protocols Methods in Molecular Biology Humana Press Totowa NJ pp 365 386 Source code available at http fokker wi mit edu primer3 Further information on the functionality of Geneious primer design feature can be found in the primer3 documentation available here http frodo wi mit edu cgi bin primer3 primer3 www help cgi Please note that some controls have been changed renamed or re moved from Geneious but most of the primer3 functionality is available 3 10 Results of analysis All analysis results are deposited in the currently selected folder If no local folder is selected then you will be prompted for a local folder This applies to sequence alignments phylo genetic trees sequence translations reverse complements and extraction of sequences Once generated analysis results
69. ise alignments In order to align a pair of sequences a scoring system is required to score matches and mis matches The scoring system can be as simple as 1 for a match and 1 for a mismatch between the pair of sequences at any given site of comparison However substitutions inser tions and deletions occur at different rates over evolutionary time This variation in rates is the result of a large number of factors including the mutation process genetic drift and natural selection For protein sequences the relative rates of different substitutions can be empirically determined by comparing a large number of related sequences These empirical measurements can then form the basis of a scoring system for aligning subsequent sequences Many scoring systems have been developed in this way These matrices incorporate the evolutionary prefer ences for certain substitutions over other kinds of substitutions in the form of log odd scores Popular matrices used for protein alignments are BLOSUM 9 and PAM 2 matrices Note The BLOSUM matrix is a substitution matrix The number of a BLOSUM matrix indicates the threshold similarity between the sequences originally used to create the matrix BLO SUM matrices with higher numbers are more suitable for aligning closely related sequences When aligning protein sequences in Geneious a number of BLOSUM and PAM matrices are available 3 5 2 Algorithms for pairwise alignments Once a scorin
70. k the OK button and testing will commence Once complete a dialog will be presented with the results This dialog tells you how many of the sequences were compatible with the specified primers and probes and provides details and choices very similar to the one described in section 3 9 1 3 9 PCR PRIMERS PRO ONLY 77 3 9 5 Advanced Primer Design Options The parameters which are used to pick primers and DNA probes are highly customisable through the advanced options section of primers dialog To access this make a selection of sequence for testing or designing and select Primers from the menu as detailed above Now click the More Options button and a large section will appear below the standard options The advanced options section has two tabs which are available depending on the task you have chosen The Primer section is available if Forward Primer or Reverse Primer is selected and DNA Probe is available if DNA Probe is selected in the tasks These two sections are quite similar the DNA probe section has a subset of the options available in the primer section This is because primers are usually chosen in pairs and so several options can be set for how pairs are chosen Most of the options are used to set absolute limits on properties of primers and probes such as melting point and GC content Optimum values can also be specified For details on individual options hover your mouse over them and a popup box will describe the fu
71. k this box to build a consensus tree using resampling of sequence alignment data Resampling method Either bootstrapping or jackknifing can be performed when resampling columns of the sequence alignment 3 9 PCR PRIMERS PRO ONLY 73 Number of samples The number of alignments and trees to generate while resampling and building a consensus tree A value of at least 100 is recommended Support threshold This is used to decide which monophyletic clades to include in the consensus tree after comparing all the trees in the original set as defined below Resampling results in a set of trees which we will refer to as the original set of trees for the definitions that follow A 100 support threshold results in a Strict consensus tree which is a tree where the included clades are those that are present in all the trees of the original set A 50 threshold results in a Majority rule consensus tree that includes only those clades that are present in the majority of the trees in the original set A threshold less that 50 gives rise to a Greedy consensus tree In constructing a Greedy consensus tree clades are first ordered according to the number of times they appear i e the amount of support they have then the consensus tree is constructed progressively to include all those clades whose support is above the threshold and that are compatible with the tree constructed so far Note The above definitions apply to rooted trees The
72. kes a text value for the protein in kDa kiloDaltons see Figure 2 7 Important You must use quotation marks if and blank spaces are part of your search criteria No quotation marks lead to unreliable results 36 CHAPTER 2 RETRIEVING AND STORING DATA Wild card searches When you are looking for all matches to a partial word use the asterisk For example typing oxi would return matches such as oxidase oxidation oxido reductase and oxide This is useful for performing generic searches You can also place the asterisk in the middle of the word but not at the beginning This feature is available only for local documents Match all FH of the following Any Field EX contains HA Document type geneious document type E Value First Author The article principal author iluc CID The genbank ID of the sequence E Height Tree Height Hit range Position of hit in t sequence Journal Title Where ar Last Author The art ished ublication date Medline Date The article H Molecule Type The molecule type of the sequence H Name name of a document No nodes Number of Nodes I No tips Number of tips Organism Organism The organism of the sequence i PDB name Name in PDB i PMID The PubMed ID of the article H I Publication Date The article publication date uH Sequence Annotations sequer Ano ext u Sequence Length Residue leng
73. led The agent is enabled but doesn t have a search scheduled To correct this click the Run now button in the agent dialog to have it search immediately and schedule a new search Deliver To This names the destination folder for the downloaded documents This is usually your Local Documents or one of your local folders Note If you close Geneious while an agent is running it will stop in mid search It will resume searching when Geneious is restarted Also all downloaded files are stored in the destination folder and are marked unread until viewed for the first time 2 6 3 Manipulating an agent Once an agent has been set up it can be disabled enabled edited deleted and run All these options are available from within the Agents dialog 40 CHAPTER 2 RETRIEVING AND STORING DATA e Enable or disable an agent by clicking the check box in the Enable column e Run Now Cause the agent to search immediately e Cancel If the agent is currently searching this can be clicked to stop the search e Edit Click this to change an agent s database search criteria destination or search in terval e Delete Delete the agent permanently Any documents retrieved by the agent will re main in your local documents 2 7 Filtering and Similarity sorting The Filter allows you to instantly identify documents in the document table matching chosen keywords It is located in the top right hand corner of the Main Toolbar Type in the
74. lete the account on the server 6 2 Managing Your Contacts Once you have an account and are connected you can start adding contacts You will not be able to add contacts while an account is disconnected 6 2 1 Add Contact Select your account in the Services Panel and choose Add Contact from the menu at the top or right click Ctrl click on MacOS on your account in the Services Panel and choose the same option You will see a simple dialog with one field Jabber ID A Jabber ID looks like an email address and has a similar function It uniquely identifies some other Geneious users account You can enter a contact s Jabber ID directly into this field if you know it To see your own Jabber ID hover your mouse over your account in the Services Panel and it will appear in a tool tip 7 Add Contact to myaccount Jabber ID e g user name talk geneious com 7 Search For Contact Figure 6 3 Add Contact dialog box If the server supports it you should also see a Search For Contact link Click this to go to the next dialog Here you will see a box for a search string and some checkboxes indicating what you are searching on Enter all or part of the name or email of the contact you want and click the 6 2 MANAGING YOUR CONTACTS 87 Search button If any rows are returned in the results table you will be able to select one or entries and add them as contacts 7 Add Contact to myaccount Username Name V Email
75. ly save the new contents The Pop up menu in the sequence viewer The toolbar actions are available via a pop up menu as well Right click Ctrl click on Ma cOS on any sequence partly highlighted sequence or annotation to show the various options The pop up menu contains the Copy residues action keyboard Ctrl C to copy the selected residues to the system clipboard Printing a sequence view To print a sequence view go to File Print and click OK The view is printed without the options panel It is recommended to turn on Wrap sequence and deselect Colors before printing Wrapping prints the sequence as seen in the sequence viewer and the font size is chosen to fill the horizontal width of the page 3 1 2 Dotplot viewer This is a special viewer that appears when two sequences are chosen A dotplot compares two sequences to find regions of similarity Each axis X and Y on the plot represents one of the sequences being compared Figure 3 4 For more information about interpreting dot plots refer to the section on them in chapter 3 3 13 3D structure viewer The 3D structure viewer was introduced in Geneious 1 0 Figure 3 5 3D structures can be obtained by searching NCBI s Structure database from within Geneious or by importing 3D structure files such as PDB files but other formats are supported as well from your hard drive To view the molecule from another angle just click on the molecule and drag the mouse to rot
76. mentioned above 6 5 CHAT 89 6 5 Chat You can chat with your online contacts 6 5 1 Starting a Chat If you want to start a chat you will need to invite some or all of your contacts to join you in the chat Starting a Chat with One Contact To start a chat with a particular contact select that contact and either select New Chat Ses sion from the Collaboration menu or right click Ctrl click on MacOS on the contact and then select New Chat Session from the popup menu If you want you can enter a reason into the dialog box which appears explaining to that contact why you want to chat with them Clicking ok will invite the contact to chat with you and open a Chat Window in which you can chat with them Starting a Chat with Multiple Contacts To start a chat with some or all of your contacts select your account and either select New Chat Session from the Collaboration menu or right click Ctrl click on MacOS on the account and then select New Chat Session from the popup menu A dialog will appear asking you to select the contacts you wish to invite to chat with you If you want you can also enter a reason explaining to those contacts why you want to chat with them Clicking ok will invite the contacts to chat with you and open a Chat Window in which you can chat with them Accepting or declining an invitation to chat When one of your contacts invites you to chat a dialog will appear asking you to accept or
77. ments in that folder are shared e If you share a folder all sub folders of that folder are shared e If you share a folder it is available to all your contacts 6 4 Browsing Searching and Viewing Shared Documents Folders that your contacts have shared will appear beneath that contact just as they do in your contact s own Services panel You can browse these folders as you do your local folders You can also search a shared folder just as you can a local one Additionally you can search all of a contact s shared documents by clicking on the contact itself and then conducting the search You can also search all the shared documents of all of an account s contacts by clicking on the account and conducting the search Agents can be set up on shared folders contacts and accounts You cannot search browse or run or set up agents on a contact that is currently offline When you first view your contact s documents in the Document Table the documents you see are only summaries To view the whole document select the summary s of the documents s you would like to view and the click the Download button inside the document view or just above it There are also Download items in the File menu and in the popup menu when doc ument summary is right clicked Ctrl Click on MacOS The size of these files is not displayed in the Documents Table You can cancel the download of document summaries by selecting Cancel Downloads from any of the locations
78. mists Introducing Geneious Geneious is a self organizing automatically updating library of genomic and genetic data that provides a fully integrated Read about Biomatters Q READ BLOG visually advanced toolset for Sequence alignment and phylogenetics Sequence analysis BLAST Protein structure viewing NCBI EMBL Pubmed auto find amp More Geneious Hits No1 on Apple website 26 Apr 2006 Geneious 1 0 hit the No 1 spot as the most downloaded software on Apple s Math amp Science website just two days after its launch The high download rate matches the MORE Find out more about Geneious Bioinformatics finally gets user friendly 24 Apr 2006 If you can use email you can use Geneious 1 0 Any scientist who uses email can now embark on novel research once considered advanced bioinformatics Announcing the launch of Geneious MORE g Download Geneious 1 2 1 Click the link above to download the Geneious application for free today Top EU scientist visits Biomatters 22 Mar 2006 Dr Gerton Lunter one of Europe s leading experts in bioinformatics is Biomatters first distinguished overseas expert in a new program designed to build collaboration between Biomatters and the global bioinformatics MORE Copyright 2005 2006 Biomatters Ltd All rights reserved Made by Cactuslab Powered by Supermodel Figure 1 1 The Geneious home
79. n and Help For details on the menu bar see section 2 1 7 10 CHAPTER 1 GETIING STARTED Geneious pro trial 1 3 5 a PB UT 47 Back Forward Search Agents Note Sort Alignment Tree Completed found 23 documents Sources Y Local 32 pg 51 gt 185 22 DefaultPhylogenyDocumentTest 14 pg 58 Y HW 0 9 nef 40 n Y NewcastleDisease 1 lentogenic1 23 lentogenic2 19 Y Sample Documents 0 3D Structures 2 Alignments 82 O Nucleotide Documents 4 Protein Documents 2 9 Tree Documents 2 A ca sO L 7229463 Figure 1 5 The Toolbar 1 2 7 Popup Menus Many actions can be quickly accessed for data items services and sometimes selections in a viewer via popup menus To invoke a popup menu for an item simply right click Ctrl Click on MacOS The popup menu will contain the actions which are relevant to the item you clicked 13 Troubleshooting 1 3 1 Geneious won t start Geneious has some minimum system requirements It is compatible with the three most com mon operating systems Windows Mac and Linux Check that you have one of the following OS versions before you launch Geneious a v Show Labels v Large Icons v Back v History v Forward v Search v Agents v Divider 1 v Note v Sort v Divider 2 v Alignment v Tree v Divider 3 v Help Operating System System requirements Windows 2000 XP Mac OS X 10 4 Unix Linux
80. nable this to have Geneious check for the release of new versions everytime it is started If a new version has been released Geneious will tell you and give you a link to download it Also check for beta version of Geneious Enable this to also have Geneious alert you when new beta versions are released A beta version is a version that is released before the official release for the purposes of testing It may therefore be less stable than official releases Max memory available to Geneious allows you to enter how many megabytes of your com puters memory you wish to allow Geneious to use Search History This clears all the previously searched words in Geneious After this the auto completion drop box will be empty Connection settings These are described in the troubleshooting section of the manual 46 CHAPTER 2 RETRIEVING AND STORING DATA 2 9 2 Plugins The Plugins tab Figure 2 15 contains a table of the currently available plugins for Geneious To enable a plugin select the checkbox next to it To disable a plugin deselect the checkbox next to it eoe Preferences Plugins are optional modules that are not required in order for Geneious to function Plugins may be created by third party software vendors to provide additional features Use this page to enable disable plugins Plugin Name 4 Description Author s Version F mM 3 D structure importer and exp Provides importing and exporting support for a variety
81. nction of the option f Primer DNA Probe Size Min 18 Optimal 20 Max 27 Tm Min 57 Optimal 60 0 Max 63 0 GC Min 20 0 Optimal 50 0 Max 80 0 Product Tm Min Optimal Max Max Tm Difference GC Clamp 0 Max Hairpin Score 8 00 Max Primer Dimer Score 3 00 Max Number of N s 0 Max Poly X 5 Max 3 Stability 9 0 _ Allow primers inside target with penalty 1 0 Primer Picking Weights Figure 3 14 Primer design advanced options Primer Picking Weights At the bottom of both the advanced primer and DNA probe options there is a Primer Picking Weights button Clicking this brings up a second dialog containing many more options The purpose of all of these options is to allow you to assign relative weights to each of the param eters you can set in the options The weight specified here determine how important each of 78 CHAPTER 3 ANALYSING DATA the options are in choosing which primers and probes are best A value of 0 means the option is not considered in choosing the best primer then the higher the value the more important the options is Some of the weights allow you to specify a Less Than and Greater Than This is for options which allow you to specify an optimum score such as GC content These weights are used when looking at primers whose value for this option falls below and above the optimum respectively The other weights are for options which can only vary in one direction For details on
82. ng and Saving Images Geneious allows you to print or save as an image the current display for any document viewer This includes the sequence viewer tree view dotplot and text view 2 10 1 Printing Choose print from the file menu The following options are available Portrait or landscape Controls the orientation of the page Scale Can be used to decrease or increase the size of everything in the view while still printing within the same region of the page For many types of document views this will cause it to wrap to the following line earlier usually requiring more pages Size Controls the size the printed region on the paper Effectively increasing the size reduces the margins on the page 2 10 2 Saving Images Choose save to image file from the file menu The following options are available Size Controls the size of the image to be saved Depending on the document view being saved these may be fixed or configurable For example with the sequence viewer if wrapping is on you are able to choose the width at which the sequence is wrapped but if wrapping is off both the width and height will be fixed Format Controls image format Vector formats PDF and SVG have the advantage over raster formats PNG and JPG that they don t become pixelly when magnified Vector formats are only available in the pro version Resolution Only applies to raster formats PNG and JPG and is used to increase the number of pixels
83. nment variable to the name of your browser The details depend on your browser and type of shell For example If you are using Mozilla and bash place export BROWSER mozilla in your bashrc file When using a csh shell variant place setenv BROWSER mozilla in your cshrc file 1 3 TROUBLESHOOTING eoe Preferences General Plugins General settings Data Storage Location Users alexei Geneious Search history M Check for new versions of Geneious 7 Also check for beta versions of Geneious Use browser connection settings Hd Proxy host Connection settings Proxy port Config file location GO Cancel Apply Restore defaults Figure 1 7 General Preferences 14 CHAPTER 1 GETTING STARTED Chapter 2 Retrieving and Storing data Geneious is a one stop shop for handling and managing your bioinformatic data This chapter summarizes the different ways you can use Geneious to acquire update organize and store your data By the end of this chapter you should be able to e Know the purpose of each panel in Geneious Import Export data from various sources e Organize your data into easily accessible folders Automatically update your data Know about the advantages of the Note functionality Customize Geneious to meet your needs 2 4 The main window This section provides more information on each of the panels in Geneious Figure 2 1 2 1 1 The Services
84. nsitions A gt G C T generally occur more frequently than transversions Differences in the ratio of transversions and transversions result in various models of substitution When applicable Geneious indicates the target sequence similarity for the alignment scores i e the amount of similarity between the sequences for which those scores are optimal e Both protein and nucleotide pairwise alignments have choices for gap open gap exten sion penalties costs Unlike many alignment programs these values are not restricted to integers in Geneious There is also a check box for free gaps at ends If this is selected gaps at either end of the alignment are not penalized when determining the optimal alignment This is especially useful if you are aligning sequence fragments that overlap slightly in their starting and ending posi tions e g when using two slightly different primer pairs to extract related sequence fragments from different samples x Select alignment type Geneious Alignment Clustalw Cost matrix Blosum z y Gap open penalty ha Gap extension penalty hoo o ooo Alignment type Global alignment Needleman Wunsch Cancel Figure 3 9 Options for protein pairwise alignment 3 6 Multiple sequence alignments A multiple sequence alignment is a comparison of multiple related DNA or amino acid se quences A multiple sequence alignment can be used for many purposes including inferring 3 6 MULTIPLE SEQUEN
85. oad the whole genome select the summary s of the genome s you would like to download and the click the Download button inside the document view or just above it There are also Download items in the File menu and in the popup menu when document summary is right clicked Ctrl Click on MacOS The size of these files is not displayed in the Documents Table Be aware that whole genomes can be very large and can take a long time to download You can cancel the download of document summaries by selecting Cancel Downloads from any of the locations mentioned above Advanced Search also provides you with a number of options for restricting the search on a field depending on the field you are searching against For example if you are using numbers 28 CHAPTER 2 RETRIEVING AND STORING DATA to search for Sequence length or No of nodes you can further restrict your search with the second drop down box e is greater than gt e is less than lt e is greater than or equal to gt e is less than or equal to lt Likewise if you are searching on the Creation Date search field you have the following op tions e is before or on e is after or on e is between When searching your local folders you have the option of searching by Document type The second drop down list provides the options is and is not The third drop down lists the various types of documents that can be stored in Geneious such as 3D
86. of 3 D structure fil Biomatters 1 9 5 M Chromatogram importer and vi Imports Chromatogram documents Biomatters 1 3 5 M Clustal W importer Provides Clustal W import Biomatters 1 3 5 M Clustalw Builds alignments using ClustalW Biomatters 1 5 5 M Collaboration beta Allows you to share your documents with other geneious users and to acce Biomatters 0 9 NM DNA Strider importer Provides DNA Strider import Biomatters 1 3 5 Document Source Viewer A viewer which displays the entire source xml for documents This is not i Biomatters 1 5 5 M Dotplot View Provides a graphical view of a pair of sequences Biomatters 1 3 5 NM EMBL database Provides access to EMBL s database at http www ebi ac uk embl Biomatters 1 3 5 NM Endnote Provides import and export functions for Endnote libraries Biomatters 1 3 5 M FASTA Importer exporter Provides FASTA file importing exporting facilities Biomatters 1 3 5 i Generate consensus sequence Generates a single sequence representing the most common residues at e Biomatters 1 3 5 M Graphical Sequence View Provides a graphical view of sequence documents Biomatters 1 3 5 M Graphical Tree View Provides a graphical view of a tree s document Andrew Rambaut 1 0 M JMol A plugin for viewing 3D proteins structures Select a 3D structure documen Biomatters 1 3 5 M MEGA exporter Provides MEGA export Biomatters 1 35 V NCBI database Provides access to NCBI s database at http www ncbi nih gov Biomatters 1 3
87. of a primer pair can have The product size is the distance in bp between the beginning of the left primer to the end of the reverse primer 3 9 PCR PRIMERS PRO ONLY 75 e Optimal Product Size Specifies the preferred size of the product Setting this will mean primer pairs that have a product size close to this will be chosen over those that don t Warning Setting this options can cause the primer design process to take considerably longer to complete The final option in this section is Number of Pairs to Generate which specifies how many candidate pairs of primers and DNA probes to generate Setting this to 1 will give you only the primer pair which was considered best by the set parameters Output from Primer Design Once the task and options have been set click the OK to design the primers A progress bar may appear for a short time while the process completes When complete a new copy of each of the target sequences you selected will be created and possibly selected straight away Each of the sequences will have the designed primers and probes added to them as sequence annotations The annotations will be labelled with their rank compared to the other primers eg 1st 2nd where 1st is the best and what type they are Forward primer Reverse primer or DNA probe Also Primers are coloured green and probes red Detailed information such as melting point tendency to form primer dimers and GC content can be seen by hovering the mouse
88. om other user defined note types They do not have any constraints Here are some examples of Note Types Name Description Data type Protein size Size of the protein in kDa Protein sequences Tree building method Method used to build tree UPGMA Neighbor joining Phylogenetic trees x Note Type Name Note Type Description Field Name Field Type Constraints Text Y Edit Constraints dh Text TN Figure 2 11 Creating new Note Types You need to decide what other characteristics your Note Type will have Field name This defines what the field will be called It will be displayed alongside columns such as Description and Creation Date in the Documents Table You can have more than one Field in a single Note Type Field type This describes the kind of information that the column contains such as Text Integer and True False The full list of choices in Geneious is shown in 2 11 Constraints These are limiting factors on the data and are specific to each field type For ex 2 8 NOTES 43 ample numbers have numerical constraints is greater than is less than is greater or equal to and is less or equal to These can be changed to suit The constraints for each field can be viewed by clicking the View Constraints button next to the field This will show a pop up menu with the constraints you have chosen 2 12 xi Existing Note Types Note Person Source Name Date of birth
89. onstrate that a fast evolving virus feline immunodeficiency virus FIV can reveal details of the contemporary population structure and recent demographic history of its natural wildlife host Puma concolor that were not apparent from host genetic data and would be impossible to obtain by other means We suggest that rapidly evolving pathogens may provide a complementary tool for studying population dynamics of their hosts in shallow time PMID 16439664 Figure 3 8 Viewing bibliographic information in Geneious 3 2 Literature Geneious allows you to search for relevant literature in NCBI s PubMed database The results of this search are summarized in columns in the Document Table and include the PubMed ID PMID first and last authors URL if available and the name of theJournal When a document is selected the abstract of the article is displayed in the Document Viewer along with a link to the full text of the document if available and a link to Google Scholar both below the author s name s Note If the full text of the article is available for download in PDF format it can also be stored in Geneious by saving it to your hard drive and then importing it This will allow full text searches to be performed on the article 3 8 SEQUENCE DATA 63 As well as the abstract and links Geneious also shows the summary of the journal article in BibTex format in a separate tab of the Document Viewer This can be imported directly into a BIEX
90. p up a menu Clicking on New folder opens a dialog that will prompt you to name the folder The named folder is then created as a sub folder of the folder that you originally right clicked on Important Search results will be lost when you exit Geneious unless the downloaded docu ments have been copied or moved to one of your local folders In Geneious you can create new folders rename existing folders delete and export folders All these choices are available by either right clicking on the folder clicking on the action menu Mac OS X or by holding down the control button and clicking Mac OS X Also in Mac OS X you can also use the plus and minus buttons located at the bottom of the service panel to create and delete folders 2 5 STORING DATA YOUR LOCAL DOCUMENTS 35 25 1 Transferring data Itis quick and easy to transfer data to your local folders from either a Geneious database search or from your computer s hard drive Please check you have already set up your destination folders before continuing Moving documents from Geneious searches to your Local folders There are a number of ways to do this Drag and drop This is quickest and easiest Select the documents that you want to move Then while holding the mouse button down drag them over to the desired folder and release If you dragged documents from one local folder to another this action will move the documents so that a copy of the document is not left
91. page 1 2 USING GENEIOUS FOR THE FIRST TIME 7 11 1 Upgrading to new versions All versions of Geneious from 0 9d onwards will automatically self update and retain all of your data If you are upgrading from 0 9b or earlier please see the section Upgrading from Version 0 9b or earlier 12 Using Geneious for the first time Figure 1 2 displays the Geneious window when opened for the first time There are six main panels g Geneious pro File Edit Tools Help a Back Search Agents Note Menu Bar LA La meme Alignment Tree d Tool Bar ane Completed Found 2 documents Sources e Local 0 gt acna 30 gt ebaC 58 D Envelope sequences 108 B f Genes 0 5 Downloaded 336 LO Extracted 52 gt New 29 27 unread Services Panel t Sample Documents 0 gt 3D Structures 2 Alignments 7 gt Nucleotide Documents 3 F Protein Documents 2 Tree Documents 2 gt Temporary 3 2 unread S Collaboration Edi strabo richard talk geneious com 5 8 Emel e Uniprot amp 9 Ncer 5 BLAST 2 blastn 3 blastp 9 blastx 7 tblastn 7 tblastx E Genome B Nucleotide 2 PopSet e Protein 2 PubMed 3i Structure e Taxonomy unread jad E T Alignment Z Include Subfolders 32 More Options 5 Name Summary n bicnmentof 2csquences house mouse ger 2 Folder Alignments Document Table Tutorial Help
92. percent similarity over the alignment This is computed by looking at all pairs of bases at the same column and scoring a hit one when they are identical divided by the total number of pairs Identical sites The number of sites that are identical across all sequences 3 1 DOCUMENT VIEWERS IN GENEIOUS 55 Annotations Some protein and nucleotide sequences come with annotations and these can be viewed within Geneious sequence viewer In the presence of annotations the options panel includes an An notations check box Figure 3 3 Uncheck the check box to turn off all annotations Individual annotation types can be turned on or off by using the check boxes next to them Sequence View Text View Notes gt Extract Reverse Complement GP Translate Edit B Annotation i Save n 30 4 3 549172 MRWESRARSA CTWISCTSES PCSTSCPPSP AAPTEERRRS EPOORRRPSS SPNRRYRGWI TSPCPrRSE4 ZOOM Level Transmembanepot nHWU3 a US20 gene a f v Annotations Hydropho B ll m 1 oo oo a o o A m odis oS ol Compress annotations ni ae 100 Tin To 130 a Sie 3 Ey 549172 YKRRMGAPQR ELCAETVATMQ AQEANALLLS RMEALEWFKK FTMWERVYAI FIFQLAFSFG LGSVFWEGFP CansmembaneprotenHWI3 MENMEM US20 gene OC source 1 MA Transmembrane region m A Hydropho B ea Baa oii n n mou MESS Rand nodi AA gene 1 M Protein 1 EHE US20 Show All Hide All re Transmembrane region ee protein HWS
93. r and R Sokal A quantitative approach to a problem in classification Evolution 11 1957 130 162 69 70 72 15 SB Needleman and CD Wunsch A general method applicable to the search for similarities in the amino acid sequence of two proteins J Mol Biol 48 1970 no 3 443 53 64 65 16 C Notredame DG Higgins and J Heringa T coffee A novel method for fast and accurate multiple sequence alignment J Mol Biol 302 2000 no 1 205 217 22 17 F Ronquist and JP Huelsenbeck Mrbayes 3 Bayesian phylogenetic inference under mixed models Bioinformatics 19 2003 no 12 1572 4 69 18 N Saitou and M Nei The neighbor joining method a new method for reconstructing phyloge netic trees Mol Biol Evol 4 1987 no 4 406 25 69 70 72 19 TF Smith and MS Waterman Identification of common molecular subsequences Journal of Molecular Biology 147 1981 195 197 64 65 20 DL Swofford Paup Phylogenetic analysis using parsimony and other methods version 4 0 Sinauer Assoc Sunderland Mass 1999 69 21 K Tamura and M Nei Estimation of the number of nucleotide substitutions in the control region of mitochondrial dna in humans and chimpanzees Mol Biol Evol 10 1993 no 3 512 526 71 22 JD Thompson TJ Gibson F Plewniak F Jeanmougin and DG Higgins The clustal x windows interface flexible strategies for multiple sequence alignment aided by quality analysis tools Nucleic Acids Res 2
94. r versions of Geneious e Be familiar with the layout of Geneious e Be able to solve connection problems 11 Downloading amp Installing Geneious Geneious is free software that can be downloaded from http www geneious com The panel on the right hand side of the Geneious home page Figure 1 1 allows users to down load Geneious onto three operating systems Windows Mac OS X or Linux Make sure your system meets the requirements before downloading Geneious Click on the operating system to take you to the Download Geneious page Follow the in structions on the page to download and save Geneious It is often easiest to save the program to your desktop Once Geneious is saved double left click on the Geneious icon to start installing the program While this is happening you will be prompted for a location to install Geneious Please check that you are satisfied with the location before continuing CHAPTER 1 GETIING STARTED Welcome to Biomatters Biomatters Mozilla Firefox Ele Edit View Go Bookmarks Tools Help Qa A gt gt 3 e A O http www biomatters com f Getting Started E Latest Headines ABOUT BIOMATTERS i NTA siomatters BIOMATTERS www biomatters com CREATORS OF geneious Research in a flash Welcome to Biomatters Biomatters develops unique easy to use and essential bioinformatics software that speeds up and simplifies research for all molecular biologists and bioche
95. ract Mee se Complement amp Translate E ss BNGNEECNH CHECHECHEN NNHCNCECH EN AB234107 1 AB234104 1 AB234103 1 Similarity cres BNNGNHGG BEGHACCGNH CEARRAGHOH MET Panel Document Viewer I AB234101 1 L m Cursor before residue 85 ungapped residue 85 This is your Local Documents service You can drag and drop articles and sequences that you find in external databases for example NCBI into this folder for future reference our Help Panel Right click on this folder and select Share Folder to allow your contacts to search it over the Collaboration service If you share a folder its sub folders are always also shared Any folders you share can be searched by all your contacts Figure 2 1 Geneious main window 2 1 THE MAIN WINDOW 17 Geneious Service Panel allows you to access Your Local Documents NCBI databases BLAST 1 Genome Nucleotide PopSet Protein Pubmed Structure and Taxonomy An EMBL database Uniprot Your contacts Geneious databases You can view options for any selected service with the right mouse button or by clicking the Options button at the bottom of the Service Panel in Mac OS X 2 12 The Documents Table The Document Table displays your search results or your stored documents While search results usually contain documents of a single type a local folder may contain any mixture of documents whether they are sequences publ
96. results in a range of formats If you are new to bioinformatics please take the time to familiarize yourself with this chapter as there are a number of formats to be aware of 2 2 1 Importing data from the hard drive to your Local folders To import files from your hard disk click File Import From file This will open up a file dialog Select one or more files and click Import If Geneious automatic file format 2 2 IMPORTING AND EXPORTING DATA 21 detection fails select the file type you wish to import Figure 2 3 before selecting the name of the file s The different file types are described in detail in the next section File name Geneious geneious Endnote 8 0 xml Nexus nex PDF pdf 3 D structure documents pdb mol xyz cml gpr hin nwo Fasta Nucleotide fasta Fasta Protein Fasta Unformatted sequence seq Newick newick tree Files of type Figure 2 3 File import options 2 2 2 Data input formats Geneious version 2 5 4 can handle all the following file formats Cancel Format Extensions Data types Common sources Clustal aln Alignments ClustalX Endnote 8 0 XML xml Journal article references Endnote Journal article websites Fasta fasta fas etc Sequences alignments PAUP ClustalX BLAST FASTA Geneious xml geneious Preferences databases Geneious Newick tre tree etc Phylogenet
97. s and the number of segregating sites in time structured population samples Roald Forsberg Alexei J Drummond Jotun Hein 2004 BMC Genet 6 35 Molecular phylogeny of coleoid cephalopods Mollusca Cephalopoda using a multigene approach 4 effect of data partitioning on resolving phylogenies in a Bayesian framework LJ Tree measures and the numb Q Molecular phylogeny of coleoi Figure 2 6 Advanced Search 2 3 2 Autocompletion of search words Geneious remembers previously searched keywords and offers an auto complete option This works in a similar way to Google or predictive text on your mobile phone If you click within the search field a drop down box will appear showing previously used options 2 4 Public databases Geneious allows you to search several public databases in the same way that you can search your local documents The search process is described in section 2 3 Geneious is able to communicate with a number of public databases hosted by the National Centre for Biotechnology Information NCBI and the European Molecular Biology Laboratory EMBL You can access these databases through the web at http www ncbi nlm nih gov and http www ebi ac uk embl respectively Both are well known and widely used storehouses of molecular biology data 30 CHAPTER 2 RETRIEVING AND STORING DATA When viewing data from a public database such as NCBI the data can not be modified This is demonstrated by the small padlock
98. s to all the data types that Geneious can handle including molecular sequences phylogenetic trees and journal articles How do I add Notes There are 3 ways to open the Add Note dialog 1 Clicking on a document Select a document and right click Ctrl click on MacOS to open an options menu Scroll down to Note and select it 2 Through the Toolbar Click the Note icon on the Toolbar 3 From the Tools menu Click on the Tools menu on the Toolbar Scroll down to Add Note and click eoe Add Note Existing Note Types Note Person Protein Size Protein Size Protein Size View Constraints d Create Note Type Delete Note Type Edit Note Type Add Note Done Figure 2 10 The Add Note Menu The top left panel lists the existing Notes types Click on a Note type and add your note For example if you choose to add a note of the type Person you could add your name to record that the sequence was produced by you 42 CHAPTER 2 RETRIEVING AND STORING DATA Creating Note Types Geneious does not restrict you to certain note types You can create new note types to annotate your documents To create a new note type select a document and open the Add Note dialog Click on the Create Note Type button in the left hand panel of this window This brings up a window similar to that displayed in 2 10 Note The Note Type Name and Note Type Description fields distinguish your Note type fr
99. same principles can be applied to un rooted trees by replacing clades with splits Each branch edge in an unrooted tree corre sponds to a different split of the taxa that label the leaves of this tree 3 9 PCR Primers pro only Geneious allows you to design and test both PCR primers and DNA or hybridisation probes for nucleotide sequences 3 9 1 The Basics To design or test primers simply select one or more nucleotide sequences and either select Primers from the Tools menu or right click Ctrl click on MacOS on the document s and select Primers The Primer Design dialog which is then displayed contains two main areas Task Firstly you can choose to design a primer pair design a custom combination of primers and or probe Design Other or test existing primers For details on testing primers refere to section 3 82 If you select Other or Test then checkboxes allow you to choose which of forward primer re 74 CHAPTER 3 ANALYSING DATA 60506 Primers Select Task Design Pair gt Design Other Test Primers Included Region From 1 To 449 Product Must Include From To Product Size From 100 To 1000 Optimal Product Size Number of Pairs to Generate 5 4 rl More Options Cancel OK Figure 3 12 The primer design dialog verse primer and DNA probe you wish to design or test The Design Pair option is the same as choosing forward and reverse in Other This is the most common operation b
100. se sequence alignments A pairwise sequence alignment is an attempt to determine the regions of homology in two se quences Over evolutionary time related DNA sequences diverge through the accumulation of nucleotide substitutions insertions and deletions As a result an alignment is essential to eval uate the degree of similarity between two related sequences If two nucleotides or amino acids are present in the same column of a pairwise sequence alignment they are aligned This im plies that they are homologous and have evolved from a common ancestral nucleotide amino acid While the aligned nucleotides indicate common ancestry the regions of a pairwise alignment that contain gaps represent areas where insertions or deletions have occurred in the evolution ary history of one or both of the sequences Pairwise alignments can be of two main types local and global A Local Alignment A local alignment is an alignment of two sub regions of a pair of sequences 19 This type of alignment is appropriate when aligning two segments of genomic DNA that may have local regions of similarity embedded in a background of a non homologous sequence A Global Alignment A global alignment is a sequence alignment over the entire length of two or more nucleic acid or protein sequences In a global alignment the sequences are assumed to be homologous along their entire length 15 3 5 PAIRWISE SEQUENCE ALIGNMENTS 65 3 5 1 Scoring systems in pairw
101. se the agent to check what docu ments are currently available when the agent is created Then when the agent searches it will only get documents that are new since it was created e g If you have already read all publi cations by a particular author and you want the agent to only get publications released in the future 38 CHAPTER 2 RETRIEVING AND STORING DATA Alternatively you can click the Create Agent button which is available in some advanced search panels This will use the advanced search options you have entered to create the agent The easiest way to organize your search results is to create a new folder and name it appropri ately You can do that by navigating to the parent folder in the Deliver to box and click New Folder or by creating a new folder beforehand 1 Right click Ctrl click on MacOS on the Sample Documents or Local folders This brings up a popup menu with a New Folder option 2 Create a new folder and name it according to the contents of the search For example type CytB if searching for cytochrome b complex 3 Oncecreated select the new folder You can now select the Create or Create and Run The agent will then be added to the list in the agent dialog and it will perform its first search if you clicked Create and Run Otherwise it will wait until its next scheduled search 000 Create Agent Search Database 7 PubMed HJ Match all HH of the following
102. span several sequences and drag it to the left or right Dragging will either move residues over existing gaps or open new gaps when necessary Dragging a selection consisting entirely of gaps moves the gaps to the new location To quickly select a single residue double click on it Triple clicking will select a block of residues within a single sequence Quadruple clicking selects a block of residues in multiple sequences The shift and control alt on a Mac keys can be combined with the keyboard arrow keys to select sequence and alignment regions The shift key extends the current selection and holding down the control alt on a Mac key while pressing the keyboard arrow is equivalent to pressing it 10 times These can be used together For example in an alignment if you have a region of one sequence selected and would like to select the same region in all sequences then you could 3 1 DOCUMENT VIEWERS IN GENEIOUS 57 press control up until you reach the first sequence and then press control shift down and few times until all sequences are selected Sequences can be reordered within an alignment by clicking the sequence name and dragging Sequences can be removed from an alignment by right clicking Ctrl click on MacOS on the sequence name and choosing the remove sequence option Alternatively select the entire sequence by clicking on the sequence name and press the delete key After editing is complete click Save to permanent
103. t the alignment contains the sequences in the same order e Additional options Any additional parameters accepted by the ClustalW command line program can be entered here Refer to the ClustalW manual for a description of the avail able parameters After entering the desired options click OK and ClustalW will be called to align the selected sequences or alignment Once complete a new alignment document will be generated with the result as detailed previously 3 8 Building Phylogenetic trees Geneious provides some basic phylogenetic tree reconstruction algorithms for a preliminary in vestigation of relationships between newly acquired sequences For more sophisticated meth ods of phylogenetic reconstruction such as Maximum Likelihood and Bayesian MCMC we recommend specialist software such as PAUP 20 and MRBAYES 17 Geneious implements the Neighbor joining 18 and UPGMA 14 methods of tree reconstruc tion 3 8 1 Phylogenetic tree representation A phylogenetic tree describes the evolutionary relationships amongst a set of sequences They have a few commonly associated terms that are depicted in Figure 3 7 and are described below 70 CHAPTER 3 ANALYSING DATA Branch length A measure of the amount of divergence between two nodes in the tree Branch lengths are usually expressed in units of substitutions per site of the sequence alignment Nodes or internal nodes of a tree represent the inferred common ancestors of the sequenc
104. t tree3 txt 1 Trees Figure 2 2 The document table when browsing the local folders Selecting a document in the Document Table will display its details in the Document View Panel Selecting multiple documents will show a view of all the selected documents if they are of similar types eg Selecting two sequences will show both of them side by side in the sequence view There are several ways to select multiple documents 18 CHAPTER 2 RETRIEVING AND STORING DATA e Hold Ctrl Command apple key on Mac OS and click to add the document to the current selection e Hold Shift and click to add the document and all documents between it and your previous selection e On windows the right mouse button can be clicked and held while moving the mouse to easily select a block of documents The popup menu will appear once the mouse is released so the newly selected documents can quickly be manipulated Double clicking a document in the Document Table displays the same view in a separate win dow To view the actions available for any particular document or group of documents right click Ctrl click on MacOS on a selection of them Ctrl Click on Mac OS X These options vary depending on the type of document The Document Table has some useful features Editing Values can be typed into the columns of the table This is a useful way of editing the information in a document To edit a particular value first click on the document
105. th of the sequence Il Sequence Residues Size Size of the document H Summary summary of a documen a URL A url link to the published article M m Select a document from the pant Figure 2 7 Searching the local documents on a user defined field Similarity BLAST like searching It is possible to search your local documents not only for text occurrences but by similarity to sequence fragments Click the small arrow at the bottom of the large T to the left of the search dialog select Nucleotide similarity search or Protein similarity search and enter the sequence text Geneious will try to guess the type of search based on the text so that simply entering or pasting a sequence fragment may change the search type automatically The search locates documents containing a similar string of residues and orders them in de creasing order of similarity to the string The ordering is based on calculating an e value for each match You can read more about the e value in subsection 2 4 2 For the search to be successful you need to specify a minimum of 11 nucleotides and 3 amino acids Note that search times depend on the number and size of your sequence documents and so may take a long time to complete 2 6 AGENTS 37 2 5 3 Checking and changing the location of your Local folders To check where your Local folders are being stored on your hard drive open the Tools menu in the Menu Bar Click Tools Preferences General
106. the whole sequence e Select all Use the keyboard shortcut Ctrl A to select everything in the panel The available actions are Extract Extract the selected part of a sequence or alignment into a new document Reverse Complement Reverse sequence direction and replace each base by its complement This is available only for nucleotide sequences Translate Translate DNA into protein Clicking on this choice brings up a list of genetic codes that can be used Choose the appropriate one and click OK This is available only for nucleotide sequences Edit Annotations and Save Editing sequences and alignments To edit sequence s or an alignment click the Edit toolbar button After selecting a residue or a region you can either type in the new contents or use any of the standard editing operation such as Copy Ctrl C Cut Ctrl X Paste Ctrl V and Undo Ctrl Z All operations are under the main Edit menu Selecting a region enables the Annotations button as well which opens an annotation en try dialog Enter an annotation name and select a existing type or type a new one Click on More Options to enter additional properties for that annotation Double click on an existing annotation to edit it or right click Ctrl click on MacOS to display a pop up menu to delete annotations You can also copy an annotation from one sequence to another from the pop up menu When editing an alignment it is possible to select a region which may
107. tograms each chromatogram can be turned on or off individually using the checkbox s below Note that since the distance between bases as inferred from the trace varies the trace may be either contracted or expanded compared with the raw data Show quality This is available with enabled chromatogram traces It displays a quality measure typically Phred quality scores for each base as assessed by the base calling program The quality is shown as a shaded bar graph overlaid on top of the chromatogram Note that those Scores represent an estimate of error probability and are on a logarithmic scale the highest bar represents a one in a million 10 9 probability of calling error while the middle represents a probability of only a one in a thousand 1073 Consensus This option is available when viewing alignments When checked the viewer displays the consensus sequence with the aligned sequences The consensus sequences has the same length and shows which residues are conserved are always the same and which residues are variable Basically a consensus is constructed by taking the most frequent residue at each site position with some special considerations for ties and gaps Ties in a DNA alignment are resolved by taking the appropriate IUPAC nucleotide ambiguity code i e R for A or G etc A tie in a protein alignment is indicated by the general ambiguity symbol X No gaps in consensus When this box is checked gaps are not counted and the
108. ts You can use Font to change the size of the labels 3 1 DOCUMENT VIEWERS IN GENEIOUS 61 Root Branch Node Tip Figure 3 7 Phylogenetic tree terms 3 15 The Chromatogram viewer The Chromatogram viewer provides a graphical view of a the output of a DNA sequencing machine such as Applied Biosystems 3730 DNA analyzer The raw output of a sequencing machines is known as a t race a graph showing the concentration of each nucleotide against sequence positions The raw trace processed by a Base Calling software which detects peaks in the four traces and assigns the most probable base at more or less even intervals Base calling may also assign a quality measure for each such call typically in terms of the expected probability of making an erroneous call Sequence Logo When checked bases letters are drawn in size proportional to call quality where larger implies better quality or smaller chance of error Note that the scale is logarithmic the largest base represents a one in a million 10 9 or smaller probability of calling error while half of that represents a probability of only a one in a thousand 1073 Mark calls Draw a vertical line showing the exact location of the call made by the base calling software Layout Options controlling layout and view Those include X and Y axis scaling size of largest base letter when Sequence logo is on and minimum size of base letter to prevent bases of low quality becoming unreadabl
109. ulting alignment is placed in the folder containing the sequences aligned In some cases building a guide tree can take a long time since it requires making a pairwise alignment between each pair of sequences The build guide tree via alignment option may speed this part by taking a different route First make a progressive multiple alignment using a random ordering and use that alignment to build the guide tree Notice that while this typically speeds up the process that may not be the case when the sequences are very distant genetically 7 Alignment x Select alignment type Geneious Alignment Clustalw Cost matrix Blosumez v Gap open penalty A Gap extension penalty EEC Alignment type Global alignment y Refinement options Refinement iterations 2 v build guide tree via alignment Faster Cancel Figure 3 10 The multiple alignment window 3 7 Sequence alignment using ClustalW pro only ClustalW is a widely used program for performing sequence alignment 23 22 If you have ClustalW installed your computer Geneious pro allows you to run ClustalW directly from inside the program without having to export or import your sequences If you do not have ClustalW or are unsure if you do you should attempt to perform a ClustalW alignment without specifying a location and Geneious will present you with options including details on how to download ClustalW and an automatic search for finding its location on
110. urrent residue color scheme Zoom options These are a few options that can be turned on or off Auto zoom to selection If this option is turned on when you select a range of sequence residues the sequence viewer automatically zooms in or out so that the selected piece fills the entire viewing area A shortcut is to select the zoom options checkbox without expanding it Default zoom level This options allows the user to specify the initial level of zoom when viewing a sequence Please note Geneious automatically restores the previous zoom level and over rides the de fault settings when you return to a sequence that you were previously viewing Statistics This displays some statistics about the sequence being viewed They correspond to the se quence alignment being viewed or the highlighted part of the sequence alignment The length of the sequence or part of the sequence is displayed next to the Statistics option Residue frequencies This section lists the residues for both DNA and amino acid sequences and also for alignments It gives the frequency of each nucleotide or amino acid over the entire length of the sequence including gaps If there are gaps then a second percentage frequency is calculated ignoring gap characters The G C content for nucleotide sequences is shown as well for easy reference The following statistics are available when viewing alignments or multiple sequences Pairwise Similarity The average
111. y sequence this will be automatically filled in if you en tered a sequence in the basic search then clicked More Options Below the search box are all of the advanced options The available options vary depending on the kind of BLAST search you have selected For details on each of the options you can hover your mouse over the option to see a short description or refer to the NCBI BLAST documentation at http www ncbi nlm nih gov blast blastcgihelp shtml If you have a mirror of the NCBI BLAST databases you can set Geneious to use this by select ing the NCBI BLAST service and then clicking the Change database location button and entering the url for the mirror 2 Storing data Your Local Documents Geneious can be used to store your documents locally Under the Local folder in the Services Panel you are able to create sub folders to organize and store a variety of document types 2 5 34 CHAPTER 2 RETRIEVING AND STORING DATA Table 2 5 Geneious document types Document type Geneious Icon Nucleotide sequence Protein sequence Phylogenetic tree 3D structure Sequence alignment Chromatogram SE LROR Journal articles PDF 2 Other documents This is also where you can set up special folders to receive documents that are downloaded by a Geneious agent To create a new folder in Geneious select the Local folder or a sub folder icon in the services panel and right click Ctrl Click on MacOS This will po
112. your hard drive To perform an alignment using ClustalW select the sequences or alignment you wish to align and select the Alignment button from the Toolbar At the top of the alignment options win dow there are buttons allowing you to select the type of alignment you wish to do Choose ClustalW here and the options available for a ClustalW alignment will be displayed 3 8 BUILDING PHYLOGENETIC TREES 69 The options are e ClustalW Location This should be set to the location of the ClustalW program on your computer Enter the path to it in the text field or click the Browse button to browse for the location If the location is invalid and you attempt to perform an alignment Geneious will tell you and offer the options detailed above for getting or finding ClustalW e Cost Matrix Use this to select the desired cost matrix for the alignment The available options here will change according to the type of the sequences you wish to align You can also click the Custom File button to use a cost matrix that you have on your computer the format of these is the same as for the program BLAST e Gap open cost and Gap extend cost Enter the desired gap costs for the alignment e Free end gaps Select this option to avoid penalizing gaps at either end of the alignment See details in the Pairwise Alignment section above e Preserve original sequence order Select this option to have the order of the sequences in the table preserved so tha
Download Pdf Manuals
Related Search
Related Contents
Manual del usuario del servidor Sun Enterprise 420R Whirlpool GJ8646XD User's Manual Watkins GLW Owner`s manual Muvit MUSCP0201 screen protector Dokument_14. guia del usario de ViewStation Neat Image user guide Niles Audio RCA-HT User's Manual User`s Manual for the M5-1003 DVI/USB/RS-232 and Audio Copyright © All rights reserved.
Failed to retrieve file