
SiteScan User Guide


Contents

1. Welcome

Overview

Introduction

SiteScan is a Windows desktop program that checks websites for broken links and analyzes page performance. From a starting URL, SiteScan will spider and analyze every page or file on a website or intranet. It finds issues such as slow, mistyped or broken links, and lets the user drill down to perform detailed page analysis and locate the relevant code in the HTML source. Results are displayed in real time and can be interacted with immediately while scanning continues in the background.

Intuitive and ready to run out of the box, SiteScan is easy to use and highly configurable. It provides many user-defined options such as scan depth, timeouts, and scheme (protocol) and file type exclusion rules. It runs from any Windows desktop PC or server and communicates directly with the website being scanned. For complete data privacy, SiteScan does not use any intermediate cloud-based services during processing.

Available in both free and registered editions, SiteScan Broken Link Checker is licensed to an individual user, with heavily discounted bulk licensing available for corporate and educational users. The free edition of SiteScan is provided f
2. Electronic Publication (epub), Microsoft Word Document (doc)

Options

Options: Scanning, Export, Licensing

Scanning

Scanning and Performance Options

To display Scanning and Performance options, click the Options button on the toolbar ribbon, then click the Scanning button on the Options pop-up window.

Scanning options should be configured before starting a scan and are saved between SiteScan sessions. Several options can be changed while a scan is in progress and take immediate effect, such as adding or removing a file extension from an exclusion filter, or even changing the scan depth. Options that cannot be changed during a scan are typically those that require the application to be restarted or that relate to performance utilization; these are grayed out while a scan is in progress.

Overview

Scanning options are at the heart of getting the best out of SiteScan; we recommend you explore and experiment to uncover its full potential. Scanning options provide detailed control over the speed and depth of scanning, while allowing limits to be set and exclusion filters to be implemented.

SiteScan ships with all options configured and ready to run. By default it will scan downward from a starting URL and test all links it finds for errors, response time and HTML body load time.

The information icon (i) adjacent to each option provides a
3. Options: Display the Options dialog.
Start: Start a new scan.
Load: Load existing scan results from file.
Clear Filters: Removes any applied filters to display all results.
Columns: Choose link information columns to display on the results grid.
Buy Now: Visit the SiteScan website to purchase a license key, or enter a purchased key to activate SiteScan.
Stop: Stop a running scan.
Save: Save current results to file.
Errors: Allows results to be filtered for all or only specific categories of errors. See the Error Filters section.
Best Fit: Auto-size column widths to fit the screen.
User Guide: Display the SiteScan user guide.
Pause: Pause or resume scanning.
Crawl: Slow down a running scan.
Refresh: Refresh performance and status for existing results without spidering.
Clear: Clear all existing results, or only performance and status data.
Export: Export results to Excel or CSV format.
Styles: Filters the results to display only linked CSS stylesheets.
Scripts: Filters the results to display only linked script files, e.g. JavaScript.
Emails: Filters the results to display only email addresses specified using the mailto scheme.
Features: Show or hide grid features such as Group By, Find Panel or Filter Rows.
Print: Print or print preview results.
HTML: Filters the results to display HTML pages. Filtering is based on the content type header returned from the server.
Files: Filters the results to display linked files by l
5. Drag a column from the pop-up window to the grid.

[Screenshot: Drag and Drop Columns pop-up listing available columns such as Body Load Time (s), Full Load Feedback, Full Load Time (s), Is Local, Last Modified, Link Base URI, Link Relative URL (original, decoded) and Link Relative URL (original)]

To remove columns:

1. Click and drag a column heading off the grid. The cursor will change to a large X when the column can be dropped to be removed.

Previewing a Result

Individual results can be inspected in detail using the Page Analysis tabs below the main results grid. To preview a result, select it by clicking its row in the results grid.

Filtering

Results can be filtered in several ways:

By selecting a predefined error filter from the Filter Errors drop-down button in the Filters section of the tool ribbon.
By selecting a predefined content type filter from the Filters section of the toolbar ribbon, such as Email, Styles or Scripts.
By typing in to the auto filter row immediately below the column header.
By hovering over a column header to display the drop-down filter icon, then selecting a column value to filter on.

Sorting

Results are sorted by clicking on a column header. Clicking the header multiple times toggles sorting between ascending, descending and unsorted.

Grouping

Results are grouped by dragging a column header to the Group By row shown immediately above the column headers. Nested groupings can be created by dragging
6. g JavaScript

Highlighting Links

To highlight a link in the source code:

1. Select the row for the page to analyze in the main Results Grid.
2. Click the Links tab to display all HTML <A> style links.
3. Scroll to the link you wish to highlight and click to select it.
4. All occurrences of the link will be highlighted in the source code using a yellow background.
5. The Source panel caption will display a number showing how many occurrences SiteScan managed to find.

Highlighting Images

To highlight an image in the source code:

1. Select the row for the page to analyze in the main Results Grid.
2. Click the Images tab to display all images on the page.
3. Scroll to the image you wish to highlight and click to select it.
4. All occurrences of the image will be highlighted in the source code using a yellow background.
5. The Source panel caption will display a number showing how many occurrences SiteScan managed to find.

Saving Source

Source code can be saved in various formats by clicking the Save As button in the Source toolbar ribbon:

Text Files (txt)
HyperText Markup Language Format (htm, html)
Web Archive, single file (mht)
Word 2007 Document (docx)
OpenDocument Text Document (odt)
Word XML Document (xml)
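The occurrence count that the guide describes for highlighted links and images can be pictured with a small sketch. This is only an illustration of the idea, not SiteScan's own matching logic: the function name `find_occurrences` and the sample markup are hypothetical, and a plain literal text search is assumed.

```python
import re


def find_occurrences(source: str, url: str) -> list[tuple[int, int]]:
    """Return (start, end) character offsets of every literal match of url.

    Hypothetical helper: assumes a simple literal search of the page source.
    """
    if not url:
        return []
    return [(m.start(), m.end()) for m in re.finditer(re.escape(url), source)]


# Made-up page source containing the same link twice.
html = (
    '<a href="products/default.aspx">products</a>\n'
    '<a href="about.aspx">about</a>\n'
    '<a href="products/default.aspx">more products</a>\n'
)

spans = find_occurrences(html, "products/default.aspx")
# The number of spans is what a caption such as "Source (2)" would report.
print(f"Source ({len(spans)})")
```

Each returned span marks a region that a viewer could paint with a yellow background; the length of the list is the number shown in the caption.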
7. tool tip summary of the feature, along with guidance and best practice where applicable.

[Screenshot: SiteScan Options dialog, Scanning and Performance page. General Settings: Maximum number of pages to spider for links (0 = unlimited); Maximum number of links to test & follow (0 = unlimited); Only scan first occurrence of each link; Use intelligent data copying where identical links are found (recommended); Server response timeout (10 seconds); Body load timeout (10 seconds); Maximum automatic redirections to follow (5); Use default proxy settings and user credentials. Scan Depth and Filtering: Only scan pages or files on these schemes (http, https, file); Include the following link types in results (javascript, mailto)]

Options in Detail

Screenshot text, Scanning and Performance page: General Settings: Maximum number of pages to spider for links (0 = unlimited); Maximum number of links to test & follow (0 = unlimited); Only scan the first occurrence of each link; Use intelligent data copying where identical links are found (recommended); Server response timeout (10 seconds); Body load timeout (10 seconds); Maximum automatic redirections to follow (5); Use default proxy settings and user credentials. Scan Depth and Filtering: Only sc
8. Scan Logo

[Screenshot: Images panel listing the Alt Text and Image URL of each image on the selected page. Alt text entries include "Spidering a website for 404 Not Found errors", "Highlight a broken link in the HTML source code", "Filter spidered pages by HTTP status code", "Filtering spider results to display email addresses", "Exporting site crawl results to Microsoft Excel", "Download SiteScan from www.byteshift.co.uk" and "Comodo Secure Logo"]
9. SiteScan User Guide

Table of contents

Welcome
Overview
Table of Contents
User Interface
Tool Ribbon
Results Grid
Properties Panel
Out Links Panel
In Links Panel
Images Panel
Styles Panel
Scripts Panel
Emails Panel
Source Panel
Options
10. an pages or files on these schemes (http, https); Include the following link types in results (javascript, mailto); Only extract and follow links from pages found: On the same level or beneath the Starting URL, On the same sub domain as the Starting URL, or On the same sub domain as the Starting URL or any sub domain; Skip pages or files with the following extensions. Resource Utilization: Scan resource utilization (20); Give priority to: Spidering pages, or Testing & timing links. Timing Measurements, General Settings: Measure server response time; Measure html document load.

Option, Description, Note

Option: Maximum number of pages to spider for links
Description: The maximum number of pages on the website which will be scanned and spidered for links to follow.
Note: When set to zero, all pages on the website will be scanned, provided they meet any Scan Depth and Filtering criteria specified.

Option: Maximum number of links to test & follow
Description: The maximum number of links found on spidered pages which will in turn be scanned and spidered for new links to follow.
Note: When set to zero, all links found on all pages will be scanned, provided they meet any Scan Depth and Filtering criteria specified. This setting is ignored when Refreshing existing results.

Option: Scan resource utilization
Description: The maximum number of concurrent we
11. b requests which SiteScan will make when scanning pages and spidering links. A higher value will result in greater CPU and memory usage, as well as increasing demand on the web server.
Note (text clipped in the source): Use with caution too high a value ... time outs due to excessive serve ... In link heavy sites ... application ... impaired ... The default value of 20 ... point ... Scale this number up or do ... optimum performance with overlo ... network or web server

Option: Give priority to
Description: Spidering Pages: Priority is given to link extraction. Select this option when gauging the size of a site or the potential workload is a priority.
Testing & timing: Priority is given to testing found links by capping the backlog of links waiting to be tested at 500. Select this option when knowing the number of links waiting to be tested is less important than the actual testing.

Option: Only scan the first occurrence of each page
Description: Select this option to scan and report on every page in a site only once. Subsequent links to the same page from other pages will be ignored. De-select this option to see all occurrences of links to all pages throughout the site. Ideal when you wish to find all links in a site pointing to a particular page.

Option: Use intelligent data copying
Description: Select this option to allow SiteScan to copy existing link performance and status data rather than scanning identical links multiple times. When selected, both scanning time and web server load are significantly reduced.

Option: Server response
12. d page

Refreshing

The list of images found on the page can be refreshed directly from the web server by either:

Double-clicking the related row in the main Results Grid.
Clicking the green Refresh icon to the right of the tab headings.

Exporting

The list of images can be exported to Excel in XLSX format. Click the XLSX icon to the right of the tab headings.

Styles Panel

The Styles panel lists all style sheet type links found on the page currently selected in the main Results Grid.

Overview

The list of style sheets displayed is obtained at the point the page was crawled and takes in to account any spider rules specified in Options, such as scheme or file type filtering. Therefore those listed may only be a subset of all style sheets on the web page.

Each style sheet is only listed once in this view, regardless of how many times it may appear in a page. Each style sheet link displays its HTTP status, file name, absolute URL, load time in seconds, and size in both bytes and kilobytes.

Each style sheet link is cross-referenced with the available scan results to show its current HTTP status (e.g. OK, 404 Not Found, 500) and is color-coded to highlight errors and warnings. As a scan runs and more scan results become available, the appropriate status will be shown against each link when refreshed.

By default, the HTTP status for each style sheet is only refreshed whenever its page is sel
13. [Screenshot, continued: Out Links panel rows 11 to 15, all with status 200, including "SiteScan Download Softpedia" and three screenshot image links. Panel caption: Links found on the selected page which link out to other pages. Click to highlight link in the Source code. Shift+Click URL to navigate. Double-click row to find in results.]

Navigating to an Out Link in Results

To jump to the corresponding result in the main Results Grid for any link:

Double-click the link's row.

Previewing an Out Link Result in the Browser

To open a link in the browser:

Hold the Shift key and click the URL in the Page URL column.

Highlighting Out Links in the HTML Source

Links can be highlighted in the Source code preview tab:

Click any link row to select it.
Click the Source tab to view the source.

[Screenshot: Out Links panel tabs and column headings]
14. [Screenshot, continued: Images panel rows. Panel caption: Images found on the selected page. Click to highlight an image in the Source code tab.]

A number will be appended to the Source tab caption to indicate how many occurrences of the selected image were found. Each image found is highlighted in yellow.

[Screenshot: Page Analysis for the SiteScan editions page, Source tab showing HTML table markup (rows comparing the Free and Registered editions) with the selected image highlighted]

The HTML source code for the selecte
15. ected in the main Result Grid. See the Refreshing section below for information on refreshing style sheet status.

[Screenshot: Styles panel listing two style sheets with columns Id, Status, File Name, File URL, Load Time (s), Size (Kb) and Size; both rows show status 200]

Navigating to a StyleSheet in Results

To jump to the corresponding result in the main Results Grid for any style sheet:

Double-click the style sheet row.

Previewing a StyleSheet in the Browser

To open the style sheet page in the browser:

Hold the Shift key and click the URL in the File URL column.

Refreshing

The list of style sheets can be refreshed from the page on the web server by either:

Double-clicking the related page row in the main Results Grid.
Clicking the green Refresh icon to the right of the tab headings.

Exporting

The list of style sheets can be exported to Excel in XLSX format. Click the XLSX icon to the right of the tab headings.

Scripts Panel

The Scripts panel lists all JavaScript type links found on the page currently selected in the main Results Grid.

Overview

The list of script files displayed is obtained at the point the page was crawled and takes in to account any spider rules specified in Options, such as scheme or
16. email address

Performance Measures

Option, Description, Note

Option: Measure server response time
Description: Measure the time it takes for the web server to acknowledge the request for a page or file and return a response, e.g. OK or 404 Not Found.

Option: Measure HTML document load time
Description: Measure the time it takes to obtain and load only the HTML <body> content of each web page. All external resources such as images, stylesheets and scripts are ignored.
Note: Applies to web pages only. All other file types are automatically ignored. This is a good measure of page performance on the server, particularly for dynamic pages such as ASPX, PHP etc.

Export

Saving and Exporting Options

To display Saving and Exporting options, click the Options button on the toolbar ribbon, then click the Export button on the Options pop-up window.

Overview

Saving and Exporting options provide control over how results are saved and exported, and allow convenience actions to be triggered, such as automatically opening results after export.

[Screenshot: Saving and Exporting page. Saving Results: Save referrer data when saving results. Exporting Results: Include hyperlinks when exporting to Excel; Open file after export]

Option, Description, Note

Option: Save referrer data when saving results
Description: Select this option to also save referrer data whenever scan results are saved. Saving referrer data with resul
17. esh directly from the web server by either:

Double-clicking the related row in the main Results Grid.
Clicking the green Refresh icon to the right of the tab headings.

Exporting

Properties can be exported to Excel in XLSX format. Click the XLSX icon to the right of the tab headings.

Out Links Panel

The Out Links panel displays all internal or external hyperlinks to other HTML pages or files from the page currently selected in the main crawl Results Grid. Out Links are loaded and displayed whenever a page is selected in the results grid.

Overview

The links displayed in the Out Links panel are obtained at the point the page was crawled and take in to account any spider rules specified in Options, such as scheme or file type filtering. Therefore the Out Links listed may only be a subset of all links on the web page.

Out Links are only listed once in this view, regardless of how many times they may appear on a page. Each Out Link displays the HTML Title of the page to which the link points, the Link Text used in the anchor tag, and the absolute URL of the link. The Relative URL can also be shown by right-clicking on the grid column headings.

Out Links are cross-referenced with available scan results to show their current HTTP status (e.g. OK, 404 Not Found, 500) and are color-coded to highlight errors and warnings. As a scan runs and more scan results beco
18. esults

Navigating to a Referrer in Results

To jump to the corresponding result in the main Results Grid for any referrer:

Double-click the referrer row.

Previewing a Referrer in the Browser

To open the referring page in the browser:

Hold the Shift key and click the URL in the Referring Page URL column.

Refreshing

The list of In Links can be refreshed from the current crawl results by either:

Double-clicking the related row in the main Results Grid.
Clicking the green Refresh icon to the right of the tab headings.

Exporting

In Links can be exported to Excel in XLSX format. Click the XLSX icon to the right of the tab headings.

Images Panel

The Images panel lists images declared using <img src> on the page currently selected in the main Results Grid. Images are displayed whenever a page is selected in the results grid.

Overview

Images are only listed once in this view, regardless of how many times they may appear on a page. The list of images displays the ALT text for the first occurrence of the image found on the page, along with the Absolute URL to the image file. The Relative URL can also be shown by right-clicking on the grid column headings.

[Screenshot: Images panel listing 16 images, all with status 200, with an Alt Text column beginning "ByteShift Logo"]
19. file type filtering. Therefore those listed may only be a subset of all scripts on the web page.

Each script is only listed once in this view, regardless of how many times it may appear in a page. Each script link displays its HTTP status, file name, absolute URL, load time in seconds, and size in both bytes and kilobytes.

Each script link is cross-referenced with the available scan results to show its current HTTP status (e.g. OK, 404 Not Found, 500) and is color-coded to highlight errors and warnings. As a scan runs and more scan results become available, the appropriate status will be shown against each file when refreshed.

By default, the HTTP status for each script link is only refreshed whenever its page is selected in the main Result Grid. See the Refreshing section below for information on refreshing script link status.

[Screenshot: Scripts panel listing three script files with columns Id, Status, File Name, File URL, Load Time (s), Size (Kb) and Size; all rows show status 200]

Navigating to a Script File in Results

To jump to the corresponding result in the main Results Grid for any script link:

Double-click the script link ro
20. he Page Title, Link Text and absolute URL of all source pages found during scanning which contain a link to the selected page or file.

In Links are subject to any scan depth and filtering rules specified in Options. For example, if a scan is performed on the URL www.mysite.com/products with the scan depth filter set to only extract and follow links "On the same level or beneath the Starting URL", then only referring pages at or below the URL www.mysite.com/products will be reported.

By default, the In Links for each page or file are only refreshed whenever the page is selected in the main Result Grid. See the Refreshing section below for information on refreshing In Link data.

[Screenshot: In Links panel with columns Id, Page Title, Link Text and Referring Page URL, listing pages on the ByteShift website that link to the selected page]

Other pages on the site which link in (refer) to the selected page or file. Shift+Click URL to navigate. Double-click row to find in r
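The scan depth rule in the www.mysite.com/products example can be sketched as a simple URL test. This is a minimal illustration under stated assumptions, not SiteScan's actual algorithm: the function names are hypothetical, URLs are assumed to carry a scheme such as http://, and a starting URL whose last path segment contains a dot is assumed to name a file whose containing folder defines the level.

```python
from urllib.parse import urlsplit


def _base_folder(path: str) -> str:
    """Folder that defines the 'level' of a starting URL path (assumption)."""
    if path.endswith("/"):
        return path
    head, _, last = path.rpartition("/")
    if "." in last:          # looks like a file name, so use its folder
        return head + "/"
    return path + "/"


def is_at_or_beneath(start_url: str, candidate_url: str) -> bool:
    """True if candidate_url is on the same host and at or below start_url."""
    start, cand = urlsplit(start_url), urlsplit(candidate_url)
    if start.hostname != cand.hostname:
        return False
    base = _base_folder(start.path)
    # Match pages inside the folder, or the folder itself without a slash.
    return cand.path.startswith(base) or cand.path + "/" == base


start = "http://www.mysite.com/products"
print(is_at_or_beneath(start, "http://www.mysite.com/products/widgets/a.html"))
print(is_at_or_beneath(start, "http://www.mysite.com/about.aspx"))
```

Appending a trailing slash before comparing is a deliberate choice: it stops a sibling folder such as /productsold from being mistaken for something beneath /products.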
21. [Screenshot, continued: Source tab showing the page header navigation markup (products, services, about and customers links) with the selected link highlighted. Caption: The HTML source code for the selected page.]

Refreshing

Out Links can be refreshed for the current page directly from the web server by either:

Double-clicking the related row in the main Results Grid.
Clicking the green Refresh icon to the right of the tab headings.

Exporting

Out Links can be exported to Excel in XLSX format. Click the XLSX icon to the right of the tab headings.

In Links Panel

The In Links panel lists all pages with links pointing to the page or file currently selected in the main Results Grid.

Overview

The list of In Links displays t
22. User Interface

User Interface: Tool Ribbon, Results Grid, Properties Panel, Out Links Panel, In Links Panel, Images Panel, Styles Panel, Scripts Panel, Emails Panel, Source Panel

Tool Ribbon

Quick Reference

The following quick reference provides an overview of each menu and icon on the SiteScan tool ribbon. Hover over any menu item within the application to see this information in a tool tip.

File Menu

[Screenshot: SiteScan 2.2 Registered Edition tool ribbon with the File menu open]

Save: Save current results to file.
Options: Display the Options dialog.
Exit: Exit SiteScan (prompts to Save if required).

Home Ribbon

The Home ribbon provides access to the most frequently used features, including starting and stopping scans, filtering data and manipulating results in the grid.

[Screenshot: Home ribbon with the sections Settings, Scan, Results, Filter, Grid Layout and Help]

Options, Start, Load, Clear Filters, Column, Buy Now: Display the Options ... Start a new scan ... Load existing scan
23. me available, the appropriate status will be shown against each link.

By default, the HTTP status for each Out Link is only refreshed whenever the page is selected in the main Result Grid. See the Refreshing section below for information on refreshing status.

[Screenshot: Out Links panel with columns Id, Status, Page Title, Link Text and Page URL, listing links from the selected ByteShift page such as products, services, about, customers and contact, all with status 200]
24. multiple column headers to the Group By row. To remove a grouping, drag the column header out of the Group By row.

Saving

Results are saved by clicking the Save icon on the Results section of the tool ribbon. Results are saved in XML format. Two files are created:

A file with a .xml extension contains all scan results (filtering is ignored when saving).
A file with a .xmlr extension contains all referrer data (all pages found which point to links in the results).

When backing up, copying or moving results, both files should be moved and kept together.

Note: The main .xml results can be loaded without referrer data, but any referrers will not be displayed in the In Links panel.

Loading

Results are loaded by clicking the Load icon on the Results section of the tool ribbon. When loading, SiteScan will first load the main .xml results file, then any referrer data from the .xmlr file if found.

Exporting

Results can be exported for external analysis in three formats:

Excel (xlsx) format
Excel (xls) format
Comma-separated (csv) format

The csv format is fastest, as cell formatting is removed.

Properties Panel

The Properties panel displays meta data and performance measurements for the page currently selected in the main crawl Results Grid. Properties are refreshed, loaded and displayed whenever a page is selected in the results grid.

Overview

The properties displayed in Propertie
25. n upgrades are performed on this screen.

[Screenshot: Licensing & Registration page. Licence Activation: enter or paste the licence key and click Activate. Licence Status: Activated: Yes; Activation Date: 22 June 2015; SiteScan Edition: Registered Edition; SiteScan Version: 2.2.0.0; Page Limit: Unlimited]

Activating or Upgrading SiteScan

Activation and upgrade follow the same process.

[Screenshot: Licensing & Registration page before activation, with a sample key 12345 12345 12345 12345 12345 entered. Activated: No; Activation Date: n/a; SiteScan Edition: Enterprise Edition Trial (3 days remaining); SiteScan Version: 2.0.0.46; Page Limit: 50]

1. Copy the license key from your order confirmation email to the Windows clipboard.
2. Paste the key in to the text box adjacent to the Activate button.
3. Click the Activate button.

The license key will then be validated and, if successful, a message similar to the following will be displayed:

Professional edition activation was successful. Thank you for choosing SiteScan.

Current activation status and edition information can be viewed at any time.

[Screenshot: Licensing & Registration page]
26. ooking in the content type header for keys such as application, binary and audio.
Images: Filters the results to display only images linked to by <A href> style markup.

Error Filters

The Error Filter menu provides predefined filters to allow quick viewing of results with specific HTTP Status Codes or warning messages.

[Screenshot: Error Filter menu with the entries Clear Error Filter, Show All Errors and Warnings, Client Errors (4xx, 8xx), Server Errors, SiteScan Warnings and Custom Filter]

Clear Error Filter: Clears any applied filters and displays all results.
Show All Errors and Warnings: Filters the results to show only those with Client Errors, Server Errors and SiteScan warnings.
Client Errors: Filters the results to show only Client Errors, beginning with the 400 or 800 HTTP status code.
Server Errors: Filters the results to show only Server Errors, beginning with the 500 HTTP status code.
SiteScan Warnings: Filters the results to show only warning messages suggested by SiteScan, such as slow loading pages or possible mistyped URLs.
Custom Filter: Displays the filter creation pop-up window to allow creation of user-defined complex filter expressions on multiple columns.

Source Ribbon

The Source ribbon works exclusively with the Source Code preview panel and provides access to edit, format, save and print source code.
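The predefined error filters described in the Error Filters section can be thought of as a classification of each result's status code. The sketch below is illustrative only: the names `classify_status` and `filter_results` and the sample data are hypothetical, "beginning with the 400 or 800 HTTP status code" is read as the ranges 400 to 499 and 800 to 899, and SiteScan warnings are not modeled.

```python
def classify_status(status: int) -> str:
    """Map a status code to an error category (ranges are an assumption)."""
    if 400 <= status <= 499 or 800 <= status <= 899:
        return "client_error"
    if 500 <= status <= 599:
        return "server_error"
    return "no_error"


def filter_results(results: list[dict], category: str) -> list[dict]:
    """Keep results in one category, or in every error category for 'all'."""
    if category == "all":
        wanted = {"client_error", "server_error"}
    else:
        wanted = {category}
    return [r for r in results if classify_status(r["status"]) in wanted]


# Made-up scan results for illustration.
sample = [
    {"url": "http://www.mysite.com/", "status": 200},
    {"url": "http://www.mysite.com/missing.html", "status": 404},
    {"url": "http://www.mysite.com/report.aspx", "status": 500},
]

for row in filter_results(sample, "all"):
    print(row["status"], row["url"])
```

Passing "client_error" or "server_error" instead of "all" narrows the list in the same way the individual menu entries do.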
27. or non business use.
SiteScan: Website Analysis
Copyright 2006-2015 ByteShift Ltd, www.byteshift.co.uk
Key Features
   Spider websites for slow or broken links
   Review page titles, description and alt text
   Find mistyped or badly formatted URLs
   Set timeouts to highlight slow-loading pages
   Check any type of web page, file or media
   Save, load & refresh existing results
   Spider all hyperlinks or only first occurrence
   View all in-links and out-links for any page
   Control scan depth, page & link count limits
   Highlight results with conditional formatting
   Scan http, https, file, mailto & javascript schemes
   Work with results in real time during scanning
   Server & client-side redirect support
   Sort, group, search & filter results like Excel
   Spider intranet file-linked content
   One-click filtering for all errors & warnings
   Measure response, download and HTML load time
   Highlight URL and image markup in HTML source
   Set spider speed, pause and resume
   Export spider results to XLSX, XLS, CSV
For more information visit www.byteshift.co.uk or email support@byteshift.co.uk
Table of Contents
   Welcome: Overview, Table of Contents
   User Interface: Tool Ribbon, Results Grid, Properties Panel, Out-Links Panel, In-Links Panel, Images Panel, Styles Panel, Scripts Panel, Emails Panel, Source Panel
   Options: Scanning, Export, Licensing
   S
28. [Screenshot: Out-Links panel rows listing the status, page title, link text and URL of links found on the selected page. Caption: Links found on the selected page which link out to other pages. Click to highlight link in the Source code. Shift+Click URL to navigate. Double-click row to find in results.]
A number will be appended to the Source tab caption to indicate how many occurrences of the selected link were found. Each occurrence of the link found is highlighted in yellow.
[Screenshot: Source tab showing numbered lines of the page's HTML source, with the selected link highlighted.]
29. rget Link.
[Screenshot: column headers grouped under Source Page and Target Link, including ID, Source Page URL and Link Relative URL.]
Columns which appear in the Source Page group contain information and links pertaining to the source (or referring) page on which the Target Links were found. Clicking a URL in the Source Page group will load the referring page into the preview Browser, then load its properties, links, images, etc. for examination.
Columns which appear in the Target Links group contain information about the links found ON the source (or referring) page. Clicking a URL in the Target Links group will load the page that link points to (the target) into the preview Browser, then load its properties, links, images, etc. for examination.
This distinction is important. For example, when you wish to find and highlight a broken link in the Browser or Source Code, you must load the Source Page, as this is the page on which the broken link can be found, then choose the link to highlight from the Links panel. For more information on highlighting links see Links Panel or Browser Preview.
Choosing Columns
Several columns are available to be shown on the grid, each displaying different information about the page or file which was scanned. The most frequently used columns are shown by default; other columns can be added via the Columns button in the Grid Layout section of the toolbar ribbon.
To add columns:
   1. Click the Columns button on the toolbar ribbon to display the list of available columns.
   2.
30. rid for any email address: Double-click the email link row.
Refreshing
The list of email addresses can be refreshed from the page on the web server by either:
   Double-clicking the related page row in the main Results Grid.
   Clicking the green Refresh icon to the right of the tab headings.
Exporting
The list of email addresses can be exported to Excel in XLSX format: click the XLSX icon to the right of the tab headings.
Source Panel
Source Preview
The Source preview is used to display the source code from the page currently selected for analysis in the main Results Grid. It provides highlighting of links as well as editing, formatting and printing of source code. The Source toolbar ribbon provides features for working with source code.
Overview
The Source preview is a rich text editor display with support for line numbering, text highlighting, formatting and saving. Its contents are synchronized with the page selected in the Results Grid, as well as with Properties, Out-Links, Images, In-Links and so on. Its primary function is to allow highlighting of erroneous links via selection from the Links or Images panels.
[Screenshot: Source preview showing the numbered HTML source of the selected page.]
31. [Screenshot: Results Grid rows for links found on https://uk.yahoo.com, showing the link text, target URL, response feedback (OK), response time in seconds, content type (text/html) and size for each link.]
Hover over any column heading for a detailed description of the information the column contains.
[Screenshot: tooltip for the Response Feedback column in the Target Link group: "The HTTP response code returned from the web server when the link was requested."]
Column Groups (important)
All columns which can be displayed in the results grid belong to one of two groups, also referred to as bands. These are Source Page or Ta
32. [Screenshot: Out-Links panel listing, for each link found on the selected page, its Id, status (200), page title, link text and URL.]
33. s panel and the columns shown on the Results Grid are the same. When a row is selected in the Results Grid, the data obtained during spidering is displayed for the selected result. Properties are grouped into categories, with each displaying related information. Values are read-only but can be selected and copied to the Windows clipboard.
By default, the properties for each link are only refreshed whenever the page is selected in the main Results Grid. See the Refreshing section below for information on refreshing properties.
[Screenshot: Properties panel showing fields such as Title, Description, Url, Relative Url, Response Status (200), Response Time (0.48 secs), Body Load Time (0.41 secs), Full Load Time, Response Feedback (200 OK), Last Modified, Base Uri, Original Url and Web Server (Microsoft IIS 8.0). The description of the selected property is shown at the bottom: "Response Status: The response status code returned from the server when the page was requested."]
Refreshing
Properties can be refr
34. [Screenshot: Source preview showing the numbered HTML source (head section with meta and link tags) of the selected page. Caption: The HTML source code for the selected page.]
Loading Source
Source code is loaded automatically when a page is selected for analysis in the main Results Grid. SiteScan currently supports previewing source code for the following content types:
   HTML, XML and Text
   Style sheets, e.g. CSS
   Script files e
35. that level or below will be scanned.
On the same sub domain as the Starting URL
If the starting URL is www.mysite.com/products then:
   1. All links found at that level or below will be scanned.
   2. All links found adjacent to or above that level will also be scanned.
For example, the following links would also be scanned:
   www.mysite.com/default.html
   www.mysite.com/services
On the same sub domain as the Starting URL or any sub domain
If the starting URL is www.mysite.com/products then:
   1. All links found at that level or below will be scanned.
   2. All links found adjacent to or above that level will also be scanned.
   3. All links found on any sub domain of mysite.com will be scanned.
For example, the following links would also be scanned:
   www.mysite.com/default.html
   www.mysite.com/services
   products.mysite.com/catalogue
Skip pages or files with the following extensions
Pages or files with extensions listed here will not be scanned to check if they are responsive, or inspected for additional links to follow. Enter a comma-separated list of file extensions, such as: aspx, exe, zip, mp4
Include the following link types in results
Select any non-navigation link types to also display in scan results. Non-navigable links cannot be checked for a server response. They will be listed in the PageURL column to allow further analysis, such as filtering by a specific
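The three scan depth options and the extension skip list can be read as simple URL tests. The following is a minimal Python sketch under stated assumptions, not SiteScan's actual logic: all names are hypothetical, "at that level or below" is interpreted as a path-prefix match, and the parent domain is derived by stripping a leading "www." from the starting host, which is a simplification.

```python
import posixpath
from urllib.parse import urlsplit

# Hypothetical labels for the three depth options described in the guide.
SAME_LEVEL_OR_BELOW = 1
SAME_SUBDOMAIN = 2
ANY_SUBDOMAIN = 3


def _host_and_path(url):
    # urlsplit only recognises the host when a scheme is present.
    if "://" not in url:
        url = "http://" + url
    parts = urlsplit(url)
    return (parts.hostname or ""), (parts.path or "/")


def in_scope(link, start, depth):
    """Return True if `link` would be scanned for the given depth option."""
    link_host, link_path = _host_and_path(link)
    start_host, start_path = _host_and_path(start)
    if depth == ANY_SUBDOMAIN:
        # Assumption: the parent domain is the start host without "www.".
        domain = start_host[4:] if start_host.startswith("www.") else start_host
        return link_host == domain or link_host.endswith("." + domain)
    if link_host != start_host:
        return False
    if depth == SAME_SUBDOMAIN:
        return True
    # SAME_LEVEL_OR_BELOW: the link must be the start path or sit beneath it.
    base = start_path.rstrip("/")
    return link_path == base or link_path.startswith(base + "/")


def parse_extensions(text):
    """Turn 'aspx, exe, zip, mp4' into a set of lower-case extensions."""
    return {e.strip().lstrip(".").lower() for e in text.split(",") if e.strip()}


def is_skipped(url, skipped):
    """Return True if the URL's file extension is in the skip list."""
    _, path = _host_and_path(url)
    ext = posixpath.splitext(path)[1].lstrip(".").lower()
    return ext in skipped
```

With the guide's example starting URL, `in_scope("products.mysite.com/catalogue", "www.mysite.com/products", ANY_SUBDOMAIN)` is true, while the same link is out of scope for the other two options.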
36. timeout: The length of time, in seconds, to wait for a response from the web server before skipping.
Body load timeout: The duration, in seconds, to wait for the HTML content of a web page to be fully received from the web server before skipping.
Maximum automatic redirections to follow: The maximum number of automatic server redirections that will be followed before skipping the requested page or file.
Use default proxy settings and user credentials: Select this option to use:
   1. Any proxy settings as configured in Internet Options via Control Panel.
   2. The currently logged-on user's network credentials for website authentication, if requested by the site.
Note: The recommended setting for this [...] When checked, performance can be reduced if proxy settings are miss [...] Control Panel Internet Options.
Scan Depth and Filtering
Option / Description
Only scan pages or files on these schemes: Select the scheme (sometimes referred to as protocol) types you wish to allow when spidering pages and following links.
Only extract and follow links from pages found: Select one of the three options to set the depth to which SiteScan will scan. The depth increases with each option and will result in longer scans and larger result sets. An explanation and an example of each option follows.
On the same level or beneath the Starting URL
If the starting URL is www.mysite.com/products then:
   1. Only links found at
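Two of the options above, the allowed schemes and the redirection limit, can be illustrated with a short Python sketch. It is not how SiteScan is implemented: the function names are hypothetical, and redirects are modelled as an in-memory mapping instead of real HTTP responses so the example stays self-contained.

```python
from urllib.parse import urlsplit


def scheme_allowed(url, allowed_schemes):
    """Return True if the absolute URL uses one of the allowed schemes."""
    return urlsplit(url).scheme.lower() in allowed_schemes


def follow_redirects(url, redirects, max_redirects):
    """Follow server redirections up to a limit.

    `redirects` maps a URL to the URL it redirects to (a stand-in for
    real responses). Returns (final_url, hops), or (None, hops) when the
    limit is reached and the page would be skipped.
    """
    hops = 0
    while url in redirects:
        if hops == max_redirects:
            return None, hops
        url = redirects[url]
        hops += 1
    return url, hops
```

A hop counter also protects against redirect loops: two pages that redirect to each other simply exhaust the limit and are skipped.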
37. ts allows viewing of all In-Links found for any page or file when results are later loaded. When selected, loading and saving of results will take longer and produce larger files, particularly on large websites. The referrer data file will be saved alongside the main results file and have an xmlr file extension. These files should be kept together if sharing results or moving them between computers.
Include hyperlinks when exporting to Excel: Select this option to create functioning hyperlinks when exporting results to Excel. De-select this option to export hyperlinks as plain text (faster).
Note: Excel can sometimes become un [...] attempting to create functioning h [...] result sets which contain restricte [...] If you encounter this problem de [...] choose CSV as the export format.
Open file after export: Select this option to launch the newly exported file in Excel (or other native application) when the export has completed.
Licensing
Licensing & Registration
To display the Licensing and Registration screen, either:
   Click the Options button on the toolbar ribbon, then click the Licensing button on the Options pop-up window.
   Click the Enter License Key option from the Buy Now button on the toolbar ribbon.
Overview
The Licensing and Registration options screen provides information on the current SiteScan edition and license status. Initial activation and editio
38. [Screenshot: Source ribbon with the buttons Undo, Redo, Print, Save As, Copy, Find, Replace, font color, highlight color and Clear Formatting.]
   Undo: Undo the last text or formatting change to the source code.
   Redo: Redo the last text or formatting change to the source code.
   Print: Print or Print Preview the source code with formatting.
   Save As: Save the source code to file.
   Copy: Copy any selected text to the Windows clipboard.
   Find: Find and highlight text in the source code.
   Replace: Find and replace text in the source code.
   Font color icon: Change selected text font color.
   Highlight color icon: Change selected text highlight color.
   Clear Formatting: Clear formatting from selected text.
Results Grid
The Results Grid is used to display and interact with scan results. Results are displayed in real time as a scan runs, or existing results can be loaded from file for further analysis or refresh.
Overview
Results are displayed in an Excel-like grid with full sorting, filtering and grouping capabilities. The most common features are described below, with many additional features accessible by right-clicking on various places within the grid, such as column headers, cells or group headers.
[Screenshot: Results Grid with the Source Page and Target Link column groups, showing ID, Source Page URL, Link Text, Link Relative URL, Response Feedback, Response Time, ContentType and Size (Kb).]
39. w
Previewing a Script File in the Browser
To open the script file in the browser or other native associated application: hold the Shift key and click the URL in the File URL column.
Refreshing
The list of script links can be refreshed from the page on the web server by either:
   Double-clicking the related page row in the main Results Grid.
   Clicking the green Refresh icon to the right of the tab headings.
Exporting
The list of scripts can be exported to Excel in XLSX format: click the XLSX icon to the right of the tab headings.
Emails Panel
The Emails panel lists all email addresses linked using the mailto: scheme prefix found on the page currently selected in the main Results Grid.
Overview
The list of email addresses is obtained at the point the page was crawled. Each email address is only listed once in this view, regardless of how many times it may appear in a page. Each email link displays a status (OK, or a SiteScan 9xx warning where it appears to be mistyped), the link text, and the email link as specified in the href attribute of the anchor tag in which it was found.
[Screenshot: Emails panel]
   Id   Status   Link Text                   Email Link
   1    OK       inspired@byteshift.co.uk    mailto:inspired@byteshift.co.uk
   2    OK       email                       mailto:support@byteshift.co.uk
Navigating to an Email Address in Results
To jump to the corresponding result in the main Results G
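How such a list could be built from a page is sketched below in Python, using only the standard library. It is an illustration rather than SiteScan's implementation: the class and function names are hypothetical, the "looks mistyped" test is a deliberately simple pattern, and the plain "WARNING" label stands in for the SiteScan 9xx code, whose exact value the guide does not give.

```python
import re
from html.parser import HTMLParser

# Deliberately simple shape check: something@host.tld, no spaces.
SIMPLE_EMAIL = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s.]+$")


class MailtoCollector(HTMLParser):
    """Collect each distinct mailto: link with its status and link text."""

    def __init__(self):
        super().__init__()
        self.rows = []          # one dict per distinct email link
        self._seen = set()      # lower-cased hrefs already listed
        self._current = None    # row whose link text is being collected

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        if not href.lower().startswith("mailto:"):
            return
        if href.lower() in self._seen:
            return  # each address is listed only once
        self._seen.add(href.lower())
        address = href[len("mailto:"):].split("?", 1)[0]
        status = "OK" if SIMPLE_EMAIL.match(address) else "WARNING"
        self._current = {"status": status, "link_text": "", "email_link": href}
        self.rows.append(self._current)

    def handle_data(self, data):
        if self._current is not None:
            self._current["link_text"] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._current = None


def collect_emails(html):
    """Return the list of email link rows found in an HTML string."""
    collector = MailtoCollector()
    collector.feed(html)
    return collector.rows
```

Calling `collect_emails(page_source)` returns one row per distinct mailto: link, in the order the links first appear on the page.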
40. [Screenshot: Images panel listing the URLs of images found on the selected page. Caption: Images found on the selected page. Click to highlight image in the Source code. Shift+Click URL to navigate. Double-click row to find in results.]
Previewing an Image in the Browser
To open an image in the browser or native associated application: hold the Shift key and click the URL in the Image URL column.
Highlighting Images in the HTML Source
Images can be highlighted in the Source code preview tab:
   Click any image row to select it.
   Click the Source tab to view the source.
[Screenshot: Page Analysis view with the Images tab selected, listing the Id, Alt Text and Image URL of each image, for example "ByteShift Logo", "email", "google" and "facebook".]
