Home

our final report

image

Contents

1. ylail gil yludenL jp3 yludenUqgwing_ gil syllabus qil syllabus_sel gil lite gil Macintosh HD Users jarredwilson Documents School Graduate School UT University 22 Attiiand E prot Lille 2 gil lille_shaitgil tawe jpg uigieygil Laiange gil TypelBall qil us n_laga_117 gil ula jp3 ula_healiangil ula_small jpg web gil web_iequesls jpg index php mlormalion alumni php cunentstudents php lacully_stall php laculty_la_laq php index php lacal ine plaspective_students php purchasing php thitd_year_ieview php title gil We Wantinlaimatian gil WeWantinlaimatian wae isehaal c s ischaaless bkp iSchaalNews leed xml iSchaalHews leed xs jobs index html job web cap icha audia_capicha php fonts COPYRIGHT TAT lacal canl README TXT RELEASENOTES TXT Vera til Vera Bd Vera ll Veralt ul Vera Mo Bl tl Vera Malt ease Bd Php_capicha ine Listing of iSchool_Web Page 17 lest php lest php visual_capicha php visual_capicha php CJab ine class_mail php CRegislislian inc CSeaic hCiiletis inc CUser ine database DBCieateSciipl Lx L DBLaad ResetScripl csh 5 es DBS3uiceCilies Lx l 5 ceCa u nl ies LL DBSaurcelabs 1 1 1 1 DBSaurceSau
2. 2014pBigit i 20 prot palmquist_tull jpg palmquist_thumb jpg pavelka_lull jipg pavelka_thumb jpg pele san_thumb jpg pallack_thumb jpg Randalph Bias jpg tice live ty_lullipg ke lke saiah_cunningham jpg saiah_cunningham_thumb jpg seaga_thumb jpg s helan _lull jpg shekan_thumb jpg s lukenbilLaiiginal jpg s lukenbil thu mb gil sparks_lull ipg s parks_thumb jpg slamp adiianjpg bias jpg blukenbill jpg caneanjpg lt lt ipg cavinglan jpg cunningham kiupps jpg davis jpg dillan jpg daly jpg daty_thumb jpg gallaway gil giacy ipa hallma k ipg haiman jpg 2002 jpg jpg awens ipg pavelkajpg pallack jpg tice lively jpg yeaga jpa slukenbillail s paiki ipg tui nbull jpg updegiave jpg waa his jpg williams Listing of iSchool_Web Page 9 wyllys sludent jpg turnbullnews pg turnbull thumb jpg updegiave_lull jpg updegiave_news jpg updegiave_thumb jpg vaathis_thumb jpg weslbiaak jpg weslbiaak_thumb jpg whazil_lulljpg williams_thumb jpg weylhys_thumb jpg zamaia_thumb jpg FAQ gil girl ipg gnu hesd liny ipg giading gil giading_sel gil giaphics gil home lemp_dance jpg hame gil hame2 gil hame2_sel gil hame_abautagil hame_abauLsel gil hame_admissians gil hame_admissians_sel gil hame_bay jpg
3. 2 Attend E H prot Lei pg ald_caree i_yervices jpg OLDcareeisenvices jpg OLDtitle gil apen_hause_2012 jpg arange_dal gil atangeigil page_spice ammie_giace jpg bendy_git IDD1 jpg bendy_git IDD2 jpg bendy_git IDD3 jpg gi 1004 bendy_gi DDS ipg bendy_git IDD6 jpg bendy_guyDD1 jpg bendy_guyDD2 jpg bendy guyDD3 iyg bendy 004 bendy guyDD5 ipg chik_y_laplap jpg hilp_s lane jpg imm alh_cla ss jpg lab_ha megi lj pg lea ping_gi Lipa atiginaLhamegitl jpg ran ammie_blui jpg ammie_gixeDl jpg ammie_giace D2 jpg ammie_giace Dicul jpg ammit_giacel4 jpg gi ki ipg immiath_classD1l jpg shane_listO1 jpg shane_listO2 jpg teslaralian_ghyves jpg pes image gil ipg aaian_ulmei_thumb jpg abby_gaadium jpg abby_gaadium_thumb jpg ambeishahjpg ambei_shah_thumb jpg andiew_dillanjpg andiew_dillan_thumb jpg andiew_whinslan jpg andiew_whinstan_thumb jpg ann_minnet jpg ann minne iaw jpg Listing of iSchool_Web Page 10 ann minnei_thumb jpg ann seaga jpg ann seaga thumb jpg apilsmithjpg apiilsmith_thumb jpg ba ba a_imm a Lh jpg ba baia_immiath_thumb jpg baibara_jansenjpg bai bsia_jansen_thumb jpg bellie_meginness jpg beltie_meginness_thumb jpg bila nde san ipg san_thumb jpg bilLlukenbill jpg bilLlukenbill_thumb jpg _ ba_xie_thumb j
4. 00006 en_US use la 00007 inc php helpTac inc php en_US editorial index intro jour publishing sile submission user help did help x ml lac did lapicdid includes diiver ine php lunclian inc php index php js general js lib adodb adadb csvlib inc php adadb datadicline php adadb el d inc php adadb elidi handle inc php adadb e a pea inc php adadb exceplians inc php adadb ile atai inc php adadb lib inc php adad b pager inc php adad b inc php adad b peil inc php adadb phpd inc php adadb lime inc php adadb xmlsche ma inc php adadb inc php daladict drivers Macintosh HD Users jarredwilson Documents School Graduate School UT University of TexaWeonra 221 20 taihnd E prot Listing of iSchool_Web Page 21 perl p vatlable inc php read me 1 1 sliter inc php taex pail inc php tahtml inc php xmlschema did libraries smarly locale cache 5 ml impattex pait Nativel pailPlugin en_US ml impaitex pait Sam pel partPlugin en_US xml impaitex part Userlmp partPlugin en_US xml lacales inc php en_US images lacale xml lacale did pages about A boulHandlel inc php index php admin AdminFunctiansHandkt inc php AdminHandler inc php AdminjauinalHandles inc php AdminLanguages Handkt im php did min
5. 1 502 1308 57482132 74122 7 lt 4 14 1 54 2 5 878354924 915 3 1 5064 965 035 5723 495 SebbSecBh7SeS ics 1 SDG8 ficaSeS 31D2cS BDbS7dbBBaS147b ics 1 508 5340151074847 ab75ddc 5 7eB5765 ics 5 A28 287815cd17583 cc ll4482470daD ic 5 468 S2cadicDbBBdS S34cdBcBllaGD26 ics 522 503 7eaSDDbS27 Sd3daaS4D73d36l ics calendai_list gelLcalendats csh geLcalendats csh gelLcalendatis lag calendars 1 204 8504085491281 10 lt 1185 6 26 1 1 22 4 44845 3 319 7e2S54chbe D7 bd Bs ics 1 2104 SabeSal2 2961 bD33 Gd 607 BI bib ics 1 2108 BB2120724656 17dS0Sddel Sa7l2 ics 1 2100 2821 61165 8343 3 451164 5 1 212 1374c323b5D87 dBsabbede b4187 ic 1 502 1308c57482132 8se7412207cdeld ic 1 504 2 5 878354924 915 3 1 5064 965 035 85723 495 SebbSecB7 Se ics 1 S068 ficaSeS3102c5 BDbS7dbBBaS147b ics 1 50 68341451074847 7544 57 5785 5 428 2678156417583 2 61144824704 0 5 5 468 S2cadicD bBB48 534cd8 8Bl1s8026 ics 5 522 5017ea510Db527 5434ss54073438l i calendar lisi gelcalendais csh geLcalendais ag publish php publish_log tet upcaming_events inc canlig inc dist php canlig inc php 44 COPYING day php delaullcanlig php enai php Tunc Lio ns admin_lunctians php calendai_lunctians php dale_lunctians php d ise lunclians php evenLi ical pasei php ical pasei phpa
6. edilatialPalicies tpl edila ialTesm 1 jaunalSpansaiship tpl site tpl sileMap tpl submiss ians tpl subsciiplians tpl admin impa lO1 1 index jauinals tpl jauinalSettings tpl languages sellings Lpl syslemCanlig tpl syslemCanligUpdaled sysleminia tpl article aili ke tpl intars Litial tpl pdlinters view tpl author Listing of iSchool_Web Page 22 active campkted ipl index tpl navsidebar 1 submission submiss ian tpl s ubmiss ianEdiling tpl submiss ianReview tpl submit comment camment ipl camments tpl common ear tpl header tpl mess s ideba tpl copyedilor actie lpl campkted tpl index tpl 1 submission submiss ian tpl edilor index tpl issues navsidebar 1 nalilyUsers tpl natilyUsersEmail tpl schedulingQueue tpl yeleciSecliz nEdila ipi submiss submiss chies tpl s ubmiss iansInEdiling tpl submiss iansinReview submiss ns Unassigned email email tpl gateway lacks s tpl help laater tpl header ipl helpTac tpl searchResults tpl lac tpl lapic tpl view tpl images 38 edpracess png edpiacesslaige png help gil icons inla gil lettei gil lacks s gil mail gil menulist gil pkp gil index jauinal tpl s ile tpl information inlarmalian lpl imsusll install tpl ins Lal Ka m ple le tpl upgiade tpl upgia
7. caaline_liick j pa ca aline_l ick_thumb j p3 cassie_alvaiada jpg cassie_alvaiada_thumb jpg hels me Lege ipg chela Lege i lhu mb jpg c ist an c is an_l ace_thumb j p3 cannie_bioaks jpg cannie_bioaks_thumb jpg c1aig_blaha jpg c1aig_blaha_thumb jpg delsulLiyg delault_thumb jpg diane_bailey jpg diane_bailey_thumb jpg dan_cailelanjpg dan_cailetan_thumb jpg darathy_highl jpg elizabeth_claik jpg eliza beth_claik_thumb jpg elly_slevens jpg elly_s levens_thumb jpg lleming_seay_thumb jpg hany ma tin jpg hany ma Ahumb jpg jacqueline_peery jpg jacqueline_peery_thumb j pg jen_maaie jpg jen_maaie_thumb jpg jan_kalka jpg jan_kalka_thumb jpg kaiLmantsch jpg kai_mantsch_thumb jpg kamai nassar jpg kamat_nassa1_thumb jpg kalhlezn haulihan jpg kathken_haulihan_thumb jpg kim_s mith kim_s mith_thumb jpg Listing of iSchool_Web Page 16 laui ie_zapal jpg lauiie_zapals iaw jpg lauiie_zapale __thumb jpa lea_engk_thumb jpg lecia_barker jpg lecia_ba ke _thu mb j pg libby pe le ek j pa la iene_ ay jpg thumb jpg luke_dunlap jpa luke_dunlap_thumb jpg mal ia_esteva j pa mal h_es ler a_lthu mb jpg jpg mallhew_lkease_thu mb jp3 megan_winge Lj pa megan_winge thumb jpg melanie_leinbeig jpg melanie_leinberg DDL jpg melanie_leinberg 02 melanie_leinbeig DD3 ipg melanie_leinbeig_thumb jpg mic
8. s lide DDB6 htm s lide DDB6_im age D33 gil s lide DESD him s lide DDS1 him s lide DDS2 him s lide ODS4 him s lide DD94 b kgidund gil s lide DOSS him s lide DDS5_b kgiaund gil s lide DDS6 htm s lide DDS7 htm s lide DDSB him s lide DDSS him slide DlDD him slideD111 him slideDlDi him slide DL D6_im ge DDB gil slide D1D7 him slide D1D7_image D1D gil slide DLOB him slide D1DB_im age D21 gil slide D DBE_im qe D2 2 jpg slide D1DB_ image D23 gil slide D1DB_ 24 jpg slide D1DB_im age D25 gil slide D1 DB_imageD26 gil s lide D1OS him slideD111 him slide D112 htm slide D112_image DD2 gil slideD113 him slideD113 im ge D53 iypg slideDl114 him slide D114_im age DS6 gil slideD115 htm slide D116 him slideD117 him slideDl18 him slideDl115 him 3 pace qil Samuellehn onDiclionriez chels_and_baak endband jpg index php index php origins Limages 2 baxe slagehlel jpeg 2 baxestageh tes jpg chelabaoks1 jpg d u ingend band JPG d u ingliealment JPG spines jpg Listing of iSchool_Web Page 28 s pinescans jpg s pine ipg rabals Lxl rooms admin index php AUTHORS cal_holdi ng 1 204 8504085491281 106118586826 165 1 208 4244645 3 319 7e294cBbe D7 bd Ba ics 1 2104 SabcSal2 2961 bD33 6d607BI bab ics 1 2108 BB2120724656 17dS0Sddel Sa7l2 ics 1 2100 2821 61165 8343 3 451164 5 1 212 1374 323 5087 ld Saabbede bd lB
9. Fade Size 1 2 GB 184 KB 264 31 8 MB 1001 byles ADE KB 1 18 3 7 MB 894 byles 7 2 MB 2 RB 218 13 8 MB 9 7 MB 128 280 228 178 6 FB 4 KE 1 5 MB 128 332 9 1 FB 123 MB 784 KB 3 5 MB 17 2 MB 13 3 MB 51 bytes 2 4 GB 399 byles 1 3 MB 6 KB 3 1 MB 16 Medilied 2 29 12 at 3 46 20 PM 11 at 1 55 07 PM 3 19 13 at 11 19 10 1 11 at 1 54 07 PM 5727012 at 10 36 26 21418413 al 3 56 33 PM 3 26 12 at 12 22 05 PM 3 15 11 at 3 23 46 PM 11 3 09 at 3 10 05 PM 1112812 at 11 45 00 1 15 10 at 2 28 25 5729 12 at 4 12 58 PM 1211812 at 4 43 29 PM 11 17 09 at B 13 13 PM 11 27 12 at 1 51 34 25 11 at 2 31 24 PM 8727012 at 4 51 11 PM 103111 at 10 55 35 PM 4 4 13 at 1 00 01 PM 11 18 15 at 2 38 48 PM 578712 at 2 19 13 PM 1 11 t 5 14 45 PM 3 10 10 at 11 24 16 7122012 at 4 42 05 PM 1715 10 at 12 57 08 PM 1 26 06 at 6 DB D7 PM 2131412 at 4 15 25 PM 10 2 12 5 54 01 217112 at B 35 22 PM 1715 13 at 2 45 04 PM 5 10 10 at 12 56 18 PM 2125113 at 8 41 52 AM 5710 at 9 11 46 PM 2024111 at 11 45 15 5 29 12 at 9 29 43 1717713 at 1 49 55 PM Macintosh HD Users jarredwilson Documents School Graduate School UT University of Texas PB Hi gs AA phita at BASH pre Appendix B full file listing Listing of iSchool_Web Page 1 abo ul _noles abaulul php enens enews_20D9 12 D1 php enews_2D10 D6 Dl php ene s_2011 03 D1 ph
10. bald png bal lB gil balB png baigl gil baigl png baig16 gil baig16 png baig2 gil baig2 png ba g32 gil baig32 png baig4 gil baig4 png baigB gil baigB png bathl gil barhl pna barhl gil baihl6 png barh2 gil bath2 png barh32 gil barh32 png bath4 gil ba h4 png bathB gil bathB png bail gil baril png Macintosh HD Users jarredwilson Documents School Graduate School UT University 2014pBigit 20 E bail63il barilfi png ba i2 gil bari2 png ba i32 qil bari32 png baid gil baid png baiBgil baiiB png baijl gil baijl png ba jl gil 1 2 baij2 png baiji2gil bari32 png baridail baij4 png baijBgil baijB png himl2 gil himl2 png sqD png sq1 png 42 sqi png sq4 png sq5 png 46 47 6 49 sqg png analaga gil analaga png aichive gil AieYauTypel gil assignments gil assignments_sel gil baal gil baial png ba al ba al png ba a2 gil baia2 png ba a32 gil bars32 png baad gil baiad png baiaBgil Listing of iSchool_Web Page 13 baiaB png gil babl png baibl gil baibl6 png barb2 gil barb2 png barb32 gil baib32 png bairbd gil baibd png bai bB gil baibB png baicl gil baicl png ba cl6 gil barclf png baic2 gil baic2 png baic32 gil baici2 png baicd gil baicd png baicB gil baicB png baid1 gil baidl png baid16 gil baid16 png baid2 gil baid2 pn
11. hame_caids jpg hame_caieeis gil hame_caieens_sel gil hame_camputing gil hame_campuling_selgil hame_cautses gil hame_cauises_sel gil hame gill ipg hame_kilgailin gil hame_kilgailin_sel qil hame_peapk qil hame_peapk_sel gil hame_piagiams gil hame_piragiams_sel gil hame_ieseaich gil hame_reseaich_sel gil hame_sel gil hame_space gil himi2 gil himl2 png 25 docu menL ican gil lacebaak_ian png linkedin_ican png quicklime_ican jpg iea kidea _ican gil 14s_leed_ican png syllabus gil lwille _ican png winmedia_cangil iGive_laga png inla_addiess jpg inla ma lian _ins Li Lule jpg iS haa Lex leri mall ischaal_piama jpg iSchaa URL gil kailes La baulgil ken_Ikischmannjpg kilgatlin jpg list_bullet gil list_bullet2 gil list_bullet3 gil list_bullet4 gil list_bullet gil list_bullet6 gil list_bullets arange_dal gil angel iangle gil aranget iangle jpg aiange_uiangle png live laqa _2009 laqa_uitle gil logos Laqa eps laqa jp3 laga_standaid jpg UT_SO LLaga_H_2c giay png UT_SOLLaga_H_2cC eps UT_SOLLaga_H_2cC png UT_SOLLaga_ T_2C C eps UT_SOLLaga_VTH_2CC eps Lucas mailman laige jpg mailman jpg nav_bg gil new_animaled_kga gil mehi zleller iS haa LA LA _Midwintei News lellei jpg j pa Macintosh HD Users jarredwilson Documents School Graduate School UT University
12. BG BRS 583843ED5555pidaliz d tpl php 6696 6 G96 398 EF X 2co n LacL Lpl ph p 66046864 G86CA2 DBS mail Ltpl php B GBE G BEEF 5485255 uses tpl php 6B26BEA6BEDD31E5S ansinReview tpl php SSG BG BF GBFSBEFFS we sian tpl php SGC GCH4 GCG DA BRFSS ianEmaillag tpl php 6D 6D336 DID2F1 DA layaul tpl php 6E 6E326E3B7531 pdllnteistitial tpl php 6E E 6EBA6EBA31D256 cCamplete Lpl ph p 3355703 703 7138BDECS25 selling S ved ipi php 5435703 TDD FDDSSG 12555index ipl php 72472D472 FOP nal tpl php 72572A TIA 15BC2 i sueTac pl php 3255743 745 A 745467F1 ex pai Dml ipl php 523743 745 7450454 B2 leyVise ipiphp 5337753778 77889C 1C655 le p4 ipl php 77577757771FD32 W submissian tpl php 7BA7EFA7EF7DDA ipi php 794751 47916004 25 ins tiuctians tpl php AICS 7CSDSDA Ta index tpl php 70870 7 1 782 1 Sins all ipl php SST DAI DCA FOC 645 25575 TEDA 751382 DASS le p3 ipl php 7ES7E157E1EBAA E zed i La ia Pa lice s ipi phip 5575 7EF3 7EFE44 Dis iansArchives tpl php 3815815 8153918C355lulurelssuei pl php SSB1 818 818753A D6 piaa liead SEL ELE 81885 B55555 piaalGalley tpl php 835835 8357E33895655 c Live ipl php SMB7 870 87070182525 selupHe de r 7 7 7 2ABS 5index tpl php S SES BED 500081 GSSlaimEnais ipl php 5 BA A BOSD DDFS nsinEdiling tpl php SE
13. DBLaad Dela ul Ls 5 hoal txt MaveCaunselnia php DB MavePeapleinia php DbResetiSchaalesh D Soul ce las ilicalians 1 1 DBS3u1ceSe mes ters bet DBSauiceSpecia Kalaga ies Lx L DBS3auiceSpecializatians Lz 1 Old DalabaseCanversianNales 1 1 iSchaaIDBLagin ine javascript animaledcalls pse js elaslic js haverlntentjs ibax js images ui bg_llal_b_assasa_4D 1D0 png ui b3_llaU 75 140 100 55 1 See_1 lt 4 0D png ui bg glass 1 400 ui bg_glass_75_dadada_lx4BD png ui bg_glass_75_eGebe6_1x4DD png ui bg_glass_ 5_lellec_l 4D0 png Macintosh HD Users jarredwilson Documents School Graduate School UT University of Texab ednngd 221 Attiiand E prot Listing of iSchool_Web Page 31 ui bg_highlight sall_ 5_ceceee_1 x1 DD png docs ui icans_222222_256x24D png css ui icans_2eB3ll_256x24D png ui icans_454545_256x24D png ui icans_BRBBBB_256 240 png ui icans_cdDabs_256 24D png id gallery css id gallery js jquery ui 1 B 16custamcss query ui 1 B 18 zuslam min jquety ui lime pi ker d dan js jquery js maalaal 1 2 1 xaie yc js yupedish js K analilh Even LDB Lagin inc neyi s in base DEC rea leNews 1 1 DBRe sele t pl cih DiSou ce ichfve 1 1 Hews DBLagin inc abi alele hlaccess resume_dsisabose DbResetSctiptResumescsh DBSauiceCaBreeilnterestCategaiies Lek DbSaurceDegiee Types 1 1 ResumeDbllagin inc Shared DB Funct
14. DELETE ON News TO admin localhost commands After the databases were created and permissions were granted the data was inserted into the database from our sql files using mysql u admin p h localhost iSchool lt iSchool sq and mysql u admin p h localhost News lt News sql For a complete list of commands used in installing the databases see Appendix C Rendering the Data in a Browser At this point the website was basically functional with most of the PHP and HTML files rendering correctly in the web browser However there were a few pages which would not load under the localhost URL One particular page which when the link was clicked from the home page would try to redirect to the live http www ischool utexas edu URL It was discovered that this was happening because the component files included the current PHP file as well as an older HTML file The older html file had been superseded however the settings in our Virtual Machine s Apache server made the HTML file the first choice for launching After correcting this setting in the Apache server the PHP page loaded correctly at localhost Finishing the Virtual Machine Once all the component files and databases were installed and the website was operational within the Virtual Machine the Machine s desktop browser history and other extraneous files were cleaned up Instructions to open the browser were included in the form of text on a JPEG file
15. EFECDEDS Sse aches ipl php SA EnA ECA AEC BEBA heade SSA FFA FDSAF DE SFDES backls sues tpl php BDABD33BD335 54255 ipi php 6 B 2402548256040 0 ansinReview tpl php SS 2 8294 8250C4 nsInEdiling tpl php 8358343834DBEBB1 index tpl php 55843 B4D B4DEE41 P impa LE ai ipl php 406860 Bibi 544526 B638B6DAB6DB3DDS submissians tpl php SSB 7874 8743 1 B5 6i ss ues pi php S8B7487438731494 Sl aya u L Lpl p hp SSB S g F4 BB4F 14 m men L Lpl ph p SSBC BC D3 BC DEDEF 5555 u ppF ile Lpl ph p SAB EA BEF BEF G7151 m ple led Lpl ph p SSBF BFB BFEE 3 SMC 1 502 SLi tle Index ipi php MC 24C224C2252 5 Dach ive ipl php SMC 21C2 8 C 2 BF EB BCS P ra lileFa SC 26 CIC AC 2CFS 1258p dal ead tpl php 3538 C3888C5 5 MS ssianReview tpl php SSCA CAG CAGE DDS 25553 izle ipl php C464 54 CA SEA FDS use Pialile tpl php Sd CAC CACAN A DBRS n vsidebarlpl php SSX 552343 ipl php SMC BC DA C 3551 sues ipi C DAC DSS EA 1 MMC B25CB2DF35105555kickis SC BC B53CB5B9DS submis sian BAC 4CH 5208545656 laa er ipli SMC BAC BESCBEE 8271556 yliseriEm 215 4 1 1 CCACCBACCB17D365 biz MMC DACDF4C D763 DSA a und Ip SMC DACDS4C 05713385 SC EAC EG
16. Strategist Sam Burns with the installation we decided that the Web Curator Tool installation was not going to be a successful undertaking and opted for the previously installed and crawled HT Track files Installing HTTrack After our exploration of Web Curator Tool we decide to use HT Track to harvest the website HTTrack is a free website copier that downloads the directories HTML files CSS files and images from a website to a local directory for offline viewing It automatically rebuilds the original site s relative link structure so that you can browse the local site in the same way as you would if you were viewing it online The software has two releases WinHTTrack for Windows computers and WebHTTrack for Linux Unix machines While Heritrix and Web Curator Tool require technical expertise and a familiarity with the command line or help from a systems administrator HTTrack is user friendly and straightforward to use Users with limited technical knowledge can configure the program and start crawling a website in minutes without the need to consult a manual Users who are more technically savvy can adjust the parameters of the crawl such as choosing the types of files to harvest or how many links to follow Heritrix and WCT also allow parameters to be adjusted if the user has the technical skills Our mirrored site captured in about four hours looked similar to the crawled site so we did not feel it was necessary to adjust the par
17. archival community will need to decide on shared standards and guidelines Preserving websites continues to present real challenges to the digital archiving community and new issues will continue to appear as online technologies evolve However this project has shown that new techniques can be successfully applied to the challenge of preserving web resources and we hope that others will continue to experiment with visualization as a method of web archiving and developing more user friendly web crawling tools Appendix high level file and folder listing Name abdul admissians advising careers causes develapment events lavican ica ms lhanl_page css glabal php images images 2D05 index php inla ma Lian ischaal css ischaaless bkp iSchaa News leed xml iSchaalHewsleed xs jabs jabweb kilgailin labs lacal ine ajs atientatian peapk Pplagiams teseaich tabats tet sale_email php schaalwide seaich php shared Listing of iSchool_Web Page 1 Kind Fade Fade Faker Faker TextWiangle text dacument Fake Fale Fake Windows ican image Fake CSS style sheet TextWiangle text dacument Fake Fale TextWiangle text dacument Fake CSS style sheet Dacument XML Dacument XSL Stylesheet Document Fake Fale Fake Faber TextWiangle Document Fale Fale Faber Fake Fake Plain Text Fale TextWiangle text dacument Fale TextWiangle text dacument
18. hael_laiid jpg michael_laiid_thu mb jpg michael_winship jpg michael_winship_thumb jpg nicak_iabichaux jpg _iabichaux_thum b jpg awen m nally jpg awen _ nally_th u mb jp3 pauLaumetryan_thumb jpg quinn_slewsil jpg quinn_stewarl_thumb jpg tebecca_eHesjpg tebeccah_hill jpg tebeccah_hilLthumb jpg sam_buins jpg sam_buins_thumb jpg saahan iaga thumb jpg samah kim jpg sarah_kim_thumb jpg s hane_williams jpg s hane_williams_thumb jpg jpg s heila_siegki_thumb jpg slan_gunn jpg s Lan_gunn_thumb jpg s lephanie_lawery jpg 32 s lephanie_lawery_thumb jpg s ue_mu phyjpg s ue_mu phy_thumb jpg tanyai abauin jpg Lanya_ abauin_thumb jpg tany_chetian jpg tany_chetian_thumb jpg unmil_kaiadkar jpg unmil_katadkar_thumb jpg william_as pi sy ipg william_as pi ay_thumb jpg yan_zhang jpg yan_zhang_thumb jpg peapk gil peapk_sel gil Plagiams gil Plagiams_sel gil PythanPawered png teadings gil teadings_selagil Red URLBall gil teseaich qil teseaic h_sel gil tesauice gil tesauices gil tesauices_selgil teluin_titk gil iss_bultangil sanchez jpq SmallTy pelBall gil spmergil 3 pacer_black gil sq0 png sql png 42 sq3 png 44 45 sq6 png sq7 png sqB png sq5 pnq 49 9
19. htiml Ilhc D3232004 himl Uhe_details pdl lap_ranking_D5132005 php nbull html updegiave himl ul_linances_2010 pd1 ulapia_DS1420D5 php 18 04 DR2DD3 him watch_diaz_D61D2DD3 himl web_pianee s_DGD32003 html websile_canlesl php news_lemplale php mewslellers 2012 02 2012 03 2012 04 2012 05 06 2012 10 nae_tanes jpg phalas u pds les 1 1 news Lemp Let 04082004 1 1 ul_ditect_newsleed php UTDirectNewsleed idl view php pas ilian_availabe ph p 1anking php taams php vision php visian_ciicks jpg visian_citcks2 jpg visian_circles_lull jpg vision_litk gil admissions nolez deadlines php laims php Tunding acs php awaids php deadlines php diexe_lnaga qil giaduate_lellawships php imls_hgajpg in_stale php index php ischaaLawards php pieservation_lelbwships php pieservalian_lellowships_apply php iesauices php la php liavel php index php inlemnatianal php lacal ine Macintosh HD Users jarredwilson Documents School Graduate School UT University 81Spr ql Attend E prot masie s php phd php placedures php schala s_in_ e side nce php lions cas php ask php sdvising admin php admin_hgin php admin_lagaul php admin_view_cal php advising css advising inc advisar php advisai_edit_advisai php advisai_ieques cancel php advisai_view_list php ava
20. our repeated efforts to install and deploy Web Curator Tools were met with several challenges At the moment the only supported platform for using and installing Web Curator Tool is Linux However the group did not have access to a Linux operating system during our first install attempt Instead the group used separate installation instructions developed by Web Curator Tool to install the software on a Windows machine The first installation attempt lasted for over seven hours involving installing and configuring PostgreSQL phpMyAdmin Apache Tomcat 5 5 and the Web Curator Tool installation files After installing and configuring all of the components to run Web Curator Tool attempts to deploy the war files for Web Curator Tool in Apache Tomcat were unsuccessful After two weeks of attempted re installations reading through the error logs and still unsuccessfully deploying the war files the group decided to attempt to use a Linux environment to complete the installation Using the COM2 computer in the Digital Archaeology Lab which runs an Ubuntu operating system the group installed and configured PostgreSQL Apache Tomcat 6 0 and the Web Curator Tool files While we were able to successfully install the component pieces we could not get Java configured properly due to a previous installation of Java on the COM2 machine that we were unable to uninstall and start with a newer version of Java After enlisting the assistance of Communications and Web
21. phd _p ag am_sludy dac phd _p ag am_sludy pdl phd _sludenl_handbaak_ve _lall_2 DDS phd_student_handbaok_vei_lall_2011 pdl resesic h_excelleme pdl s chaal_lib ary_practicum_appdac s Ludent_medicaLiekase dac s Ludent_Liavel_ieleas e dac s Ludent_liavel_iequestdac la_applicalian dac la_applicalian pdl la_jab_duties dac liaveLpalicy php l Lieimbursement xls wise_applicatian 2D11 dacx wise_applicalian 2D11 pdl wise_applicalian dac 22 wise_applicatian lhant_page css glabal php images abautgil abaulsel gil admissians gil admissians_sel gil alumnijpg amslcg analaqa qil analaga png baal gil baial png baral qil ba al png baia2 gil baia2 png baiai2gil baiai2 png bajad gil baiad png baiaBbgil baiaB png barbl gil baibl png ba bl qil baib16 png barb2 gil baib2 png barb32 gil barb32 png barbd gil baibd png bai bB gil baibB png baicl gil baicl png ba cl gil ba cl png baic2 gil baic2 png barc32 gil baic32 png baicd gil bacd png baicB gil baicB png ba dl gil badl png 16 91 416 Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 Attend E baid2 gil baid2 png baid32 gil baid32 png baid4 gil baidd png baid B gil baidB png baiel gil baiel png ba el dil ba el png ba e2 gil baie2 png bare32ail bie 32 png ba e4 gil bared png baieBgil baieB
22. this purpose The files uploaded included all the current and past iSchool website component files except for video tutorials The video tutorials were not included because of their large size For a complete file listing see Appendices A and B Also included in the files downloaded where the two databases which run the iSchool website iSchool and News The databases were in the form of sql files which included the SQL commands to build the various tables and insert the files into the local database In the Virtual Machine the tgz file was downloaded using an FTP client and then unpackaged for use The usernames passwords in the files were changed from their original to username and password This was done so that the original passwords did not become known comprimising the security of the live site All of the website component files were placed in the system s www folder var www and the index php page which was used earlier to test the operability of the localhost in the web browser was replaced with the iSchool website s index php file Two databases were created using the terminal command CREATE DATABASE iSchool and CREATE DATABASE News Using the super user account a new user named admin was created and permission was given to that user to access the two new databases That was done with the GRANT SELECT INSERT UPDATE DELETE ON iSchool TO admin localhost and GRANT SELECT INSERT UPDATE
23. tpl y debar tpl s debai_year tpl tada tpl week tpl yea Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 Attend E prot admin tpl calendai_nay tpl day tpl delaulless evenLipl laater tpl images manth_medium tpl manth_small tpl pieleiences tpl print tpl 15 search_bax tpl seminar tpl s debar tpl week tpl yeartpl lest TIMEZONES upcaming php validate inc week php yearphp sale_email php schoolmide James_Andiews pdl Kalpana_Shankar pdl Katie Vann pil Nathan_Ens mengei pdl Phillip_Daty seaich php shored AccauntDBLagin ine s_ds la base DEC cca un Ls DBLaadDelaultsSci ipL lx L DbResetSctipt csh DeSaurceAccaunts Lek DBSaurceUsers 1 1 DBS3uicellseis 22 CAccessary ine CAccessaryCallectian ine CAccaunLine CAccauntCallectian ine CAccauntLlink ine CAccauntlinkGallectian ine Listing of iSchool_Web Page 30 CAccauntReques line CAccauntRequestCallectian ine calendsr_jvascript calenda himl calendar js images cal gil laqa gil nexlgil yesi gil pixel ail prev gil prev yesi gil ic qil cap stone da la be se D Canversian3ciipl php DEC aps lane 1 1 DBLasdDelaullsCaps lane Let DbResetCapslanecsh DBSauice iea
24. ty_lullipg tice live ty_thumb jpg saiah_cunningham jpg 30 saiah_cunningham_thumb jpg seaga_thumb jpg sheKan_lulljpg s hekan_thumb jpg s lukenbilLatiginal jpg s lukenbil thu mb gil s paks s parks_thumb jpg stamp ad ian jpg bias jpg blukenbill j pg chen jpg clayman caving lan cunningham k u ppa j pa di 9 ilan jpa daly jpg daty_thumb jpg jpg galleway gil gr cy pg haiman jpg haiman2DD2 jpg immiath jpg melzgel jpg awens jp3 pavelkajpg pal ack j p3 tice live y ipa yeaga jpa s lukenbillgil 3 paiki ipa luinbull jp3 updegiave jpg williams jpg wyllys jpg zamaia jpg s Ludent jpg turnbull_news jpg turnbullthumbjpg updegiave_lull jpg updegiave_news jpg updegiave_thumb jpg vaathis_thumb jpg westbiaak jpg Macintosh HD Users jarredwilson Documents School Graduate School UT University 4014pBigit 20 taihnd E prot wes Lbiaak_thumb jpg whazil_lullipg williams_thumb jpg wylis thumb jpg zamaia_thumb jpg FAQs gil gnu hesd liny ipg giading gil gisding_sel gil giaphics gil home lemp_dance jpg hame gil hame2 gil hame2_sel gil hame_abautagil ha me_a ba ul_sel gil hame_admissians gil hame_admissians_sel gil hame_bay jpg hame_caids jpg hame_caieers gil hame_caieens_sel gil hame_camputing gil hame_campuling_selgil ham
25. 1GDOCFE453 shingSystem Lpl php 5318 19E 3 15EDC FB Pas ngePasswaid ipl php 555513 514814 FABS Slagle 46518418241 825B F23526 Lipl php 6 18410341 8300 873 sec lin lpl php 553185185 18515 AF7SSS ssianEditing 5333203202 212C28835 gePassward ipl php 5542103 215 215 BBC DPS navsidebar tpl php 5555213218 218C8813568 al Cu m plete Lpl php 21821792172 45234233 4235 F RES AWM seaichUser 2352363236EDF2 ian Eventl M521321732375DDD155 ummaliy l 5243244 4244D71FS inlaimatiar 245246 248153 53556 a mpieled MS253251 25185D28555 26 26D 26 DGBTF CHS ja una ls lp SM2832813281FBS1C65tac ipiphp 298291229193 4 294200 4250 2266 DAS entlagEr M2C 2C14 2C1 FD DODAM s sue tpl p KICA ICEA 2CB4B445 meladala t MMZE 2EDAZEDF GSES S ssianRevi 53303 305 330845 EFR tpl php 5583113314 4314AF 32 3298 32515119 sUnassig 3532343205 32613DB5S5i he dei lpi 53343 348 4346 PDFS B328reg is ie Si le 36 36D336DBS1BBS 5 38 364 364 8890 4 27837 37 28 pralile tpl MSR 38C 3 38C 5 1622588emailipip SM38338D3 3ED742 0 yd 15415 44050656 6525 use Pralile 426 42688 42542F342FD737ASW editai 5 43594343434D5ECFPS5 me ss ge 35499456 4363 AR GAM le p2 Lpl p 43543E3 43E5BED6 submitHeae 44 441 4412
26. 22DDS php haiman_1 DD62DD4 p hp hei iman_suil jpg ia_D3112 DDS php 03282003 iasummil_D30B2004 him 05012003 04302003 Listing of iSchool_Web Page 2 imls_giants_DS1 72DD4 php imls_successes_D7132DD4 php immiath_12172003 html index html ischaa LSSA_D7DB2 DD4 php 09142005 kilgarlin cente 08242004 04222003 himl landc_anniveisary himl LHRTawaid text 03 him libiary_e mplayens_09222D05 php ILeavet jpg laudenslaget_iep_D6 252004 php lukenbill him memillsn_buin_DS242DD5 php 0909200 3 milk _iepiesentatiee 03142003 maver s_n shakes 04192004 nam _chanqe himl new_baaks_D5132D05 php b_manage _102B2002 him news ilem himl atian awad 01212 004 a _12152004 arientatian_2DDd4 php awens_kadeiship_DS 3D2DD4 php pavelka_D4D72DD5 php piesidentalmanagement_D6 162DD5 php raza_unida_D4DB2DD4 himl RelerencePublished1D232 DD2 himl tice live ray ala 05162003 ray q anL1 1282013 himl sanchez_ailick pdl sounds aving 57212003 himl s ludent_awaids_04132D04 him s ludenl pape D4142104 himl s ludenliiecagnized 11302004 leaching_awaid_DID72005 php 11112002 lest php lexas_legacy_D4242DD3 himl 03092 DDS php dinne 01272003 Lla_canleience_D3232DD4
27. 547D5 slalus lpl 58454459445 SDDDE6 in te islilial 451452 45C4 DDES umma 1 47 31 Sd 83481 48127900 pee Review 1549493 44931 DG FD nligUpde 4443 4A DER SAC lepS SAA 54AE 34361127P5iu na Gelli SMI E 34254 246 DED ueDala t SAE AEM AEA 3FA3 2 em plate F d F AFA 4F AF FF FS sidebar 5655 1 513 5 DA AAA FARS index SS 5 2 EDS sys le mon 54 546 546AF219 5 nceMWanag 56 56335637DBB1 index 1 SSS 74577 AS 77 BOF EBslaaler tpl p SBF ASBFS710 1525 me Lada lakd 35 95587 55570430B526co mmenl SS 93 553 5554 E3 S5 IS Sind ex 1 1 55 S SE 4S SE064 SSSA 35453 SASA DDG 7 index tpl p MSSAA5SAA55AASADD9maaqem Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 201 4 E prot 39 Listing of iSchool_Web Page 24 335 83 58535853C 3555 83 5BF3 5BFCA 184556cu renlipi php SSB 5BFA5BFFD7 735454 ng u ge sipi php SMSC ASC24 C21 7ST EMSs ile tpl php MSSD35DDASDDD45 045 iew ipl php ASS FA 5F555F3A43 Sse Lling ipi php 3535 FA FB 5FBSFCSASSSg lleyFaim ipi php M bl 1E 6 1EDDEF Pats las tPasswoid tpl php SMG2 G2R 5628185455 ianCamment tpl php 4562562946295020 Bescampleted ipl php 63363D463D357DD su mmay Lpl php SS64 64E 8487861 7556 mail tpl php 5866 666 6660CE6 deCampkte ipl php SSG
28. 6 institulians php pep_checklist php pep l q php pep_guidelines php pep_inl ad uc lian php pep_appailunilies php pep_piesentation php pep regislislin php pe san_delails php pas lei_sessian php posler_session_archive caps lane_pragiam_lall_2D0 pdl caps lane_pragiam_lall_2D10 caps lane_pragiam_lall_2D11 pdl caps lane_pragiam_s piing_2 D1D pdl capslane_pragiam_spiing_2 D11 pdl capslane_pragiam_s piing_2 D12 pd3l capslane_pragiam_sum_20DS pdl caps lane_pragiam_summei_2D10 pdl capslane_pragiam_summei_2011 pdl index php summe _2DD9_na1 Lan jpg summei_20D9_thamas pdl piacticum_libiasies php p r zlicu m_p e set v lian php plaject_details php teparl php thesis php conservalion interns 2001 _noles AbbyHayaaad_FRepa D1 pd AbbyHaywoad_Repai DB D2 pdl AbbyHayaaad_Repa DI pd AL FinRpipd1 AL pl2 pdl AL pl3 pdl AL pid pdl AL pi5 p41 ALI pl_1 pdl 4 1 _1 15 _ 1 1_1 Aichived ntemnRe pails ph p Archives Rpts html AS i pl2 pdl AS pl3 pdl Beth He lle _Re pottDD1 pd1 DP FinalRpL pd 42 DP_iptl pdl DP_ipt2 pdl DP_ipt3 pdl DP_ipt4 pdl DP_ipt5 pdl FT linal pt pdl FT_iptl pdl FT_ipt2 pdl FT_ipt3 pdl FT_ipl4 pdl Genevieve Piece Repo iDD1 pdl Genevieve Pie ice Repa lDD 2 pd1 Genevieva Pie ice Repo ilDD 3 pd1 HaliyRa be ilsan LECL pdl imag
29. AC EG DAAS ES ca mpleled 74 711114 FAC FAAC FAADE E4525 bsctiplia DD4 DD14 DD LECF AES ubsciplia D1 D164D1 84455 5 p D2 D2F4D2FO4 35 Essent tpl ph D34 D354 D3 9246 DOSE sans L Dd 5 D4C 3 D4C 1843965556 sionHist DAC 56455 Del SSMDS DEB D5 BES 185556ca mpleled SSSDS 5 D58 D5BDDECS5552ca n Lex L LF SM SDS3DSB5D58897 SS se 11 Res u SD7307A 5D73X44 BBS alive 1 SDA 5 DAB 4 DAB 2 25C journals DB DHA DB4B75A DSRS DC A DCE DC BEDSS 5 peFa SDE 5 DEL DE148 54 SDE DER DEBE DE1 D335 Sponsa DEAA DEAS 20 1 1 S SDF DF123 DF1AC4C BSB u m 1 DFS DFBDSE7Cd556viee ipl pi SSE 25 6274582 7 27 1 1 SSE 2 E2 FOE 2FBF4 RAS s ide bar SSE 3 E3D E3 ADATSA heade tpl S E44E454Ed 35 EAA Scam men L L SSES ES14ES 1DG1 BSS a lida te ipi ESAESFAESFBSB72 nalilyU seis SSE SA ESC 3 ESC 35 96 nialiSync t SEA B 3 EA 36 3 D4 3SS pee Review F13F143F143 777A ep5 ipl pt F23F215AF219CERD7 1eqislte F32 F321 BF1ASSS ileMap Lpl F33 F33 5F384F382 964 1 u geSelli A F3BAF3BD6G 729526he dei SMFS 55 559 93 4 tpl pt SFG FGA FAALE submission 5 FE FREZE 6 SBCs epl tpl pt FEA FEBSFEBDB441525m nagem
30. B EB 80504 Ad 35555 live ipl php SMBD BD74B DTEAES PSS sianRegiels ipl php SBE BER BE GEL SC Dats ancedSeaich ipl php SOBRE BEES BE ESBS 14 522 n LEma il ipl php SBF EFO BDA CAMs ub missian tpl php SSOD SDD 35116 3981525 GalleyHTML ipl php 4514918491 BCS1 CS iptianFai 655149144514 5149658 apluieCite S S14S1 451 110 23556cc pyediLipi php 51491 F4S1 FA 6 36 Pstim partSuccess tpl php 5924023805234112 4 9454EAS4BBBS5A5 aillagEntiy tpl php S76 S71 711 6 GAG 552 ilePi aCi le Lpl ph p SSC GCA SOSE4 DCPs piaaliead tpl php KACA GCF 5 5CFA754 4525 agin Lpl p hp FFABF Fi4188526 ammendation ipl php DALI AD1717385555 u ppF ileVieeripl php DA AD 4 AD43 175555 u pp Files Lpl php ADB A 187342 DAM sianEdiling tpl php SSA 11413 41 3DBGF naws ide bar ipl php 5430 142153 A15521135855 a Ga lleyTap ipiphp MS 2332033 2DD454 Bes ummary tpl php SSA 3483354 6 32556ve ling tpl php 8413 44186 51 Bes u ppFile ipi php 753718 ATITICC SMsearchUsers ipl php SA THAT AA 7 FS SAA TAATOA ATES 341a LicleCa n lex ipi php 944 04 4 ABA D4 1 E 23 iplia nT ype Lpl php 533593 49602 Dim A 845 054 00 S04 DF BotseaichResults tpl php BE 5AB EG LE ESS Lex Ls Lpl php D44 D244 D2 2 FE 92 aya u L tpl php SSA E44 ETSA
31. Developing New Methods for Web Archiving Preserving the Dynamic Content of the iSchool Website by Kathryn Darnall Laura Vincent Elliot Williams and Jarred Wilson INF 392k Digital Archiving and Preservation May 1 2013 Developing a System for Archiving the Website This project was tasked with archiving the School of Information s website The School of Information iSchool at the University of Texas at Austin was launching a completely re designed version of the website in March 2013 and the iSchool administration wanted to preserve a copy of the old website The four project members were students in INF 392K Digital Archiving and Preservation taught by Dr Pat Galloway and were assigned to develop a method to preserve the out going version of the website Previous work Our work built upon the efforts of two groups in prior iterations of the INF 392K course who archived the School of Information s website A group of students in 2005 made the first attempt at archiving the website This group which included Rick Taylor Mark Downs Stephanie McFarland and Melanie Cofield focused on understanding the intellectual property issues involved in archiving the website explored methods for crawling the website and evaluated DSpace s usefulness as an access venue for archived websites Although the group members project reports suggest that they did not have as much success as they hoped for their work provided important inform
32. Entry tpl Listing of iSchool_Web Page 23 submiss ianEventlag tpl submiss kin Eve niLag Enliy ipi submiss ianHistary tpl submiss ianNates tpl submiss ianRegiets tpl submiss ianReview tpl submiss kins rchivei tpl iansinEdiling tpl submiss iansinReview tpl userPialile submission comment instiuctians tpl syc ul meladals sup pFile tache t_compile SSDD4 DDG DDG 265FS ca pyedil ipl php SSD251025 128F18CC 5553 ulharindex ipl php 555504 5147 04717122526 seclianFaim tpl php 55504 5 D4C3D183326 ipl php 55405 054 54 534075 D7C D7C BLS Fak Mindex Lpl php SDE DE4 DE45D2 7C blindex 1 S SDE DEA A DEY FSSE3S metadata tpl php SSDS DSF DS7DGEC Is DA 50435 DAES 3FESSS aale ipi php SDE DES DA3AASS Ben 1 1 SDC 5 CCA CATITA ayoul tpl php SSD DDS L DS FD2 Dats me L dala SDF DFG DFG D2 005 u ppF ileViseipiphp DF DFAS 2 17 PatscileE nd Na Le Lpl ph p 102106 108825375356ca nlex iplphp 46 10 1DD 10515DD4526em il 5554105 5 10EBCD4 7356 u bmitSideba lpi php 121122122 DFA HS laya u L LpI ph p 5855123124 4124234526 seleciRev ewe ipi php 12 1288128 95 18552 es el 5358143142 1423585 25 seLDue Date Lpl php 16 16D 16 D2EBG 55 17317F 17FD4482 5 6vieeP ge tpl php 65184183 15195416D
33. G IMG_D121JPG IMG_D123JPG stykecss ix_lab jpg ix_lab php lab lacal ine pape lab jpg pape _lab php phala php saund_iaams php lech_libiary php lacal ine cesses admin form article Attick inc php 34 A licleCa mmenL inc php A licleCa mmen LDAO inc php uicleDA inc php A licleF ile inc php AilickF ile inc php AtlickGalkey inc php AtlickGalleyDAG inc php A licleHTMLGa ley inc php AiticlkNale inc php A licleNa le DAG inc php Authar inc php AulharDA inc php log PublishedA ticle inc php PublishedA SuppFile inc php SuppFileD36 inc php author form comment Camment inc php CammeniD 3O inc php conlig Canlig inc php CanligPaise inc php core mitersilai inc php Care in php Da 0 bjec Linc php Handle inc php atai inc php Regisliy inc php Reques Linc php Suing inc php Virtual nay ker db DAS inc php DA ORegisliy inc php DA ORe suliFaciary inc php 1 DBDataXMlParsei inc php DB Re suliRange inc php inc php SQLPaneiinc php XWLDAQ inc php Tile Hanaga inc php File Manage inc php PublicFileManage inc php File inc php Tempat ary File DAO im php Macintosh HD Users jarredwilson Documents School Graduate School UT University 2014pBigit Attiiand E prot T
34. La nya_c Lanya_ aba u n jpg Lanya_ abauin_thumb jpg laia_iagulijpg taia_iaguliithumb jpg tata_iagulli jpg lara_iagullithumb jpg lany_cherian jpg tany_cheiian_thumb jpg unmil_kaiadkar jpg unmil_kaiadkat_thumb jpg veianica covi ng lan pg veianica cavinglan humb iyg i ginia_baade n j pa vigina baowden_thu mb william_as pi zy jp3 illiam_as piay_thumbjpg wallgang_niede thumb jpg waod y_davis jpg waad y_davis_thumb jpg yan_zhang jpg yan_zhang_thumb jpg peapk gil peapk_sel gil pragiams gil plagiams_sel gil pulLquale png PythanPaweied png teadings gil teadings_selgil Red URLBall gil research qil teseaic h_sel gil tesauicegil tesauices gil tesauices_selgil retuin_lille gil iss_bultangil iss_padcasL gil sanchez ipg yepalatai gil s de_bendyDD1 jpg SmallTy pelBall gil spmergil s pace_black gil sqD png sql png 42 43 44 sqS png Macintosh HD Users jarredwilson Documents School Graduate School UT University of Texab ednngd 2014pBigit 214 E prot sq6 pnqg sq7 png 4 549 9 y ludentl jp3 s gil syllabus gil syllabus_sel gil title gil Lille 2 gil lille_shaitgil lowe jpg uigieygil barange gil TypelBall gil us n_laga_117 il ulseal jpg ul_towe _loga jpg ula jpg jpg ula_lilth_Ihaimap_sm jpg uta_liisU_llaai_m
35. OlTawn Lxzt DBS3uiceGenies Lx l DBSauicelevselOlWaik Lz 1 DBSaurceSalsiies txt DBSauiceSe mes ters txt Caps lane DBLagin inc CCapslanePersan ine CCapslanePe sanCallectian ine CC lasi inc CC lsiCalleclian inc C lt CauirseCallectian inc CCauiseOplians inc Cllem inc CllemCallectian ine ClemType ine Clle mType inc CKranalithEventinc CKranalith Eve inc CNewsCalegaryCallectian ine C Hesr CPEPGen CPEP n titution inc CPEP n titutianCallectian inc inc CPEPPersanCalleclia n DEA D i nc CPEPPiajectine CPEPPiajectCallectian ine CPEPPiajectSeaich ine CPEPSalaryCallectian inc CPeisan inc Pe isanCalleclian inc 46 Pe sanOplian inc Re vervalian inc Re liz nCa lleclian inc CFesume inc CResumecallectian ine CResumeOptians ine CSSLCDaLa ine CSSLCDalacallectian ine CUTUser ine CUTUseiCallectian ine eid_lagin_lunctian inc elaslic_css elaslic css elastic js elastic print css equip me nl iss ba se DEC res leEquipmentCheckaul Lx lL EquipmentResetSciiptcsh DBLaad Dela ulls S Di3ou ies Lek DESau celle ms Lx L 5 celte mTiaubleTy pes Lx l DBSau celle mTy pes Uz l DBSauicellseiTiaubleTypes Lz L Equipment heckaulDBLagin ine ischool_ds base DBCiealteiS haal LzL
36. Settings Handler inc php index php article A licleHandle inc php index php author AulhaHandler inc php index php SubmissianCammentsHandke1 inc php SubmilHandier inc php TiackSubmissianHandket inc php comment Canlig_File class php COPYING lib debug ipl imler ml plugins Smaily clas s php Smatty_Campiler class php CammenlHandler inc php index php copyedilor Ca pyedila Handles inc php index php SubmissianCammentsHandkes inc php SubmissianCapyeditHandkt inc php edilor Edita Hande inc php inc php gateway SatewgayHandleji inc php help HelpHandlel inc php index php index index php Index Handlet inc php information index php InlarmatianHandles inc php install index php InstallHandlei inc php issue index php IssueHandk inc php layo ulEdi lor index php Laya ulEdila Handle inc php SubmissianCammenlsHandle inc php SubmissianLayaulHandle inc php cgin index php LaginHandlet inc php manager in php Files Handle inc php Impa lEx pa lHandle inc php index php Jau nalLanguages Handler inc php Manage Handle inc php PeapleHandle inc php SeclianHandle inc php SelupHanale inc php Subsc iplianHandle inc php O lHandler inc php index php Piaalis de Handlei inc php 37 SubmissianCammentsHandker inc p SubmissianPraalieadHa
37. US editaiial ta pi DDDD31 inc php en_US edilaial ta pic DDDD32 inc php en_US editaiial ta pic DEDD33 inc php en_US edila al ta pi DEDD34 inc php en_US editaiial ta pic DDDD35 inc php en_US edila ial La pic DDDD38 inc php en_US edils ial ta pi DEDD37 inc php en_US edila ial to pi DEDD3B inc php en_US edila ial la pic DEDD35 inc php en_US editaiial ta pic DODAC inc php en_US edila ial ta pic DEDD41 inc php en_US editaiial to pi DDDD42 inc php en_US edila ial ta pi DDDD43 inc php en_US index iuc DDDDDD inc php Listing of iSchool_Web Page 20 la pic DEDDDD in php en_US inuia tac DEDDDD inc php _ 5 la pic DDDDDD inc php _5 lo pic DDDDD inc php en_US jauinal tac DEDDDD inc plip en_US jau nal DOODO inc php en_US jauinal tac DDDDD2 inc php en_US jautnal tac DDDDD3 inc php en_US jau nal DEDDDS inc php en_US iaurnal la pi DEDDDD inc php en_US la piz DEDDD inc php en_US tapi DDDDD2 inc php en_US jauinal tapk DEDDD3 inc php en_US la pi DEDDE4 inc php en_US la pic 000005 inc php en_US la piz inc php en_US la pi DEDDD7 inc php en_US jau nal la pic DEDDDE inc php en_US iaurnal la pic DODDOS inc php en_US la pic DDDD D inc php en_US la pic DDDD11 inc php en_US jau nal la pi DEDD12 inc php en_US iaurnal la pi DEDD13 inc
38. UT University of TexaWeonra 2014pBigit 20 E Brood paitlalias php labs annex_sm jpg aichaeakgy_lab php baak_kb j p3 boak_lab php cables jpg php campulet_usets jpg canseivalian php cansenvalion_b php digilizatian_suile jpg digilizalian_suile php images lab_student_warking_2 jpg lab_studentwarking_4 jpg lab_students_warking jpg lab_students_warking_3 jpg index php iLlab equipment php haadies jpg index php palicies php printe DD1 ipg printing php pul pk_ied_ciass jpg s tall php waikstatian jpg bb Tiles lab Cam pute Specilicalians pdl img Listing of iSchool_Web Page 18 IMG_DI3SDJPG IX lab laqa ipg luis_hanciscarevills jpg najpa nav gil pakjpg quale gil 1andalph_bias jpg tandalph_bias2 jpq sam_buins jpg signature jpg thumbagil lite jpg bara ca lumns gil whitwarth jpg wiappel gil yan_zhang jpg index himl labeast index html js laga png player swl pas ler_dema ldv pas le1_dema_l ldv pas ler_dema_2 l4v webcas webpage tgz news himl na header html peapk himl tesauices himl schedule himl 246534_3564737274323_87232503B_n jpg ingle alumn himl 323831_10100519174 0_1208759497_a jpg ludies html bgjpg ballam gil he dei pg header2 jpg header3 jpg huang jpg icanl al ipg imaqeDD1 png IMG_D1D4JPG IMG_D1D6 JPG 145 0111 5 145 0113 5 IMG_D115 JPG IMG_D116 JPG IMG_D117 JP
39. _ D7 122005 Macintosh HD Users jarredwilson Documents School Graduate School UT University 22 AF taihnd prot caralyn_statk jpa china jpg 02102005 _ 04282004 _11202004 1028200 3 html cameacalian_s pi ing 2005 php cunningham uppa 05202004 php 01202005 9 _lellawship_D5172DD4 php dean_dillan_s peaks 03022003 dillan himl d ilkan_ha lla pk 5 04072005 dillan_keynate_D22B2DDS php dll_lellawship_1D2D 2DD3 himl dan_davis_ala_D21D2DD3 himl dan_davis_ala_text himl dan_dislinguished_D7072004 php edible_baaks_D3D42DD3 himl edible_baaks_D33D2004 himl edible_baaks_D4D72DD3 html educause_D2D12DDS php explore ul 030 2005 lac_stall_book_exhibil_1D2B2002 himI laculty_inthe_news himl laculty_a penings_DS1D2D04 php lacully_pasilians php laculty positions O91020 php laculty_pasitians_aviLOSDB2003 him gallaway_ds pace_D3D3 2004 gallaway_ingenta_DS122DD5 php galeway_pieview_04242003 himl glilas_piesentatian_ 4262004 himi qaagle_D6232DD3 giay_giant_12152DD3 him giad_spiing_ 4 php g d l r iling_p ajecl_11 D42DD2 html giadspD2 himl qi dualian_D21B2DD4 himl qi dualian_D4D22DD3 himl gr duslin 04 video php gr duslian 11132103 himl gradual ion pholo _ pring 04 gs 8 1 quatemala_D91
40. __leinbt ig_thumb jpa michael_laiid jpg michael_hiid_thu mb jpg michael_winship jpg michael_winship_thumb jpg Linez_thumb jpg mike_millaid jpg mike_millaid_thumb jpg maigan_memillian jpg nathan_ensmengel jpg Listing of iSchool_Web Page 11 nalhan_ensmengel png nathan_ensmengei_thumb png nicak_iabichaux jpg nicak_iabkhauxthumb jpg awen_m nally j pg awen me nally ise png awen_m nally_thumb jpg pal_gallway jpg pal_galhwey_thumb jpg palieihenbach jpg palieichenbach_thumb jpg pauLaumenyan jpg pauLaumeiryan_thumb jpg pauls_day_thumbjpg phildaty jpg phildaty_thumbjpg quinn_stewsiL jpg quinn_slewsil_thumb jpg tandalph_bias jpg tandalph_bias_thumb jpg tandy_linch jpg tandy_linch_thumb jpg tebecca_eHer jpg tebecca_eMei_thumb jpg tebeccah_hill ipg tebeccah_hilLthumb jpg tan_pallack jpg tan_pallack_thumb jpg sam_buinsjpg sam_burns_thumb jpg iaga jpg saahan iaga_thumb jpg saish_kim jpa saish_kim_thumb jpg scallieeve_thumb jpg s hane williams jpg s hane_williams_thumb jpg sharan_lawcell_thumb jpg sheilasiegkiipg s heila_siegkei_thumb jpg s hirley lukenbill gil s hirley lukenbilLihumbail s lan gunn jpg s Lan_gunn_thumb jpg s lephanie_lawery jpg s lephanie_towery_thumb j pg sue_mui phy j p3 s ue_mu phy_thumb jpg s vellen ada ms jpg s uellen_adams_thumb jpg susan_leinandes jpg susan_lei nandes_thumb jpg 27 la nya kemenl ipg
41. a_applicatian pdl la_applicalian_al doc caps lane_3 BBL_lag dac capslane_piesentatian_guidelines cand ilianal_admissians_lai m pdl canlinuing_sludenlappdac canlinuing_sludent_app pdl easda eas pdl laculty_mechanicalLtuik_puichase pdl laculty_tiaveLgiant pdl laculty_ti ave Liequest dac laculty_u ave Liequest pdl Gender Equily Cammilment pdl index html index php index php indiwidual study dac individualLstudy pdl INFSBBR_evaluatian dac manual dac Listing of 5 Page 6 supeiis a inla superis a inla 2012 4 isata_laim pdl lic_applicalian dac lic_applicalian pdl masters epail guideda masters guide new_sludent_app dac new_sludent_app pl atientation_lalL2D10 dac pep prapasal 3 12 1ew11 12 dacx pep piapasal 3 12 1ev11 12 pdl pep piapasal 3 12 dacx 3 12 41 pep p apasal 7 12 dacx pep p apasal 7 12 pd1 pep p apasal nav12 dacx pep p apasal nav12 pd1 pep_piagiam_evalualian dac pep_piapasal dac pep_prapasal pdl pep_piapasalLwarksheel dac pep_sampk_agieement 7 12 dac pep_sampk_agieement dac pep_sampk_evalualiandac pelilian_tiansle dac pelilian_tianslei pdl phd _advis ing_laim dac phd _advis ing_laim pdl phd _applicatian dac phd_applicatian pdl phd_handbaak dac phd _handbaak pdl phd _p ag am_desc phd _p ag am_desc pdl
42. aak jpg daug_aaid jpg exhibiL h ll ipg exhibiL hall jpg FicalHist gil GiaduatianAnimatian gil gialeluLdead png health_inlaimatics_caver jpg haave jpg iasummiljpg iGive_laga png immiath_giaupl jpg immiath_giaup2 jpg immiath_giaup3 jpg immiath_giaup4 jpg immiath_health_textbaak jpg 17 inlarmation_inslitule png ischaaLililas jpg james_howisan jpg james_hawisan_ied_s mall jpg ci ipg lezia bakal pg libby jpg lincaln_bible jpg laniedang jpg maty_tynn jpg memillan_buin_DDl jpg memillan_buin_ghves jpg memillan_buin_giaup jpg nancy_ealan jpg nathan_ensmengel jpg panishjpg recave iy jpg tice live ty jpg tice live ty_news p yaa_piaclamalian ipg s Ludents_with_boaks jpg s Ludents_with_boaks_laige jpg tanya_c kemenL jpg lhiee_deans jpg TiibalBoakCaei jpg turnbullnews jpg updegiave_news jpa wathal png whilewhak_games jpg william_as pi ay jpg index php nens archive ad himl alise_D1212DDS php alumna_cavei_11D42DD4 php 4 09212005 alumnidinne _02102003 himl alumnidinne _09232003 himl aichives html asist_112 22004 php baseball_OS152005 php belinda_n_andiew jpg bias_ibm_01272004 himI bias_ibm_DS2720D5 php bias_jains_ul_61252003 him boak_les Lalumnidinne l1 05200 boak_les Lalumnidinne _ltext himl capslane_D4142005 php capslane_laii_OS142005 php caps lane_piesentatians
43. acal ine appdac log dac mentai log pdl mentat_applicatian dac mentat_applicatian pdl men la _kag mentai_kg pdl mentat_piagiam php Sile Males 1 1 anline_jab_sauices php anline_iesume_iesauices php 19 pas jabs php pas Liesumes php public_jab_seaich php lesume_assislance php tesume_delails php lesume_iesauices php tesumes php s pdl lesumes_caverleLlers pps sabrysurveys 2 DD3lall pdl 2 DD3lall_elee dac 2 DD3s pring pdl 2 DD3 piing_elec dac 2DD3summer pdl 21D3 ummer elecdac 2 bMs pring_ekedac ischaaL2BD2giads pdl ischaaL2DD2giads ppt ischaa L2DD3g d pdl ischaaL2DD3qiads ppt ischaa L2DDbig r d pdl ischaa L2DDig d ppi ischaaL2DD7qiads pdl ischaaL2DD7qiads ppl ischaaL2BDEgiads pdl ischaa L2DDEg r d ppix ILsakuysuivey_2DD2qgi ds pdl 200241 ILsalaysuvey_20D3giads pdl ILsalarysumey_20D3giads ppt ILsalarysumvey_20D4giads pdl Lssalary u vey 20049 ads ppl ILsalarysutvey_2DD5giads pdl ILsalarysunvey_20D5giads ppt ILsalarysuivey_20DGgiads pdl Lsalary u vey 200641 Lsals y survey 2017q d pdl ILsalarysumey_20DBgiads pdl ILsalarysuivey_2BDEgi ads ppix salary_suiveys php schaal_jab_seaich php seivice_suppliei_jab_seaich php s ludent_evaluatiandac s Ludent_evaluatian pdl s Ludentseimwies php s ludentevallT dac s LudentevalUT pd lech_jab_seai
44. adata into an aggregate archival file This metadata includes a unique identifier for each contained file as well as a digital hash to show that the record hasn t been altered during the harvesting process The metadata included in the WARC format helps to ensure the integrity and authenticity of the archived content Unfortunately troubleshooting with Web Curator Tool remains difficult The system according to developers is not designed for installation by end users that s what we have system administrators for which means that instructions for installation are designed with a System Administrator in mind not with a user who lacks familiarity with Java based programming Troubleshooting errors is primarily the responsibility of the user as is the installation and proper configuration of all of the component 1 The Web Curator Tool Project Web Curator Tool accessed 28 April 2013 http webcurator sourceforge net 2 ISO 28500 2009 WARC file format http Awww iso org iso iso_catalogue catalogue_tc catalogue_detail htm csnumber 44717 3 Hanzo Archives Learning Center WARC Files http Avww hanzoarchives com learning warc_files WCT presentation by dcc co cuk software pieces required to use Web Curator Tool PostgreSQL or mySQL Java and Apache Tomcat Documentation for resolved errors is also limited and generally assumes a familiarity with the software that an end user would not typically have As a result
45. aid32 png ba d4 gil baidd png baidB gil ba dE png ba el gil baiel png ba el gil ba el png ba e2 gil baie2 png ba e 32 3il baie32 png bared gil Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 AF taihnd E prot baied png baieBgil bareB png baill gil baill png 1116 911 baill png bail2 gil bail2 png barl32 gil bail32 png bail4 gil baild png bai lB gil balB png baigl gil baigl png baig16 3il ba gl png baig2 gil baig2 png baig32 gil baigi2 png baig4 gil baig4d png baigB gil baigB png baihl gil barhl png barhl gil ba hl png baih2 gil ba h2 png barh32 gil barh32 png barhd gil bahd png baihB gil bathB png baril png baril ail baiil6 png bari2ail bari2 png bari32ail bari32 png baid gil barid png bariBail baijl gil Listing of iSchool_Web Page 8 baijl png ba j1 gil ba jl pna ba j2 gil baij2 png baij32gil baiji2 png baridail baij4 png baijBgil baijB png bendy gi Lliant jpg boak_aH jpg boaks_yelhw jpg bay jpg cards jpg careei_senvices jpg caree gil careers _sel gil ma use jpg gt campuling gil campuling_sel gil ses gil caurses_sel gil discussian classiaam thn jpg discussian gil discussion_sel gil dodge laculty_ieseaich jpg Dow nOnGiaund jpg email gil equipment digilaLcameia_laige jpg digila_L
46. ameters for another crawl One important aspect of HTTrack to be aware of is the presence of the hts cache directory This directory keeps the original bits and headers data that were downloaded from the website whereas the browse version was modified to be viewable locally The cache should be kept to maintain the authenticity of the website and for preservation purposes while the browsable files are available for accessing the site offline For the purposes of our project we wanted people to be able to view and navigate the iSchool website as it was before the redesign in March 2013 HTTrack was a solution that was easy to implement and fit our specific needs However it does seem to be designed for capturing a small number of websites since multiple sites harvested together are put in the same folder Therefore it is a useful tool for small archives or others wanting to do web archiving on a small scale Larger institutions wanting to capture many websites may want to consider using Heritrix or tools that are built on Heritrix such as WCT as it is specifically designed to handle large scale web archiving As one final attempt to save a copy of the website in the recognized ARC format we explored using HTTrack2Arc a Java based transformation utility created by the Portuguese Web Archive that turns the results of an HTTrack crawl into an ARC file Upon first glance the tool seemed simple to install and run from a Linux command line with wh
47. ap j pa ula_listtlaai_mapsm jpg ula_small jpg web gil web_iequesls jpg image s 2005 _noles campuling gil mna hame_abautgil mna hame_abaulsel gil mna ha me_ad miss ans gil mna hame_admissians_sel gil mna hame_caieets gil mna hame_caieers_sel gil mna hame_campuling gil mna hame_campuling_selgil mna hame_cauises gil mna hame_cauises_sel gil mna hame_peapk gil mna hame_peapk_sel gil mna hame_pragiams gil mna hame_pragiams_sel gil mna hame_teseaich gil mna hame_eseaich_sel gil mna nav_bg gil mna abautgil abaulsel gil Listing of iSchool_Web Page 12 admissians gil admissians_sel gil alumnijpg analog analaga gil analaga png baial gil baial png 1 il baialfi png baa gil bara2 png baia32 gil bars32 png baad gil baiad png ba a Egil baibl png baibl gil babl png baib2 gil baib2 png barb32 gil baib32 png barbd qil barbd pna bai bB gil babb png baicl gil baicl png baicl6 gil baicli png baic2 gil baic2 png baic32 gil baic32 png baicd gil baicd png baicB gil baicB png ba dl gil baidl png ba dl gil 416 baid2 gil baid2 png baid32 gil baid32 png baid4 gil baidd png 28 baidB gil baidB png baiel gil baiel png ba el gil bare 16 png bare2ail bare2 png baie32 gil baie 32 png baredail bared png baieB gil bareB png baill gil baill png baillG gil badl6 png bail2 gil bail2 png bail32 gil barl32 png bail4 gil
48. ate School UT University 2014pBigit 20 taihnd prot 21 Listing of iSchool_Web Page 5 tussian inc php s avak ine php skivenika inc php s panish inc php swedish inc php u adilianal_c hinese inc php manth php manth_ischaal php pieleiences php README rss index php iss php 1 51 1 2 xz ml gil seaich php 5 default admin tpl calendai_nav tpl day tpl delaultcss laa ler tpl header tpl he dei lpi images manth_medium tpl manth_small tpl pieleiences tpl print tpl iss_index tpl seaich tpl search s debai tpl y debar_year tpl wee k lpl green admin tpl calendai_nay tpl day tpl delaultcss euai pl laaler tpl header ipl images manth tpl manth_lsige tpl manth_medium tpl manth_small tpl pieleiences tpl print tpl iss_index tpl seaich tpl search bax semina tpl s ideba tpl tada tpl week tpl yesi ipi grey admin tpl cake ndai_nay tpl day tpl delaultcss tpl images manth tpl manth_laige tpl manth_medium tpl manth_small tpl pielerences tpl print tpl 1ss_index tpl seaich tpl seaich_bax ipi seminat tpl week tpl yeartpl ischool header iple admin tpl calenda
49. ation and groundwork for the present project Following interviews with iSchool staff this group decided that their primary goal in archiving the website was to preserve its informational value as a historical record of the iSchool and in particular of the ways in which the school presented itself online Because of this they chose to store all dynamic content as static HTML pages for ease of preservation and access the original code behind the site was not needed to meet the needs of their designated community Intellectual property IP was a prominent concern for this group Two particular elements of the website raised IP concerns student created technology tutorials and licensed images They determined that tutorials created by student employees in the IT lab were the intellectual property of the creators and they therefore worked with IT staff to implement procedures by with lab staff gave permission for their materials to be archived They also discovered that many of the images used on the website were not owned by the School but instead were licensed for a specific period of time from a third party This caused considerable concern for the group who raised the issue of these images with the iSchool administration but the images were ultimately included in the archived version of the site The 2005 group explored two options for crawling the website HTTrack and Heritrix They ultimately decided that Heritrix was too difficult to use an
50. cameia_s mall jpg iaam_miiaphane_lage jpg _ _ mall jpg faculty adiian_thumb jpg barbsra_immiath jpg bai bsia_immiath_thumb jpg bias_thumb jpg blukenbilLlull ipg blukenbill thumb jpg catal_canean jpg eatal_caiean_thumb jpg chela jpg lt chidestei jp3 chideslte _thumb j pa cisca_thumb jpg clayman_thumb jpg 24 cavinglan_thumb jpg cunningham kiuppa_thumb jpg davis_lulljpg davis_thumb jpg dillan_thumb jpg daty_lull jpg daty_thumb jpg jpg eidelez_daty_lulljpg eidelez_daly_thumb jpg eidekz_lull jpg eidelez_thumb jpg launtain jpg gallaeway_lull gil gallaway_thumbail gary_geisles ipg gary_geislei_thumbjpg gile jpg g y_lull jpg giacy_thumb jpg grelchen_hallman ijpg gielchen_hallman_thumb jpg hallma k_lulljpg hallm s k_thumb jpg haiman jpg harman_lulljpg harman_thumbjpg heath jpg heath_thumb jpg hawingtan_lull jpg haw ingtan_atiginall jpg haw inglan_thumb jpg immiath_lull jpg immiath_thumb jpg jansen_thumb jpg jae_sanchez jpg jae_sanchez_thumb jpg lance_hayden jpg lance_hayden_thumb jpg latieneiay_lull jpg la iene_ ay _thum b jpg luis_lianciscarevills jpg luis_hanciscarevilla_thumb jpg megan_wingeLipg megan_winge thumb jpg mei sky me sky_thumb jpg miksa lull ipg miksa_thumb jpg OLDbar baia_immiath jpg awens_lull jpg awens_thumb jpg Macintosh HD Users jarredwilson Documents School Graduate School UT University
51. ch php updsles acade mic_jab_seaich php Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 20 Brood assacialian_jab_seaich php interns hips php lech_jab_seaich php UTMentar Inla2DD9 pd1 UTStudentipp2D DS dac we rkshop_downicads dac careei_lai_tips pdl ciealing_iesumesdac ciealing_iesumes pdl warikshaps php courses advising php class_details php class_details_lestphp classes php couis es_lisL php care inl3 B2c_lall_200B php 872 2006 caurse_desciiplians php se delail php se_numbe ing qil se_ alalian php ses php help php index html index php index php inslruclion la_re sources exam pk_caunse_pages zip images ischaal_cau se_ a dac lacal ine lacal ine msis_cautse_quidelines php phd_methads_cauises php wise php wise_classes php development advisary_cauncil php burstipg discavery_lund php endawmentlist php endawments php images noles bkmaik jp3 calendai png Listing of iSchool_Web Page 4 camp ipa ll g jp3 Inttwiagil giadS4 jpg haak jpg ican gil suppaitgil lexas_giving gil lexas_giving jpg yellaw_weave gil index php inlarmalian_saciely php lacal ine sludentgwing php suppdilail sUpparlers php theim jpa the imbar
52. chiving the back end of a website and creating a virtual machine in which an archived version of the site can be run is one potentially fruitful approach to the challenge of archiving dynamic content For future endeavors the amount of work necessary to undertake this task and the IP and security concerns involved mean that further refinement will be necessary before virtualization can be used as a wide scale web archiving approach One area where our group closely mirrored other groups attempts was getting mired in the difficulty of installing an open source web crawler past groups attempted to use Heritrix but were unable to obtain a comprehensive automated crawl of the website whereas our group attempted to install and use Web Curator Tool with the expectation that the user interface would allow for an easier crawl Unfortunately open source does not mean user friendly and a significant portion of our group s time was spent attempting to install software that is ultimately not designed for end user installation Future web crawling efforts should in our recommendation make use of Web Curator Tool provided it is still the most user friendly and comprehensive open source software available however they should also involve a System Administrator as early as possible in the process Web Curator Tool is user friendly after its 15 installed but the involvement of a System Administrator is crucial to a successful install and configuratio
53. d opted for HTTrack which requires much less configuration and is operated through a GUI As will be discussed below our group had a similar experience suggesting how little has changed in the field of web crawler technology in the intervening 7 years One of the implications of using HTTrack rather than Heritrix is that the crawl produces a complete copy of the website s file directory rather than a single file The 2005 group found this to be a significant problem for ingest and access in DSpace as they tried to ingest each file separately This meant that ingest and metadata creation took up a significant amount of their energy Perhaps more significantly they found that DSpace offers limited support for viewing HTML as it is not able to maintain the links between different pages and image files This points to a larger issue of access within web archiving one which our group has attempted to solve through the use of a virtual machine A second group of 392K students worked at archiving the iSchool website in 2006 including Janice Carter Kyong Rae Lee Carlos Ovalle and Nikki Thomas This group approached the task in a different way than the previous project They decided their primary goal in archiving the website was to support the records management needs of the School and therefore focused their efforts on preserving only those parts of the website that constituted official record material This allowed them to avoid many of the more
54. deCampkle issue aichive tpl cunentipl issue tpl view Apl view Page laye ul Edi lor active tpl campkted tpl index tpl na ide 1 submission submiss ian tpl manager emails files impor lex port index tpl languageSeLtings tpl seclions selup subscriplion proolresder 1 lt ampleted tpl navsidebal tpl submission submiss ian tpl reviewer aclive tpl Macintosh HD Users jarredwilson Documents School Graduate School UT University 2014pBigit Attend E prot campkted tpl index tpl navsidebar 1 submiss ian tpl biz ipli caplurecite tpl cileEndNaletp cilePiaCcile tpl c ileRe let ence 1 laater tpl header tpl index html meladala lp plinteFiiendty sent tpl suppFiles tpl suppFileView tpl radmin index jautnals seltings tpl validate 1 1 weisians tpl search advancedSeaich tpl authai Details 1 authailndex tpl searchResulls tpl lille Index seclionEdi lor index navikle bal 1 mme ndalia n seachlseis tpl sekectReviewer ipl sekecil er ipi selDueDale ipi submission s ubmiss ian tpl s ubmiss ianEditing tpl s ubmiss ianEmaillag tpl s ubmiss ianEmailLag
55. difficult IP questions faced by the 2005 group Although they preserved the website for different reasons than the 2005 project both groups determined that the necessary significant properties of the website included only the information displayed and the look and feel of the site neither project attempted to archive the back end PHP code or database structures that ran the site The 2006 group installed and used Heritrix to crawl the site Because of the problems in ingesting the materials created by HTTrack as discussed above they determined that Heritrix was a better option Although they were eventually able to get Heritrix running on an iSchool server the group ran into multiple problems during the process of installation and crawling including Java errors and scoping the crawl appropriately This group also put a lot of work into determining the requirements for the archived website materials to be considered authentic They specified authenticity requirements at multiple phases including during the active period of the website during the archiving process and after the materials had been uploaded into DSpace This detailed analysis determined that website records maintained by the School of Information IT staff and uploaded into DSpace with a documented chain of custody can be considered to have maintained their identity and integrity and are therefore authentic This work was extremely helpful for our project as we were able to use t
56. e FSAFSEAFSE6 CA 6C index Lpl p FBAFBSAF8554436 plugins Lpl FBAFB62F862556C3 editai Decisi Macintosh HO Users jarredwilson Documents School Graduate School UT University of Te x ab eprireg 2 20 BE H pret 40 Listing of iSchool_Web Page 25 SMF FOTAFO7RSC1 226 metadalaView ipl php ichaaLaccaunt php AFCA AFCAFCADTSS d ulingQue ue tpl php SMFEAFEQ FEBERSI SSM ubmission iplphp 7836 Lpl php 95217 SSFF A FFF AFFF AAA ipl php lacilities php laculty_and_stall php linanciaLaid php KSFF AFFF AFFF4BFAG MedilaiDecision lpl php 2965 t_conl ig user changePassward 1 email tpl index tpl lagin tpl laginC hangePasswaid tpl lastPasswaid tpl piali tpl tegister tpl legislerSile tpl 10015 d bXML SOL php impa lEx part php includes cliTaal inc php install php migiale php piecampik php rebuikiSesichindex php tunScheduldTasks php upgiade php orienla lion academics php advising advising php checklist pd checklist php 121 laq php index php PelitianTiansle pdl legistiatian php advising_and_iegistiatian php advising_bar_ckaiance_laim jpg agenda_lall_2DDB doc anaw qil buiKding_and_lacilities php caeei_s eivices php compuling buy php cannecl php help php il_accaunt php index php academics gil acade mics_aver gil advis
57. e and table respectively that they can access this specific command allows the user to read edit execute and perform all tasks across all the databases and tables Once you have finalized the permissions that you want to set up for your new users always be sure to reload all the privileges To activate the new permissions issue the following command FLUSH PRIVILEGES Your changes will now be in effect To test out your new user log out by typing quit and log back in with this command in terminal mysql username List user accounts SELECT User FROM databasename user Import into the database mysql u username p h localhost databasename lt data sql Display databases SHOW DATABASES Display tables in database SHOW TABLES IN databasename Display data in table SELECT FROM databasename tablename
58. e_cautses gil hame_cauises_sel gil hame gill ipg hame_kilgailin qil hame_kilgailin_sel gil hame_peapk qil hame_peapk_sel gil hame_pragiams gil hame_piagiams_sel gil hame esesich gil hame esesich sellgil hame_sel gil hame_s pace gil himl2 gil himl2 png icons dacumenticanagil quick lime_ican jpg teakideo_ican gil syllabus gil syllabus_ah gil win media_kan qil inla_addiess jpg iSchaa kailestabautgil kilgailin jpg Listing of iSchool_Web Page 15 list_bullet gil list_bullet2 gil list_bullet3 gil lis bulletd gil list bullet5 gil list_bullet gil live boakmatk_ican gil laqa_2 DDS j pq laga_titk gil mailman laige mailman j pa nav_bg gil navigation hame_abautgil ha me_a ba ul_sel gil hame_admissians gil hame_admissians_sel gil hame_caieens gil hame_caieens_sel gil gil hame_centers_sel gil ha me_cam pu ling qil hame_camputing_selgil hame_cauises gil hame_cautses_sel gil hame_exampk_sel gil ha me_kilga lin qil ha me_kilga lin_sel gil hame_navigalian_lem plate gil hame_navigalian_lem plate psd hame_peapk gil hame_peapk_sel gil hame_piagiams ail hame_pragiams_sel gil hame_ieseaich gil hame_eseaich_sel gil hame_esauices gil hame_esauicesd gil hame_resauices_sel gil README 1 1 search qil lap_abaultgil lap_abauLselgil lap_admissians gil lap_admissians_selgil lap_caieers gil lap_caieens_sel gil lap_cauises gil lap_caurses_sel gil tap_hame gil la
59. empa ary File Torm Faim inc php Faimtl q inc php validation help Help inc php HelpTac inc php HelpTac DAO inc php HelpTa pic inc php HelpTa pic DAG inc php HelpTapicSectian ine php iLEn Lacak inc php install form Install ine php Installer inc php Upgiade inc php issue form Issue inc php ssue clian inc php ssueD36 inc php Jauinaline php JauinalDad inc php JaurnaGetlings DAO inc php HatilicalianStatus DAO inc php Seclian inc php Se tianDAQ inc php SectianEditais DAO inc php mail AaAjticleMaillemplate inc ph p mplale inc php Emaillemplate DAO inc php M il inc php MasifTe mplale inc php SMTPMailei inc php manager form oni format OAL inc php inc php plugins ImpaitEx pai tPlugin ine php Plugin inc php PluginRegistiy inc php Listing of iSchool_Web Page 19 RT inc php RT zimin inc php RT3uucL inc php RTXMLPaiser inc php sched uledTask ScheduledTask inc php ScheduledTaskDAO ine php search AtlickSeaich inc php ArlickSesich DAO inc php AtlickSeaichindex inc php SesichFilePa ser inc php SeaichHTMLPaneiinc php security Rale inc php RaleDAO inc php Validatian inc php session Sessian ine php SessianDAO inc php SesvsianMan gei inc p hp sile Impa lOb1 inc php Site inc php Site DAO inc php Yeis kin inc php Ven kinCheck inc p
60. es index php inlein_teparl php JimThutn_Re tb 1 pd JimThuin_Re par 1202 41 JimThuin_Re par LDD3 pd31 M linalipt pd3l ALi ptl pdl ALi pl2a pd pi28 pd1 ALI pl3 pdl JLi pl4 pdl ALuesid pil Repa LOLL pdl 002 31 1003 pd LauienSueusyand_Pepa DD1 pdl LauienSueuyand_PeparDD2 pdl LindaBaiane_A piil2 DD4 pdl LindaBaiane_Feb2DD4 pdl LindaBaiane_lan 2 DD4 pdl LindaBatane_Mai 2 DD4 pd LindaBaiane_May 2004 41 MelissaBiadshaw_ Nav2DD3 pd3l Melis sabiadshaw_Oct2D03 pdl Melis haw_S pi20D3 pdl NL_ipl2 pdl HL_ipl3 pdl HL_ipl4 pdl HL_iptl pdl RE linal pt pdl RE_iptl pdl RE_ipl2 pdl RE_ipt3 pdl 4 31 SH_linal_ipt pdl SH_ipt2 pdl Macintosh HD Users jarredwilson Documents School Graduate School UT University 22 Attiiand E prot SH_ipt3 pdl SH_ipt4 pdl SH_ipt5 pdl SH_ipt_l pdl ShannanPhillips_RepaitOD1 pdl ShannanPhillips_Repaitl 2 pdl ShannanPhillips_RepaitO03 pdl SR_FinRpt pdl SPR_ipt2 pdl SPipt3 pdl SP_ipl4 pdl SP_iptS pdl SPiptl pdl Wendyk aeme _Ap il2 DA pdl Via ndyKisemiei Jan D D4 pdl Wendykiaemel_lune2DD4 pdl WendyKiaeme _Mai20D4 pdl WendyKiaemei_May2 DA pdl distance_education php dual deg ees php endaisement laimdac endaisement laim pdl general inla php giad_pack php images 9193 iling_pra
61. g baid32 gil ba d32 png baid4 gil baid4 png bard B gil baidB png baiel gil baiel png ba el qil baielf png baie2 gil baie2 png baie32 qil baie32 png bared gil baied png bar ail baieB png baill gil baill png baill6 gil 29 baill6 png bail2 gil bail2 png bail32 gil barl32 png bail4 gil baild png bal lB gil ba lE png baigl gil baigl png ba gl8 gil baig16 png baig2 gil baig2 png ba g32 gil baigi2 png baig4 gil baig4d png baigB gil baigB png bath1 gil bathl png baih16 gil 1 bath2 gil bath2 png bath32 gil barh32 pnq bath4 gil ba h4 png ba hE gil baihB png bail gil batil png ba i1 ail baiil6 png baiz gil baii2 png baii32 il bari32 png baid gil baid png bariBail bariB png baijlgil baijl png baril qil baojl6 png 2 511 baij2 png ba j32 gil Macintosh HD Users jarredwilson Documents School Graduate School UT University 813pr ql Attiiand E prot baiji2 png baridail baij4 png baijBgil baijB png boak_al jp3 boaks_yelbw ipg boy jpg ca ds ipg eivices jpg gil caesis sel gil s mause jpg campulers ipg camputing gil campuling_sel gil caurses gil cautses_selgil discussian gil discussion sel gil Daw nOnGiaund jpg email gil equipment digilaLcameia_laige jpg digilaLcameia_smalljpg taam_mikiaphane_laige jpg _ _ m ll ipg Taculiy adiian_thumb j
62. he authenticity requirements set out in the 2006 report to ensure that our materials maintained their authentic character Our Strategy Based on the examples of previous work our team decided that we want to continue evaluating web crawler technologies in hopes that improvements had been made since 2006 While significant problems with crawling exist mostly of a technological nature web crawling is well established as a methodology in web archiving and we were interested in pushing into new unexplored territory In particular we were interested in the challenge of archiving dynamic web content which is rapidly becoming one of the most important issues for web archiving Previous groups had shied away from addressing this problem as they determined preserving dynamic content and PHP code was not necessary to meet their goals an early meeting with our group and Sam Burns the iSchool s Content and Communications Strategist Sam encouraged us to think about web designers and system administrators as part of our designated community That community would be interested not only in how the website was displayed its look and feel but also in the technical architecture underlying the site The PHP code used to generate web pages contains a great deal of information about how the site is constructed which would be valuable for administrators of future versions of the website Archiving the PHP and databases used to generate the web
63. hp Vers inc php submission author commen copyAssignment cepyedilor edils signmeni edilor Torm ulAssignment layo ulEdiler preclAssignment proolresder reviewAssignment reviewer seclionEdilor subscriplion Cunency inc php Subsciiptian inc php SubsciiptianDAd inc php SubsctiplianType inc php SubsciiplianType DAO inc php tasks 35 RevizeFe mindei inc php lemplate Temp late Manage inc php user form User ine php Dad inc php xml XLHod ine php XMLPa set inc php XML Parser DOMHandiei inc php XML M ilet ine php canlig inc php canlig inc php canliq TEMPLATE inc php dbscripls data did installaml ajs_sche ml upgrade upgiade xml versian xml dec Changelaq COPYING FAQ IMPORTEXPORT INSTALL LICENSE README README CYS README DEV RELEASE release noles ChangeLag 2 D 1 2 0 2 README 2 0 1 README 2 0 2 REPORTING BUGS UPGRADE help cache en_US editarial tac 0D DDDD inc php en_US edila ial 1ac DD 0001 inc php en_US editorial tac DD DDD2 inc php en_US edila ial tac DD DDD3 inc php en_US editarial tac 00 DOM inc php en_US edila ial 1ac DD DDDS inc php en_US editaiial tac DD 0006 inc php en_US edila il tac DD 0007 inc php Macintosh HD Users jarredwilson Documents School Graduate School UT University 2014pBigit 214 prot en_US edilatial tac DD DDDE inc php en_US edil
64. i_nav tpl day tpl delaultcss eventipl eventlist tpl laater tpl header tpl images manth tpl manth_ischaal tpl manth_laige tpl manth_medium tpl manth_small tpl pieleiences tpl print tpl yeajich tpl yeaich_bax tpl s debar ipl s debai_year tpl tada tpl upcaming tpl week tpl yearlpl red admin tpl calendai_nay tpl day tpl delaultcss euai pl he der images manth_laige tpl manth_medium tpl manths mall tpl pieleiences tpl print tpl s3 index ipi seaich bax lpi s debar tpl s debai_year tpl tada tpl week tpl yeartpl admin tpl cakndai_nay tpl day tpl delaultcss erat tpl laater tpl images manth tpl manth_laige Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 AF taihnd E prot manth_medium tpl manth_small tpl pielerences tpl print tpl iss_index tpl seaich tpl yesi ch bax ipi semina lpi sy idebai tpl yea lest him TIMEZONES upcaming php week php yearphp lavican ica forms admin_iakes dac advanced_study_applicatian dac advanced_study_a pplication pdl amazan_mechanical_tuik_iequest pdl award applications tequest_lat_tiavel dac BACKUP cantinuing_student_app pdl new_sludent_app pdl la_app pdl la_applicatian dac l
65. ians ine Shared Ei iaiFunclians inc Shared Functions inc standardisia_table_sorling CHANGELOG txt camman js README Lei s Landa La ble sdiling js slyk css limei himl lablesorler addons pager build js jsminjs min js packer js sebas terli wile File js build xml changelag Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 AF tian prot example ajax himl exam ple e mpiy Lable exam ple ex lending delaults himl exam ple melts heade s htm exam ple meta paise shtml exam pk mela sait list himl exam pk aplian debug himl exam pk aplian digils himl exam pk aplian sait laice himl exam pk aplisn sail key html exam pk a plan sa L lis Lhiml exam ple aplian sait aidei html exam ple aplian Llexl extiactian himl exampk aplians headers himl exam ple pager himl exam ple paisers himl exam pk L igge hlml exam ple liiggers html exam ple widgels himl img index html js jquery latestjs jquery meladata js jque y la js jque y La ble sa lesls assels cell metadala himl checkbax himl cal pan himl dema himl index html himl pa3e html Lhe mes blue 47 48 Appendix C SQL commands These are the commands to create users and databases in MySQL Login to MySQL First you need to login as
66. ich we felt somewhat comfortable after our attempts 5 http www httrack com 5 http e records chrisprom com httrack evaluation fj https code google com p httrack2arc to install WCT Upon attempting to run the program we discovered that it required additional Java configuration that was beyond our skill set Having learned to be critical of even seemingly simple command line operations from our efforts with WCT we decided not to spend a large amount of time attempting to trying to learn enough Java to run HTTrack2Arc While the ARC format would have been desirable as a simpler means of storing a copy of the site we feel confident that the HTTrack files provide the necessary information to document the iSchool website s appearance However rather than attempting to ingest each item individually as the 2005 group did we tarred all of the files together and ingested them as a single object into DSpace Creating the VMDK File Creating the Initial File The VMDK Virtual Machine Disk file was created using Oracle Virtualbox The group began with an attempt to install Virtualbox on the COM2 computer which was successful however running or creating VMDK files was not possible The COM2 computer does not have the features needed to support virtualization so the vmdk file was created in a Mac OS To create the virtual environment we opted for a Linux Ubuntu operating system because the original website content was generated and mod
67. ified using a Linux operating platform The most recent version of Ubuntu 12 04 was downloaded from www ubuntu com and a new VMDK file was created with Ubuntu 12 04 as the operating system After starting the VMDK file for the first time the Ubuntu installer ran and generated the operating system for the Virtual Machine A super user was created and Apache MySQL and PHP were installed using the sudo apt get command in terminal The Apache and PHP installations were both tested by using the localhost address in a web browser before determining that the VMDK file was ready to have the iSchool website files imported and added to MySQL to render the website in the virtual environment 8 The version of Oracle Virtualbox used was 4 2 10 the most recent release at the time of the vmdk file s creation 9 Versions of software installed were mySQL 5 0 95 Apache 2 2 3 and php 5 1 6 all the software versions currently in use in the iSchool s management of the website Importing the Data The website component files were obtained from the iSchool s Content and Communications Strategist Sam Burns on April 4th 2013 Sam who administers the website and is familiar with both its content and structure placed all of the website component files HTML PHP INC JPEG PNG SQL databases among others into a compressed TAR tgz file and uploaded that file to the vauxhall server vauxhall ischool utexas edu in a folder specially created for
68. iig init inc php list lunctions php OLDtem plate php aveilapping_events php sanilize php template php limezanes php useiauth_lunctians php images cancelled gil campkted gil canliimed ail impaitantgil naLcampkied gil phpical laga gil tecurting gil s pe gil tentalive gil valid tss png includes lagin php tada php index php languages altikaans inc php biaczilian ine php bulgaiian inc php calalan inc php czech in php danish inc php dulch inc php engli h inc php ei petaniq inc php linnish inc p hp liench ine php galega inc php geiman inc php hungarisn inc php ilalisn inc php Macintosh HD Users jarredwilson Documents School Graduate School UT University 2014pBigit 20 E prot karean inc php lithuanian inc php nar wegian ine php palish inc php pat luguese ine php tussian inc php s kavak inc php s lavenska inc php s panish inc php swedish inc php tiadilianaLchinese ine php lagaul php manth php manth_ischaal php pieleiences php README reseie_iaam phpald taam_laim ine aams png rss index php iss php tss1 D php 1 2 D php xml gil seaich php 5 default admin tpl cal ndai_nay tpl day tpl delaultcss enai lpl header tpl images manth tpl manth_medium tpl small tpl pielere
69. ilability edit php availability remove php availability set php canlig inc cieate_advisai php database advising_cieale sql edil_advisat php email validation php exclude php exclusian_edit php laale inc heade inc index php lacal ine lagin php lagauL php mailpas swaid php OLDkgin php period _cieale php period _edil php period _ emave php signup php s Ludent_cancel php lest php lestcaakie php laggle_public php view_cal php php careers 2 bds pring pdl academic_jab_seaich php alumnicaieeis php alumnical ez 2 l le php Listing of iSchool_Web Page 3 APL 0 dac APL Mentaiinla2D09 pdl A PL3Sludenl3 pp2 DDS dac aichive_jab_seaich php assacialian_jab_seaich php Blank Ekgidiail careers envices jpg cans liuctian gil e mplayel services php Tiles Biielcase 4B png evaluatian dac eva luslian pdl giaduation_cap jpg hand_shake jpg ischaalLkga png ischaalLhga2 png ischaaLlaga_lexL png laqa jp3 mentai appdac mentai app pdl hg pdl symplicity_ mplayet_header himl symplicity_laater himl symplicily_s Lud ent_heade geneialjab_seaich php gaveinment_jab_s eaich php index php interns hips php inte iew_thankyau_lellets pdl inten iew_thankyau_lllets pps jab_seaich_iesauices php jabseaichsiles him libiary php lis_internatianaLseaich php lis_usa_seaich p hp l
70. ime needed to explore every file and redact every possible piece of sensitive information For both of those reasons we chose to make the two collections that contain materials obtained directly from the iSchool administration the Virtual Machine and Individual Component Files collections closed to public access The materials will be made available upon request and approval by the DSpace administrator who is expected to provide access to any iSchool staff member with a legitimate need to see the files This was not an easy decision for us to make since one of the most exciting elements of the virtual machine strategy was its usefulness as an access tool By closing the 14 Virtual Machine collection we will not have the ability to see if this is truly a useful way for researchers to access archived websites However the tradeoff of restricted access was necessary to allow us to use the files we obtained from the iSchool administration and respect the rights of the creators and subjects of the website Conclusions Future Recommendations Website archiving is still a relatively new field and archives tend to rely on web crawls as system for documenting public internet sites However most web crawlers are still incapable of archiving php code so items such as database queries or unique attributes of a website that require user interaction to render in a browser are still not captured by web crawlers We believe that our strategy of ar
71. includes information about how to interpret the content once it is displayed one way of thinking about it is that Structure Information turns 10 OAIS Model Section 4 2 1 3 12 the bitstream into characters and Semantic Information tells a user how to interpret those characters The Semantic Information we chose to include consists of a list of the programs and their respective versions stored in the virtual machine and used to generate the website within that environment We viewed this as essential information to include so that the materials within the virtual machine could be accurately used and understood additionally this information allows the materials stored as Individual Component Files and their relationships to the website to be better understood Ideally by providing this information about the programs used to generate and host the website the Individual Component Files could be used to re create a functioning version of the website in case the VMDK file becomes corrupted or is no longer accessible We considered including additional Semantic Information especially because many of the archived files are code and we could have included information about how to read and interpret the HTML PHP and other code used to generate the website We decided however that we could rely on our Designated Community to have the necessary knowledge to interpret the code if they needed to The final category of RI is Other Representation Infor
72. ing gil advising_avei qil austin gil auslin_aver gil black_s pacet gil balla m gil cleai_spacei gil campuling gil campuling_aver gil davis_thumb jpg acil ilies gil lacilities_aver gil linaid gil linaid_avet gil Is gil Is _aver gil aigsail args awe qil uues gil tap gil wekame gil wekame_ave gil index php inlia_to_gtaduale_sludies php ischaaLstudents jpg ischaaLtechnalagy jpg lacal ine new_sludent_guide alientatian css atientation_ schedule php schedule php s Ludents_and_aiganizalians php s uvey php technakhgy php lechnakagy php save wekame php peo ple cammi llee s ph p admin qle php alumni php alumni proliles akx_addisan php 41 hershey meyeiphp anuLnanavali php chai_baath php daug_sluail php elly_stevens php images akex_addisan jpg akx_meyers ipa nanaali j pa chai_baath jpg daug_stuail jpg elly_slevens jpg jade_andeisan jpg jin_wu jpa Kelsey_Sigiid jpa Kel ey_Sigiid_news jpg kijana_knightjpg lisa_schmidL jpg melanie_calieH jpg jade_andeisan php kijana_knight php lisa_schmidL php melanie_calieK php sigtid_keley php wu_jin php alumnipraliles js alumnipraliles php alumni pialiles swl alumni 2282005 alumni_updates_D3D12DD6 php alumni_updates_D6D12DD4 php alumni_updates_D62D2DDS php alumni_updates_DS242DD4 php alumni update _11DS2DDS php alumniupdates_index php cammilLtees php ca mmi llees _les L php laculty php lac
73. ject_laga gil pawepaint ican gil index php is_sampk_piagiam lacal ine masters aieas php candidacy php cunkulum php cunkulum php endaisement laimdac index php mas le s_p inlable php pes larqz phd php phj_annual_ievie _2DD6 dac phd _annual_ieview_20D 6 pdl phd_handbaak pdl specializmlions cas php index php ssk php research _noles cdll_piagiam php Listing of iSchool_Web Page 27 digilaLlibiaties php lunded_iesesich php health php iadu php cun enli esesich php iadu_dactaial php iadu_laculty php iadu_giant_piapasals php iadu_masleis php iadu_ieseaich_prajects php images iml kiga gil ipLkga gil index php iwig php jauinals php lacal ine publications php teseaic h a palsaza 10282D03 him reseaich a palhaza 1D 2B2DD3 P PT 2 10262003 blank_nales him endshaw him lilelist xml 16125 backg dund gil mas le125_imageDD1 gil mas le125_imageDD3 gil mas le125_imageDDS gil masle125_imageDD gil mas le125_imageDDS gil mas le 25_imageDll gil mas lei2 _imageD1l2 gil mas le125_imageD13 gil mas te 25 im geD14 gil 1 129 15 1 mase 12 9_image D17 qil mastei29_im ge D1B gil mas te 25 im qeD18 qil 1 29 2 D gil mase 2 9_image D27 qil mastei29_im ge D28 gil mas le125_imageD29 gil mas le 25_imageD30 gil mas le125_imageD31 gil mas le 2_i mage 032 qil mas
74. jpg therma php updstes index html ways give himil wayslagive php events admin AUTHORS caLtesthiml 5 ewentlist ine gelcalendais csh geLcalendats csh geLcalendais lag gelLcalendais_tes tosh gelcalendais_tes Lesh gelcalendais_tes 1 09 ischaal ics publish php publish_log Let upcaming_events ine upcaming_evenls inc canlig inc disL php canlig inc php canlig inc php COPYING day php delaullcanlig php enai php evenllist php Tunc Lio ns 20 admin_lunclians php calenda _lunclians php dale_lunctians php d ise lunclian php ical_paiser php inil inc php list_lunctians php OLDtempisle php aveilapping_evenls php sanilize php lemplale php limezanes php usersulh lun lin php images cancelled gil campkted gil canlitmed gil impattantgil nal_campkted gil phpical laga gil ecu inggil lentalive gil valid 1ss png includes evenl php lagin php lada php index php languages alikaans ine php biazilian inc php bulgarian inc php calalan inc php czech in php danish inc php dulch inc php english inc php spetanta inc php linnish ine p hp lrench inc php gakga inc php geiman inc php hungatian ine php ilalian inc php japanese inc php katean inc php lithuanian ine php narwegian ine php palish inc php pat Macintosh HD Users jarredwilson Documents School Gradu
75. le 2_i mage D34 qil mas le125_imageD35 gil mas le125_imageD36 gil mas lei29_imageD3 gil mas le125_imageD3B gil masle12S_imageD3 gil mas le125_imageM40 gil 43 maste 2 9_im ge D41 gil may te i29_im ge D42 gil mastei29_im age D43 gil masle 25 im ge D45 gil masle 25 im e D48 gil mas te 25 im qe 047 91 masle 25 im ge 045 911 masle 25 im geD51 gil mas lei2S_imageDS2 gil mas lei25_imageDS4 gil maslei25_imageDS5 gil masle12 _imageD57 gil mas le130_imageD1l6 gil mas le13D_imageI4B gil navigalian_bai nex nex disabled gil aulline_callaps e gil aulline_callapsed him aulline_ex pand gil aulline_ex panded him aulline_naviga lia n_ba pievactive gil prev disabled gil slide DD51 him s lide DDS 1_b kqgiaund gil slide DD53 him s lide DD54 him s lide DDS4_im age DM gil s lide DOSS him s lide DDS S_b kqiaund gil s lide DDS 6 htm s lide DOS G _backgiaund gil s lide DD5B him y lide DDSB image DS D qg il s lide Dbifi him s lide DOG G_image DD6 gil s lide DD71 him s lide DD73 him s lide DD74 him s lide DD7i him s lide DD76_b kgiaund gil s lide DD77 him slide DD7E him s lide DOF B_backgiaund gil s lide DOF B_image D44 jpg s lide DO7S him s lide DDED him s lide DDE2 him s lide DDE3 him s lide DDE4 him s lide DDES Macintosh HD Users jarredwilson Documents School Graduate School UT University 813pr ql Attiiand E prot
76. lic interface for the website we began our work researching web crawlers with the expectation that Heritrix would ultimately be the appropriate choice for archiving the website After researching additional options along with user documentation indicating that Heritrix did not have a user friendly interface and that generating automated crawls was a challenge we selected Web Curator Tool Web Curator Tool described as an open source workflow management application for web archiving is the result of a collaborative effort between the British Library and the National Library of New Zealand Web Curator Tool is built on a Java based platform and combines several web crawling technologies such as Heritrix and NutchWAX in order to create a user friendly open source system for automating web crawls Web Curator Tool is run in a browser using Apache Tomcat as a server and is therefore much easier to engage with than Heritrix which uses terminal and command line functions to automate and complete crawls The latest versions of these applications output WARC Web ARChive files the successor to the ARC format and a new international standard format developed and maintained by the International Internet Preservation Consortium IIPC Part of our decision to use Web Curator Tool was the desire to generate archival quality web crawls in this standard format which combines the original web content exactly as it was captured along with met
77. m Documentation collection is particularly important as it provides a place to provide additional documentation about the materials that we created and how they can be preserved and accessed Representation Information We were very aware during the process of creating the virtual machine that all of our work to create the VMDK file would be worthless if we were not also able to ensure that the file would be accessible over the long term Accordingly we used the OAIS Reference Model as a means to think about what types of additional information would be necessary to ensure that the file would be usable in the future Representation Information in the OAIS parlance We archived this material in the 2013 Website System Documentation collection The OAIS model defines three types of Representation Information RI Structure Information Semantic Information and Other Representation Information The first of these categories Structure Information includes information that helps turn a bitstream into meaningful information usually provided as a file format specification The majority of the files we ingested are in common file formats TXT PNG PDF etc and we determined that those specifications are adequately documented elsewhere The VMDK file however is in a slightly more unusual format so we opted to include a copy of the format specification as part of our archived materials The second type of RI is Semantic Information which
78. mation a catch all term for additional documentation that enables the archived materials to be understood and accessed In our case we determined that instructions about how to view the VMDK file were necessary since virtual machines and the software used to access them require some specialized knowledge to use In the System Documentation collection we included a user manual for the Oracle VM VirtualBox program that we used to create and view the VMDK file This is not the only program that could be used to view the archived VMDK file but it is one free open source option In addition to this user manual we also created a set of instructions for viewing the VMDK file using VirtualBox and saved those instructions in the Virtual Machine collection alongside the VMDK file itself These instructions were intended for users with no experience using virtual machines and include step by step instructions as well as screenshots to ensure that the VMDK file is accessible even if the specialized knowledge of how to run a virtual machine is not available Open Closed Collections 13 One of the important implications of our choice to archive all of the files used to generate the iSchool website was the effect on who should be allowed to have access to the archived site In early meetings with Sam Burns he expressed some concerns about the security of the website being compromised if the PHP code and database structures were made available One importan
79. n of the software In keeping with the OAIS model it would be ideal if all materials saved and used in a future vmdk file could be kept open access The individual website components such as database structures HTML and CSS files remain the intellectual property of the website developer and future endeavors should more closely examine password protecting sensitive information within the VirtualBox environment The creation of the VMDK file itself is contingent upon access to not only the publicly visible aspects of the website but all of the additional materials required to render the website in a browser Since access to future endeavors might be limited to only publicly visible aspects of the site the creation of a VMDK file or the virtualization of a website may not be a practical choice for future web archiving efforts One element of this proJect was an intentional duplication of documentation preserving the website in multiple different formats e g screenshots crawls and a virtual machine As the web archiving community matures it would be advisable for best practice guidelines to be established as to what is necessary and sufficient documentation of a website Duplication of documentation is understandable in a project such as ours since we were intentionally trying to determine the best methods for preserving a website However for web archiving to accomplish its goal of creating a system to comprehensively document the Internet the
80. nces tpl print tpl 1ss_index tpl seaich tpl sesi ch bax ipi s debar tpl s debar_yeai tpl Listing of iSchool_Web Page 29 week tpl green admin tpl calendai_nay tpl day tpl delaultcss etdi ipi eventipl laater tpl header ipl images manth tpl manth_lsige tpl manth_medium tpl manth_small tpl pieleiences tpl print tpl tss_index tpl search tpl seaich_bax tpl seminat tpl wee k lpl yesi pi grey admin tpl calendai_nav tpl day tpl delaultcss enat tpl laaler tpl images manth tpl manth_lsige tpl manth_medium tpl manth_small tpl pieleiences plint tpl iss_index tpl seaich tpl seaich bax ipi seminat tpl tada tpl week tpl ischool admin tpl 45 calendai_nav tpl day ipl delaultcss delaulicss 20DS0R24 euai pl images manth tpl manth_ischaal tpl manth_laige tpl manth_medium tpl manth small tpl OLDupcaming tpl pielerences tpl tss_index tpl seaich tpl search_bax tpl y debar tpl s debai_year tpl tada tpl upcaming tpl week tpl week lplaiig yeartpl red admin tpl calendai_nav tpl day tpl delaultcss laater tpl images manth tpl manth_medium tpl manth_small tpl pielerences tpl plint tpl tss_index tpl seaich tpl seaich_bax
81. ndbs ine phy reviewer index php Revize e Handiei inc php SubmissianCammentsHandker inc p SubmissionResiew Handlei inc php index php RTHandierinc php radmin index php minHandles inc php RTCantextHandie inc php RTSeaichHandles ine php inc php RTver sk nHandiei inc php search index php SeaichHandlet inc php seclionEdilor index php SectianEditarHandke ince php SubmissianCammentsHandk inc p SubmissianEdilHandiei inc php user EmailHandlei in php PialileHandlei inc php Regis l alan Ha ndle inc php UserHandler inc php plugins impor lex port erudil nalive sample users public index html journals L 2 sile regisiry jauinalSellings xml lacales x ml scheduledTasks x lt ml s lapwaids ixt en_US xml AiLAichilectuie xml Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 Attiiand prot Astiaphys ks xml Bialagyxml Business xml Chemistiyaml Cag nitive Sience Science xml Ecanamiks xml Educatian x ml Enviranment x ml GeneialLScience Humanilie x Lile_Sciences x ml Mathematics x ml Physics xml SacialLSciences xml il versian did styles iew css cammentscss camman css help css 5 abo ul abaulThisPublishingSystem Lpl
82. p ene s_apiiL20D6 php enew s_apiiL20D7 php enews_apiilL2DDB php enew s_augusl_2DD3 php enews_dece mbe1_2 DDB php enews_lebruary_2DD5 php enews_january_20D3 php enews_july_20D6 php enews_july_20D7 php eneas_july_2DDB php 2004 enews 2005 enews 2009 ene es_maich_2DD3 php enews mach 2004 php enews 2003 enews_navember_2003 php eneas_navember_2005 php enews_actabei_20D6 php enews mbe 2004 php images 1616 jpg Cammence ment JPG Diane Bailey png Dan jpg Face baa ki mage gil iSchaallaga png lecia_barket jpg lincaln_bible jpg Miles jpg mith JPG Fal mith JPG TexasA ichiveal themavingimagelaga jpg UTNals jpg WeShallRe mainlaga png yan_zhang jpg yanzhang jpg index php visian php latums keywaid php latums php histary php images abaul_quale gil dillan_thumb jpg index php ischaaLvisian jpg lacal ine lacatian php logos index php laga_hng jpg laga_tall_datk ipg mave php new_index php nens CHAFF UTDiiectHewsleed dl images 2011 tets jpg 4 D3 jpg alumni de 04 alumni dc DE jpg alumni de DS jpg andiadDD1 jp3 and q di DbD2 ipg and qkdDD3 ipg and qdDD4 ipg andiai DDS jpg archives ipg anabs_de_aia jpg as piay_everyday_inlaimatian jpg auslin_his lary png bist baqk ipg bias_image ipg blukenbill news jpg deb_peel ipg dillan jpg dillan_b
83. p_hame_sel gil lap_nay_lemphle ai 31 lap_peapk gil lap_peapk_sel gil lap_piagiams gil lap_pragiams_sel gil lap_ieseaich gil lap_ieseaich_sel gil lap_iesauices gil lap_iesauices_selagil new_animatled_lga gil nica ler jpg ald ca eti OLDeaeeisenvices jpg OLDtitle gil alangeigil page_spice ammie_giace jpg IDD1 jpg bendy_git IDD2 jpg bendy_git IDD3 jpg ID jpg ICDS jpg bendy_git 1006 bendy guyDDl ipg quyDD2 ipg bendy_guyDD3 jpg bendy jpg 05 c hikd y lapla p ipg https lane jpg immiath_class jpg lab_hamegii lj pa lea ping_git Lipg atiginaLhamegil jpg ran ammie_blui jpg ammie_gixeDl jpg ammie_giaceD2jpg ammie_giace D3cul jpg ammie_gixe Dd jpg gi ki ipg immiath_classB1 jpg s hane_listO1 jpg s hane_listO2 jpg teslaialian_ghyves jpg pes image gil people andiew_whinstan jpg andiew_whinstan_thumb jpg ann minnei jpg _ aw jpg ann minnerlhumb ipg apiilsmithjpg Macintosh HD Users jarredwilson Documents School Graduate School UT University 221 Attiiand E Brood aprilsmith_thumb jpg ba ba a_imm a Lh j pa baibaia_jansen jpg barbara_jansen_thumb jpg bellie_meginness jpg bellie_meginness_thumbjpg bilLandetsan jpg bilLande san_thumb jpg cailaciine jpg
84. pages also posed interesting questions about access and how future users could interact with the site It is possible to simply ingest the text files containing the code and SQL commands used to generate the needed databases into a digital repository and this became part of our ultimate archiving strategy However those pieces on their own would not provide a functioning version of the website that users could view and interact with To do that a system would need to be provided that housed all of the necessary files and software together and maintained the correct relationships between the pieces Again acting with Sam s advice and support we decided to attempt a virtualization strategy to preserve a working version of the website We built a virtual machine using Oracle VirtualBox and placed into it all of the necessary files and directory structure as well as software finally saving all of that together as a single VMDK Virtual Machine Disk file Our initial theory was that this would solve both the problem of how to archive PHP and dynamic content and the issues of access and storage faced by earlier groups While creating a functioning website inside a virtual machine was not as easy as we had hoped and we had to face issues of security and IP our virtualization strategy seems to have been largely successful Web Crawling Documenting the Public Interface Based upon previous groups attempts to use Heritrix to crawl and archive the pub
85. pg gwizdka jpg jacek_gqwizdka_thumb jpg jacqueline_peery ipg jacqueline_peery_thumbjpg james_howisan jpg james_howisan_thumb jpg janele_dupantjpg jen maaie jpg jen_maaie_thumbjpg jihyun jpg jihyun_pa k_thumb jpa joel_lang_thumb jpg john_jamisan_thumb jpg jan_kalka jpg jan_kalka_thumb jpg jay_pal mei_thumb jpg kaLmantsch jpg kaimantsch_thumb jpg ka ma nasidi ipg Macintosh HD Users jarredwilson Documents School Graduate School UT University 2014pBigit AF taihnd red _nassa _thum b j p3 katen_pavelka jpg katen_pavelka_thumb jpg kathken_adtian jpg kath ken_adiian_thumb jpg kathben_haulihanjpg kathken_haulihan_thumb jpg kay_gaach jpg kay_gaach_thumb jpg ken_Ikischmann jpg ken_Ikischmann_thumb jpg kim_s mith jpg kim_smith_thumb jpg lauiie zapala jpg lauiie_zapal iaw jpg laurie_zapalec_thumb jpg lea_engke_thumb jpg balka ipg lecia_barkerthumb jpg libby_pelerek jpg latieneiay jpg latiene_iay_thumb jpg luke_dunlap ipg luke_duntap_thumb jpg maicLcakman jpg maigaie hermes meye jpg magaet hame meyei_thumb jpq mal i estea jpg mat iz estea_lhumb jpg maty_tynn jpg maiy_maaie_thumb jpg matthew_kase jpg matthew_lease_thumb jpg 1 megan_wingel jpg megan_wingel_thumb jpg melanie_leinberg jpg melan ie_lein be g DD1 ipg melan be q DOZ jpg melanie_leinbeig 203 melanie
86. pg bab_wallan jpg bab_walian_thumb jpg baisseau jpg baisseau_thumb jpg bannie_bizazewskijpg bannie_bizazowski_thumb jpg biaake jpg biaake_sheldan jpg biaake_sheldan_thumb jpg cakin_baye jpg cabin_baye _thumb jpg camille_mandiga_thumb jpg catla_ctinel jpg ca la_c ine _thumb j pa 1 car en ipg ca al_ca edn ca aline_l ick jpg ca aline_l ick_thum b jpg cassie_alvaiada jpg cassie_alvaiada_thumb jpg cathe ine_s chneidel ipg chela me chela_me Lege i lhu mb jpg christian schky jpg chiistian_schley_thumb jpg jpg cia an_l ace_thumb j pg cha kluke jpg cla k_lule_thumbjpg chay_s pinuzzi jpg clay_s pinuzzithumb jpg cannie_bioaks jpg cannie_bioaks_thumb jpg ciaig_blsha jpg c1aig_blsha_thumb jpg 26 dairy Ltackerthumbjpg david_aictui jpg david_aictui_thumb jpg david_biackus jpg david_biackus_thumb jpg david_giacy jpg david_giacy_thumb jpg david_maa jpg david_maa_thumb jpg delsulLiyg delault_thumb jpg diane_bailey jpg diane_bailey_thumb jpg diane_baily jpg dan_cailetanjpg dan_catletan_thumb jpg dana_kui Lz ipg kui le_thum da alhy_hiahl jpa elizabeth_clatk jpg eliza beth_claik_thumb jpg elly_slevens jpg elly_s levens_thumb jpg Heming_seay jpg Heming_seay_thumb jpg lank_liv jpg lrank_liu_thumb jpg gary_haaver jpg gary_haavei_thumb jpg giaham_heathe jpg hatry_mattin jpg hary ma Lin_Lh u mb jpg ianiichaids_thumb j
87. pg barbaia_immiath jpg barbara_immiath_thumb jpg biss_thumb jpg blukenbilLlull ipg blukenbilLlhumb ipg cal car edan jpg caral car 1 chela j pa chen jpg chidestei jp3 chidestei_thumb jpg cisca_thumb jpg clayman_thumb jpg cavinglan_thumb jpg cunningham kiuppa_thumb jpg davis_lull jpg davis_thumb jpg dillan_thumbjpg daty_lull ipg daty_thumb jpg elian_lull jpg eidelez_daty_lull jpg eidelkz_daly_thumb jpg Listing of iSchool_Web Page 14 eidelez_lull jpg eidelez_thumb jpg lountain jpg gallaway_lull gil gallaway_thumbagil gary_geisei jpg gary_geislei_thumbjpg gile ipg gr cy lull ipg giacy_thumbjpg gielchen_hallman jpg gielchen_hallman_thumb jpg hallms k_lullipg hallmatk_thumb jpg haiman jpg harman_lull jpg haiman_thumb jpg heath jpg heath_thumbjpg hawinglan_lull jpg haw ingtan_aiiginall jpg hawinglan_thumb jpg immiath_lull jpg immiath_thumb jpg jansen_thumb jpg jae_sanchez jpg joe_sanchesz_thumb jpg lance_hayden jpg lance_hayden_thumb jpg 1 _1011 d riz thumb jpg 1 11 jpg luis_hanciscarevilla_thumb jpg megan wingeljpg megan wingel_thu mb jpg me sky jpg metsky_thumb jpg miksa_lull jpg miksa_thumb jpg OLDba bala im midih ipg awens_lull jpg awens_thumb jpg palmquistlull jpg palmquist_thumb jpg pavelka_lull ipg pave lka_thumb jpg san_thumb jpg pallack_thumb jpg Randalph Bias jpg tice live
88. php en_US la pi DDDD14 inc php en_US la pic DEDD15 inc php en_US jau nal la pi DEDD16 inc php en_US iaurnal la pi DEDD17 inc php en_US jauinal tapk DDDD1B inc php en_US idu nal la pi DEDD15 inc php en_US jau nal la pi DEOD D inc php en_US iaurnal la pic DEDD21 inc php en_US la pi DDDD22 inc php en_US la pic DDDD23 inc php en_US jau nal la pi DEDD25 inc php en_US la pic DEDD28 inc php en_US la pi DDDD27 inc php en_US iaunal la pic DDDD28 inc php en_US publi hing 1ac DDD DDD in php en_US publishing la pic DEDDDD inc php en_US publishing tap D DDD inc p hp en_US publishing la piz DEDDD2 inc php en_US publi hing La pic DDDDD3 inc php en_US publis hing La pic DDDDD4 inc p hp en lIS sile lac DDDDDD inc php en_US ile la pi D DDDD inc php en_US site la pi 000001 inc php en_US sile lopi DEDDD2 inc php en_US site lapi 00003 en_US submissian lac DEDDDD inc php en_US submissian tapic DDDDDD inc php en_US submissian la pi DEDDD inc php en_US submissian tapk DEDDD2 inc php 36 en_US submissian lapk DEDDD3 inc en_US submissian lapk DEDDD4 inc en_US user tac DDDDDD inc php en_USlusei lapi DDDDDD inc php en_USlusei lapi DDDDD1 inc php en_LUSlusei lapi DDDDD2 inc php 00003 en_lS use La pic 00004 en_USlusei lapi DDDDDS inc php
89. png baill gil baill png baill6 gil badl6 png bail2 gil bad2 png barl32 gil bail32 png bail4 gil bail4 png bailB gil bailB png baigl gil baigl png baig16 3il baig16 png baig2 gil baig2 png baig32 gil baig32 png baig4 gil baig4d png baigB gil baigB png baihl gil baihl png baihl6 gil baihl6 png barh2 gil baih2 png baih32 gil baih32 png Listing of iSchool_Web Page 7 bath4 gil 4 barhB gil ba hE png bail gil batil png bail gil ba il png ba i2 gil baii2 png bari32ail bari32 png ba i4 gil bati4 png baiBgil baijl gil baijl png ba j1 qil 1 ba j2 gil ba j2 png bari32ail baiji2 png baridail 4 baij Bail baijB png himl2 gil himl2 png sqD png sql png sq2 png sqi png 44 45 46 47 sqB png 4 3q9 png analaga gil analaga png aichive gil AieYauTypel gil assignments gil assignments_sel gil baal gil baial png ba al gil ba al png baia2 gil 23 baia2 png 32 gil baiai2 png baad gil barad png barBail baibl gil baibl png barbli gil babl png baib2 gil baib2 png barb32 gil barb32 png barbd gil baibd png bai bB gil ba bE png baicl gil baicl png ba cl gil ba cl png ba c2 gil baic2 png baic32 gil baic32 png baicd gil baicd png baicB gil baicB png ba dl gil baid1 png baid16 gil baid1lfi png baid2 gil baid2 png baid32 gil b
90. rces Leb DBSauiceStales Lx 1 debug php Dektalab php Dis playa m inc DisplayRagisualianfaim inc Edilab php EditRegistialian php lai merhlaccess Glabal ine GlabalF unz tians inc Help php him Mil him Mime Ma il APLxL backgiaund gil c hangelag 1 1 exampk l php example 2 php exampk 3 php example 4 example 5 php example html example ek exam pk zip himiMimeklsil php himlMimelda il 1 himlMimelklail Ls qz mimePail php RFCR22 php smip php himIMimeMa il php mimePail php RFCB22 php 33 smip php him IMimeMail php icons DisplayHelpoll gil Dis playHelpOn gil dabail qil Reques LEmail gil Seaich gil SeaichResults gil ImageSeeker jpg index php JabDetails php jabweb lws da biebHelp php Lis tRegistialians php Lagin php Lagaul php mail JabWebE mai tOutpul himl MaillabLis tings pl PastingsByFieldRepaoit php README txt Regisler php Repails php RanFiguies gil RanGiaph ail RanPeaple gil Sam pk Image jpg Sesich php SeaichAichive php SeaichExecule php SeaichResults php SeaichTest php SendE mail pdates php SendE maill pdates ph p cli SaundSae ings Invaicelmage jpg Submiab php SyslemDawn Index php lemp FACULTY SchaalCanveisian php README 1 1 STAF FiSchaalCanveisian php lesl php TODS xt UserExpail php UserOptians php UserRepart php kilgerlin abdul php index php Macintosh HD Users jarredwilson Documents School Graduate School
91. s ial tac DE DDDS inc php en_US edilatial tac DD DD D inc php en_US edila rial tac DDDD 11 inc php en_US edila isl tac DDDD12 inc php en_US edils isl tac DDDD13 inc php en_US edilatial tac DD DD14 inc php en_US editatial La pic DEDDDD inc php en_US dila ial la pic DEDDD inc php en_US edils ial la pic DEDDD2 inc php en_US edilatial tapk DEDDD3 inc php en_US editatial ta pic DEDDD4 inc php en_US edilaial ta pic DDDDDS inc php en_US editaiial to pic DEDDDE inc php en_US editaiial to pi DDDDD7 inc php en_US dila ial ta pic DOODDE inc php en_US editaiial ta pic DDODOG inc php en_US dila ial ta pic DEDD D inc php en_US edila ial La pi DDDD11 inc php en_US edila il ta pi DEDD12 inc php en_US edils ial ta pi DEDD13 inc php en_US editorial tapi DDDD14 inc php en_US editaiial La pi DDDD15 inc php en_US editatial La pi DEDD16 inc php en_US edile rial ta pic DEDD17 inc php en_US 000018 en_US editaiial ta pic DDDD15 inc php en_US editaiial ta pic DDDD21 inc php en_US edile ial ta pi DEDD21 inc php en_US edilaial DDDD22 inc php en_US editaiial topic DDDD23 inc php en_US editaiial La pic DDDD24 inc php en_US edils ial ta pi DEDD25 inc php en_US edila il La pi DEDD26 inc php en_US editaiial ta pic DDDD27 inc php en_US dila ial La pic DDDD2B inc php en_US editaiial ta pi DEDD25 inc php en_US edila al ta pi DEDD3D inc php en_
92. t concern was the passwords held in the include php files that generate content on many pages Although we were able to remove that password information as described above this alerted us to the possibility that more sensitive information might be buried in the code of which we were not aware Additionally the fact that we obtained the entire set of files that make up the iSchool website means that we were not able to on a granular level determine what materials we would and would not take Reports from the previous website archiving groups make it clear that there are significant privacy and IP issues involved with much of the material that is part of the iSchool s website In getting the original files from the site administrator we obtained not only the material that is currently at the top level of the site which would be captured by a shallow crawl but also the material that exists in deeper levels and may not even be linked to from the main pages on the site While in many ways this is very exciting since archiving the deep web is a particular challenge for web archiving it is also very challenging Working through all of the files that we were given to identify sensitive information that should be redacted would be extremely time consuming and require an in depth knowledge of the site architecture and directories While we are fairly confident that we addressed most of the security needs we were unable to devote the t
93. the mysql database root user using a mysql command line client Type the following command to login with a password at a shell prompt mysql u root p Create a database CREATE DATABASE databasename Create a New User The root user has full access to all of the databases However in cases where more restrictions may be required there are ways to create users with custom permissions Make a new user within the MySQL shell CREATE USER newuser localhost IDENTIFIED BY password At this point newuser has no permissions to do anything with the databases In fact if newuser even tries to login with the password password they will not be able to reach the MySQL shell Therefore the first thing to do is to provide the user with the permissions they will need to access the new database GRANT add privileges to a newly created user GRANT SELECT INSERT UPDATE DELETE ON databasename TO newuser localhost If you want to give them access to any database or to any table make sure to put an asterisk in the place of the database name or table name However with such settings this user is not able to install the databases as it cannot create tables To add all privileges to the user you don t have to list all of them but you can use the ALL shortcut as follovvs Hovv to GRANT all privileges to a user GRANT ALL ON databasename TO newuser localhost 49 The asterisks in this command refer to the databas
94. ulty_aiginal php giaups php images xml index php ischaaLaig_chail jpg lacal ine michael_winship jpg michael_wins hip_thumb jpg pean_details php pas itians php profiles large akx_addisan jpg aksx_addisan_quale jpg Macintosh HD Users jarredwilson Documents School Graduate School UT University 22 3pr ql Attiiand prot akx_heishey_meyers_quale jpg akex_meyers ipg anuj nanavatiquale jpg cha baath jpg chai_baath_quale jpg daug_stuail jpg daug_stuail_quale jpg elly_slevens jpa elly_s levens_quale jpg jade_andeisanjpg jade_andeisan_quate jpg kijana_knightjpg kijana_knight_quate jpg lisa_schmidtjpg lisa_sch mid quate jpg melanie jpg melan iz ki quate jpg thumbs akex_addisan jpg alex ddisan quale jpg alex hershey meye quale jpg akx_meyes jpg nanavaliquale jpg chai_boath jpg c har baqlh quqle ipg daug_stuail jpg daug_stuaitquale jpg elly_slevens jpg elly_stevens_quale jpg jade_andeisan jpg jade_andetsan_quale jpg kijana_knighLipg kijana_knight_quate jpg lisa_sch mid L j pa lisa_sch mid quate jpg melanie_calie ld melan iz ca lie ld_quate jpq specializations php ylall php s ludents php programs noles capslone biawse_genies php biawse_instilutians php biowse_prajects php capslane_laale inc can e vala php index php instilutian_details php Listing of iSchool_Web Page 2
95. used as the Machine s desktop background The VMDK file was then uploaded to DSpace Management of the Data DSpace management amp structure In designing the structure for these materials in DSpace we were strongly guided by the work of the previous iSchool website groups We believed that it makes the most sense to use the same general structure to make it easier for users to understand the kinds of materials contained in the archive Accordingly we followed the 2006 group s structure of creating two sub communities one for the archived materials and another for the documentation of the archiving process We called those sub communities 2013 Website Archive and 2013 Website Documentation The Website Archive sub community contains four collections 2013 Website Individual Component Files 2013 Website Screenshots 2013 Website Virtual Machine and 2013 Website Webcrawls Each of those 11 collections represents a distinct method of documenting and providing access to the website so although similar content is contained in each the materials they hold are different enough to warrant separate collections The Documentation sub community contains three collections 2013 Website Meeting Notes 2013 Website Reports and 2013 Website System Documentation We found the 2006 group s meeting notes very helpful so decided to provide our meeting notes for future groups use The Syste

Download Pdf Manuals

image

Related Search

Related Contents

ION iAD04 User's Manual  Operating Instructions  Service Manual Micro Bass Series  410, 410SB - Napoleon Products  User Manual - Planetechusa.com  Voice Editing Premium Edition 取扱説明書    Stovax 8700CFCHEC Stove User Manual  Conceptronic CLHDMI14EG18  Philips N Coffee maker HD7686/90  

Copyright © All rights reserved.
Failed to retrieve file