Home

Conex. User`s Manual

image

Contents

1. Note that the set of metadata properties depends on the document type plugins installed Some plugins may define additional document properties To know which properties are available in your actual config uration open the Details tab of the Document properties dialog A common rule of using a property name in search queries is to lowercase all letters and replace spaces with underscore characters e g Date modified date_modified The Tag property though exists is not searchable If no property prefix is set Quick search uses the properties defined in the Search in menu Phrase search By default a query expression containing multiple space separated terms matches the documents contain ing any of those terms For searching for an exact phrase including the spaces 1t must be surrounded with double quotes Hello WOrLaS 2 http lucene apache org java docs queryparsersyntax html Conex User s manual title New York author John Doe Wildcards Wildcards are used to replace one or more characters in a query 2 symbol is a single character wildcard For example h t matches any of words hat hit hot but not heart 6699 symbol is a multiple characters wildcard For example list matches any of words list listing listen listener etc Note You cannot use a or symbol as the first character of a search
2. Date search The values of the date properties date and date_modified has the following format yyyyMMddHHmmssSss Where e yyyy Year 4 digits e MM Month 2 digits e dd Day 2 digits e HH Hours 2 digits e mm Minutes 2 digits e ss Seconds 2 digits e SSS Milliseconds 3 digits To search on the date properties the wildcards can be used For example date 20090820 all documents created at August 20 2009 date_modified 200707 all documents modified during July 2007 It 1s also possible to search the documents in a range between two dates date 20070715 TO 20070815 all documents created between July 15 and August 15 2007 date 0 TO 2007 all documents created before 01 01 2008 Fuzzy and proximity search Fuzzy queries are used to find the terms similar in spelling to a given word For a fuzzy search add the tilde symbol at the end of a single term expression 29 66 roam will find documents containing words roam roams foam road and alike oon For finding words which are within a specific distance away proximity search use the tilde symbol at the end of a phrase For example to search for an oil and import within 10 words of each other in a document use the search oil import 10 Boosting a term In multi term queries it s possible to specify a boost factor of a specific term A boost factor indicates an impor
3. IK Cy DOI GSO US a e no 30 Conex User s manual 1 Installation and running 1 1 Prerequisites You need Sun Java 2 Runtime Environment JRE version 5 0 or higher installed in your system before you can use Conex You can download a free copy of JRE for your platform from http java sun com javase downloads Conex is a cross platform software independent from specific computer platform and operating system At the moment the following platforms are tested and fully supported e MS Windows 2000 XP Vista e GNU Linux 1 2 Conex installation 1 2 1 MS Windows To install Conex in MS Windows operating system execute Conex exe automated installer and follow instructions on the screen After installation is completed you can run Conex via the Start menu new Conex section will be created or using a shortcut on your desktop 1 2 2 Cross platform install To install Conex on other platforms or if you cannot use the Windows installer by some reasons use the cross platform distribution a compressed ZIP archive file Use any unzip utility to unpack the distribu tion archive into a directory of your choice After unpacking the archive you should see the following directory structure L7 conex ET Cone 1 lib 1 plugins conex bat launcher jar conex sh me mo mg The start up scripts for Windows conex bat and UNIX systems conex sh are provided for running Conex Note On a UNIX base
4. Plugins manager shows all plugins available to download and install The plugins are grouped into categories represented by dialog tabs To download and install a selected plugin click on Install button If newer version of an installed plugin 1s available click on Upgrade button To remove a plugin click on Uninstall button To update plugins information click on Check for updates button 1 4 Uninstall Conex 1 4 1 MS Windows To remove Conex from your MS Windows system select Uninstall in Conex section of the Start menu and follows instructions on the screen 1 4 2 Cross platform install To remove Conex installed from a cross platform ZIP file distribution delete the folder where you have unpacked the file 1 4 3 Delete the repository and data files The uninstall procedure preserves repository files configurations cache and other data files used by Conex This allows to restore the program environment in next Conex installation e g after version up grade Conex User s manual To remove the Conex data delete the directory where it is located You can find the actual path of this di rectory in the Repository path field of Conex configuration dialog Tools Configure Conex User s manual 2 Repository Management The cornerstone of the Conex architecture is the documents repository The repository is a centralized collection to keep records on the documents independently from their physical loc
5. existing documents rescan or re create a location 12 Conex User s manual 3 PCN Client configuration PCN Client functionality provides connectivity with a PCN server to create and update a user s profile of interests After PCN Client is configured Conex will submit the tags and metadata of the documents from specified locations to the server to populate the user s profile To use the PCN Client you need a registered account on a PCN server To configure the PCN Client select PCN Client gt Configure PCN Client menu item 3 1 Configure server connection If the server connection has not been configured before the server configuration dialog will appear auto matically after selection of Configure PCN Client menu item Otherwise you can call this dialog by press ing Configure connection button in the main PCN Client configuration dialog PCN Server authentication Please provide a user name and password on the PCN server PCN Server URL https 28 243 93 142 pen server User name pO Password Illustration 8 PCN Server connection dialog e In PCN Server URL field enter full absolute URL of the PCN Server instance e Enter your user name and password on the PCN Server e Press Ok 3 2 Configure contexts You have to create one or more contexts for the locations you want to include into your profile on the PCN Server If the client is configured at the first time the context management dialog appears aut
6. gt Og laghadk logge Ba lucen ene maga mad map mappe ma preduo al maps ma mah maris mess med a mew metadata methodology Toran coe dna mas made model muimeda musa mystery mythology 1 Teo me ee Ma ROIG NETOS ney meas ANO Mn Mp oe anayam mau mena mone AsiaD reus a aiar omiolog ontology 00p Opensource a creiaad owl p page pagerank pirer peesag pasea palena por peer perkarmanos pros Oey Pee any php CM Aen pee plug pu Ae peri pr is Eoy ami a 51 popular perro press prima prob bi procedur proosedings productivity pail progam programming eee paychi publeting pyi non quartun Quan rama rat nd road reduc suas regresa religion remind restar research resources resi rainav river rope rpm S ruby rubyonrals nesia russian s Sacred spee scianca ascot scripting search sacurily Sekt selector San samane Samanics So Beri Sail Swi Sa Sane Ske repo ni sia Sie ote 6c de Sn 505 social socialnetworks sociaisystems sch software cur cei e Soa Sarg spec spect pke ola aol standards sta Ey ds Such Suen A suport SUP em Sem Swe Swing A ah sysadmin Sjenadnimsi awa lag tage tagging anar Taxonomy technology denpla e Ten sig lexi TEN Tanase sete fete Beary Mesas ger tools topic ban vasken ee pese tutorial mir Illustration 20 Tag navigation Selection of a tag causes navigating to the subset of documents matching the selected tag To select mul tiple tags hold down the Ctrl key For quick finding and selection of a tag click inside the
7. tags widget and start typing Tag management operations are accessed via Tag menu items They are duplicated with the buttons on the tags widget toolbar and with the context menu 5 2 1 Populate tags Tag autopopulation is a process of assigning a given tag to the documents relevant to it A minimal threshold for document relevancy 1s set in the application preferences To populate a tag select 1t and choose Tag Populate tag menu item To populate all tags choose Tag Populate all tags menu item 5 2 2 Delete a tag To delete a tag from all documents associated with it select a tag and choose Tag Delete tag menu item 5 3 Search Search tab provides an interface to the full text and structured property search functions of Conex The search results are displayed in the tab s document list sorted by relevancy by default 5 3 1 Quick search To perform a quick search enter a query string into the text field on the Search tab and press button or hit Enter key The quick search queries can use special query syntax 24 Conex User s manual If no document property is declared with the special syntax the search is performed over document prop erties defined in the Search in menu To change the searchable property set open this menu by press E 2 ing button and select the properties in a pop up menu box 5 3 2 Advanced search To perform complex structured search queries click
8. the location contains documents written in multiply languages In this case Conex will try to identify the language of each individual document automatically may slow down the indexing process Conex User s manual 2 2 2 URI filter URI filter allows to define custom rules about which documents should be processed and which ones should be skipped in this location by defining two URI patterns Add new directory location Location Include all known document types Exclude COO Usa ragular expressions for lie names or URL s you wani to include and axcluda Illustration 4 URI Filter tab of Location dialog e Leave the Include all known document types checkbox selected to include all documents allowed by installed document type plugins For instance if PDF plugin is installed all documents with pdf filename extension will be processed etc e Unselect the Include all known document types checkbox to set a custom inclusion rule in or in clude only field If the inclusion rule is set only documents matching this rule will be processed e In the Exclude field you can define a rule for exclusion the specific documents from the location All documents matching this rule will be ignored skipped Inclusion and exclusion rules are defined as regular expressions of the file names or URLs Examples in clude doc All files with names ending with doc denotes any number of characters the d
9. type e In the Program file field enter full path to the executable program file or use Browse button to find and select the file e In the Command line options enter additional command line arguments of the program if needed If this field is empty only document path URL will be passed to the program Alternatively you can define program arguments line using two dollar characters to be substituted with a real document path e Select Default application for this documents type checkbox if this application should be used by default for Open document action if multiple applications are defined for this document type As soon as the application is set it will be added to Document Open document with menu and available for all documents of the same type Repeating this operation you can define as many different applica tions as you wish 17 Conex User s manual 4 3 Edit document properties 4 3 1 Quick edit in the detailed view mode To edit a property value of a document in the detailed view mode select a document and click on a table cell of the property to edit the value Select another document or press Enter when editing is done Press Esc to cancel editing and discard the changes Note that some properties are not editable 4 3 2 Document properties dialog box To edit the document properties in the dialog box choose Document Edit document properties menu item SCAN lt 2 gt SCAN Smart Conte
10. 3 2 14 Conex User s manual e Press Configure connection button to change the server connection configuration 3 1 e Press Ok after the configuration is done The changes will be submitted to the server immediately after closing this dialog box 15 Conex User s manual 4 Working with documents 4 1 The documents list The documents list contains all documents of a selected location or results of navigational and search ac tions The list 1s located at right wide column of Conex tabs or opened in a separate window The list is represented in two view modes Detailed view default Displays the list as a table with document metadata properties as the columns List view Displays a single column list with brief document info To switch between the view modes use View View mode menu or a8 buttons on the list toolbar The documents are selected by clicking an item in the list To select multiple documents click them in the list while holding the Ctrl key To select multiple documents in a continuous range select the first item and then the last item while holding the Shift key To select all documents of the list choose Select Se lect all menu item or press Ctrl A To clear the selection choose Select Select none menu item Ctrl D All operations with individual documents or selections are performed in the Document menu These menu items are duplicated with buttons on the documents list to
11. A eee en eee er 20 AA AANGQUAGe WA Oda e colina oon tuececeeens 20 A Bro 8 lac bY 0 f 1 t 6 Reon A Se E E E creer eer T ss 20 RNP ODUA gt 6 cea eee ee eee eee ee ee MERE eee i Cee Eee RRO eet eee ere ees 21 A a A eee ee ROC me Rae ne nee ten Eee ci ene E E ey nee eee ry 21 AS SOONG ines acne dl o E E EIEEE AS 21 AS ly QUICK SC ACM A 21 B32 AVANCA SCAO ess ta wert o antedesuen O a aaa a 22 O DAV CO SCAICN CS ein e ii ida 22 Conex User s manual ASA INSSOCIALING SCAN CMe as cosas cick hashed la rasa 23 4 3 5 PINGING similar GOCUIMOIMS nia e thebsde ceed emealecatesueads 23 Appendix A Search quen Decano gcc Miele a ake ede otae 24 Document pres e ooh ele bel aes eS 24 A N 24 A hc pee aren com RE cee ee ee ee eee re eee eee ere eer 25 BRICS E E A E A A eer 25 FUZZ YANG PrOXIMIV SEIEN s a Aid ea N TS 25 A A T hla 25 ESOOIS AIT QD CT ALON Sisner a a a R 26 PAINE ODC ALON enee e a a 26 TF OO UO cect hs et PP A 26 OPA e a a ee 26 NOTOPEaiO a aaa ST 26 cn eee te nettle eels ee ee NG ciel ct 26 ESCApInd gt pecial CMALACIONS cise ss erate tasted alte eae ede eee ee ieee cel ee iene At 27 Appendix B TroUDICSNOOIO zai te iaa 28 o IN 28 LOS WINCOW ad coi doo is cag deena 28 CONSOLE QUID Ult cident cect ers Dn aoa 28 LOGGING to ame a edson 28 COMMOMIS SUE Sm o a See dees 28 Backup and restore the repository and SettingQS c oocccocncocncocncncncncnnncnnnconnnonnnnnnnonnnonronnnnnnrnnnnnnnnnnnos 29 Appendix C
12. CE Laboranova Mamecig PON Profile SP4 System acosa acouraci activ dun age a ajax algo al a als analysis anchor ama apine aniapam api archediagy architecture Amay detona atom Ayp audio autcloous automat autumn Day bes benchmark beta biog Eca blogging blogs bare book DOOKS bpa bugs bugracking business buton calais ca a dhag chart chal chater ass class dass assi of clipart dak due cluster cms coding estee collaboration oe omparson comprees computing concaten Concept pa a e aE corderos conmenimanagement Cop FE card cope oss SUT WEF ENTI datar mi w data datum db dbpeda dess del icio us deic design desktop development dictionary dite Suggested tags knowledg practic studi artefact work person Less terms More terms Illustration 14 Edit tags dialog box 19 Conex User s manual Enter the tags into the Tags text field separating individual tags with spaces or select the tags from the lists below the field Existing tags list contains the tags already assigned to other documents An item size indicates a number of documents a given tag is assigned to Suggested tags list contains the words recommended by a tag suggestion algorithm If a word is an existing tag it s marked with a bold font An item size indicates relevancy of a given term to the docu ment You can adjust the length of the list with the Less terms gt More terms slider Alternatively the document tags can be edited Via the Documen
13. IG laboranova Conex CONcept EXtraction client for PCN networks User s Manual University of Nottingham 2009 Conex User s manual Table of Contents alisar o A O taste e ect ciedscicacdaaes 4 A A ere Pr ree rere 4 LZ CONC X MSAN ssa a a nage oso 4 EZ IIS IMAZ o een PPE Te Eee eee ee 4 122 COSSA LONAS O a Sica iada tater aae eds 4 LES PIU GINS MS AMAN e leo ls aun ia Dl e Se e o 5 LAL ASIANS Tall ONG AP a cas aoe eh alse L8 ba sel sen etal eben e eu 5 A a Siac hae ete ChE ie lat oak pane do 5 TAZ CROSS DIAL OLED WASEAN ise sso dle a O o 5 1 4 3 Delete the repository and data fil S cccccccceecceeceeeesececeecceccceeseecauecauesseeseeeseeeseeseeaesenees 5 2 Repository Manade nen a e aes Gd Y 2 1 Adding a location to the rePOSItOl ccccccccsecseceeeeeeecceecceesaeecaeecauesseeseeesaeeceueceeseescesenseeseeenseees 7 2 Med AAN a directory NOC ALO tester sa Sack yeah ret cel ead ht Sah gl ied a nae ene 8 22 COMMON IOCAONM OPINAS Se eee attra ah rss a eth aa tation ee ice EN 8 Zed AAMC A II ate isle 8 2 2 2 A A 9 2 SUSO LUNA ALC MV e ds el Lali 9 22 AMO A o o e eta 10 a A erie a os ome eae eee eae le 10 23 LEI CAMA DIO E NES 10 2 32 REMOVE a Oc alo tar eto toco 10 24 Update Me TEPOSION arratona ia Dr a tion aad eee 10 2 4 1 Updating a single location manually cooccoocccocncocnoncconncncnnncnnnnnonnnnannnncnnnnnnncnnnnnnnennncnnnons 11 2 4 2 REscanming a sMgle located ia dec mad
14. User s manual To clear the text field and cancel the filtering press ES button The query string for filtering may use special query expression syntax see 4 2 Open a document Documents from Conex documents list can be opened in an external application It is possible to set mul tiple applications for opening documents of different types To open a document with a default external application double click it in the documents list or select it and choose Document gt Open document menu item On Windows and Mac OS X platforms Conex by default uses system file associations to identify an application to open a document On other types of desktops or if this method fails otherwise you will be asked to set an application before opening a first document of a given type 4 2 1 4 2 1 Open a document with specific external application To open a document with an external application already defined for this document type choose it in Document gt Open document with menu To set new application to open the documents of a given type select Document gt Open document with Select program menu item select program to open document Program file i sale an application lo open thea documents al the type Command line options pO Additional command line aptions Use as a placaholder ol a sslactad dacumani Cancel Illustration 12 Select program dialog _ Default application for this documents
15. a specific author start typing the name in the text field below the list The list will be filtered by an entered value To clear the text field press ES button 22 Conex User s manual 5 1 3 Date widget Author Date Language o 44 2007 A 2008 2009 January February March April Illustration 18 Dates navigation The dates navigation widget contains a calendar to navigate the document repository by dates of docu ment creation Selection of an year a month or a day in the calendar causes navigation to a subset of doc uments created at the selected period of time 5 1 4 Language widget l Author 2 6 0 8 0700 0 a Illustration 19 Languages navigation The language widget contains a list of all languages of the documents in the repository To navigate to a subset of documents with a specific language select it in the list 9 2 Tag navigation Tags tab provides navigation through the document repository using tags assigned to documents The tags cloud widget in the right panel contains a list of all existing tags sorted alphabetically The size of a list item indicates a number of documents associated with the given tag When a tag is selected the widget automatically highlights the tags related to the selected one An inten sity of a highlighting color indicates a degree of relevancy between the tags 23 Conex User s manual
16. anced search query constructor For complete search query syntax refer to the Apache Lucene guide The search is case insensitive In addition if the documents were indexed with the Apply stemming option and this option is turned on a query will match any document containing a variation of a word with a same stem as the query term have For instance the query term constructor will match the words 29 66 29 66 construct constructor 99 66 construction constructed etc Document properties To search on specific document properties in the Quick search use the term prefix in form of a name of a 2 9 property and the colon For instance to search the word wiki in document titles only use the query term title wiki To search more than one word in the same document property use parentheses author John Mary Jane This query returns the documents containing either John Mary or Jane in the author property The following document properties are available for search e title Document title e description Document description e text Document text default e author Document author e notes Document notes e date Creation date e date_modified Modification date e url Full document URL e file File name e path File path e size File size in bytes for local documents e language Document language code e g en fr de etc
17. ations A user can browse or search the documents in the repository and open them from their original locations with the suitable external applications The repository aggregates content from different sources The folders in the local or network filesystems are used as the document sources by default A number of other types of sources are available with the in stalled plugins web syndication feeds mailboxes del icio us bookmarks and others Adding documents to the repository is automated A user only need to point Conex to a location she wants to add and the application will find and add every document from there Added locations are moni tored for changes new modified or deleted documents to keep the repository up to date The repository keeps records on user s documents with rich set of metadata properties title description original location author creation date etc The metadata properties associated with the documents are set automatically on a document adding and can be quickly edited by a user Their values can be used in search queries to find the documents matching the specified criteria A full text search engine is integrated with the repository When new documents are added to the reposi tory their content is extracted and indexed for search The repository is independent of the specific document formats The number of popular document for mats is supported either natively or via the plugins including HTML PDF OpenOf
18. d system you may have to set executable permissions for the shell script s cd conex S chmod 755 conex sh It is possible also to run Conex using executable launcher jar file Generally it is enough to double click this file in your file manager to get Conex started If it doesn t work check your system file associa tions or execute Conex launcher from a console S cd conex Java jar launcher jar Conex User s manual 1 3 Plugins installation Conex is provided with the plugins management framework for easy download installation upgrading and removing additional components within the application Manage plugins Document type plugins f Location type plugins User interface extensions Document type plugins are the optional parsers for specific document formats After installing this plugin the documents of new type will be recognized and indexed by SCAN a OpenDocument plugin new version 1 0 is available Parses the text documents in OASIS OpenDocument 1 0 format The recognized file pattern is odt a PDF plugin installed Parses the Adobe PDF Portable Document Format files The recognized file pattern is pdt Uninstall Th MS Word plugin Parses the documents in Microsoft Word 97 2000 OfficexP format The recognized file pattern is doc Y Check for updates stration 1 Pluains Manaaer dialoa To open the Plugins Manager window proceed to the Tools Manage plugins menu The
19. e E tet unt etl 11 2 4 3 Updating the whole repOSItOry ccccccccsecsececececeecceccceecueecusecaueceueceesseeesseseeseeseessesenseeees 11 2 4 4 Setting UP Update AUTOMALION ccccccceeccecceecececcceccceeceneceeeseeeseueceesoescusaeseeceesecseceseeseeces 11 A I EEEE I IE EE enn te ase anea E E T A E 11 KN OTRO WIN ACUM sais 13 ele a A A 13 SL LS USO MIZO Me CLAN CG VCW a a a A os 13 AS A A a O A A eee 13 SL MM do 13 SA O pad o o ACNE eee Neer me ee er ern Ieee tata oe rene eer eee ae 14 3 2 1 Open a document with specific external applicatiON coocccocncocncocicocncncnnononnnonanonnncncnnnons 14 3 3 Ed GOCUINENE PIO DCIS oi si creek tie tet ser deceo id 15 3 3 1 Quick edit in the detailed view MOVE occooccoccocccoccocnconcncnncnconnncnnnrononnnnonnoncnonnnnnnnnnnnnonnonos 15 3 3 2 Document properties dialog DOX oocooocooccoconocccooncconcconccononononononononnnonnnonnnonnncnnnnnnnnnncnanennnas 15 codes A aes eect theiens Sansa dates eater 16 SSEMMA S scores tated apse ir rabisdis 16 35 0 Documents AUIOLAQ GING id a di eeu eae is 17 3 6 1 Global autotagging Preferences cccccceeccseeceeccececseeceecceecseesuseseeeceueseeeceeseeceseeseesesensenees 17 A INAVIGAUION ANG Scare int crta tarsus dbipe ii dd 19 A WIP ACCTOO DIO W SING ont dd 19 AA UPAND WOO Dee ee ec ee dias 19 rAd MEA A O A eee eee eda Oe ee ete eee 19 riled WES 9 IB gt 81 9 e 8 1 ee nee eee et eee
20. e provide a detailed description of your problem and attach the console output if possible Logging Log window To see the warning and error messages in the application log click on the button in the bottom right cor ner of the Conex screen The log messages window will appear To force immediate memory garbage collection press GC button It may free some amount of RAM To clear the messages window and close it press Clear and close To close the window without clearing the messages press Close Console output To see the full program output while Conex is running open the system console window go to the direc tory where Conex 1s installed and execute the startup script conex bat or conex sh depending on your platform Logging to a file To configure file logging open scan conf file in the Conex home directory in a text editor of your choice and add an entry lt entry key scan logging file level gt ALL lt entry gt Instead of ALL you can specify the minimal level of messages to be written into the file CONFIG INFO WARNING or SEVERE By default the log file is named conex log and located in the Conex home directory You can change its name and location by adding an scan logging file path entry e g lt entry key scan logging file path gt var log Conex log lt entry gt Common issues Tagger Cannot connect to database error at startup Indicates that another Conex process is already running or t
21. fice ODF MS Office documents plain text files and email messages Repository management operations are available in the Browse application panel The panel contains the list of all locations in the repository which can be browsed with the documents list of this panel L gt qe er lo la la 5 BBC News News Front Page World Edition g Del icio us bookmarks Information Retrieval Documents INCOMING Illustration 2 Locations list Repository management operations are accessed via Collection menu items They are duplicated with the buttons on the locations list toolbar and with the context menu 2 1 Adding a location to the repository When new location is added Conex gets the documents found in this location by a specific location provider Location provider is a part of a Location type plugin which knows how to extract the docu ments from a specific location type The Directory location provider is built in and does not need a plugin to be installed The provider for new location is selected in Collection Add documents submenu Conex User s manual Content and metadata of the documents are extracted using an appropriate parser which depends on a document format A format of an each document is generally determined by a filename URL pattern Adding the documents to the repository is a time and resource consuming operation and it is thus per formed in background mode 2 1 1 Adding a directory l
22. g all location documents from the repository and adding them as new It s generally has the same effect as re creating a location and may be used e g after changing the global indexing options To rescan a location select it in the locations list and select Collection Rescan location menu item 2 4 3 Updating the whole repository To update all locations in the repository select Collection Update all menu item 2 4 4 Setting up update automation Updating the collection can be performed manually 2 4 1 2 4 3 or automatically in a specified time in terval To configure automated update open the system Configuration dialog Tools gt Configure Update collection _ Use interval updating ane Update on start Illustration 6 Configuration dialog Update collection e To set the global update interval select Use interval updating checkbox and enter an interval in minutes or hours To disable automatic updates unselect this checkbox Update interval can also be set for a specific location individually see 2 2 3 If Update on start checkbox is selected the collection will be updated every time when Conex is started 2 9 Indexing options To configure document indexing open the system Configuration dialog Tools gt Configure Indexing Default language anes dS Apply stemming Filter stopwords Changes will ba applied lo naw documents anh Illustration 7 Configuration dialog Indexing o
23. hat a previous session has been halted unexpectedly e g because of a power break Make sure that Conex is not already started with the same repository path If it is not the case delete repository db 1ck file and run Conex again 31 Conex User s manual Out of memory error Conex may run out of memory limits when parsing very large documents If it happens try to in crease an amount of RAM available for Java Virtual machine by editing a line in the startup script e g Java Xms256M Xmx256M Jar launcher jar In the example above RAM limit is increased up to 256Mb You can replace it with another rea sonable value Encrypted PDF files are skipped The feature is unsupported at the moment Duplicate document entries Tags without documents May be caused by an unexpected break of the application Run Index maintenance function Tools Configure Index Maintenance gt Start to check and fix the repository index Backup and restore the repository and settings It is recommended to backup the repository and configuration files before upgrading the Conex version To make a repository backup copy the contents of the Conex home directory To identify the path to this directory look at the Repository path field of Conex Configuration dialog Tools Configure To restore the repository copy the archived contents back to this directory check if Conex is not started during copying 32 Conex U
24. nt Aggregation and Navigation Users Manual Description smart Content Aggregation and Navigation User s Manual version 1 3 ViceVersa Technologies 2009 Tags scan manuals Select Author Alex Alishevskikh 7 Language English en File scan_ug pdf Location home alex Documents SCAN Type PDF document Created Mon Mar 10 12 29 47 YEKT 2008 Mindified Bian ar i 19 90 48 VERT Sonne Illustration 13 Document properties dialog box The General tab of the dialog box contains some most common metadata properties to view and edit Title The document title Description A description of what the document is about Tags A space separated list of the document tags You can edit the tags in the text field or press Select to call the Edit tags dialog see Error Reference source not found Author A document author Enter the author name or select an existing author in the combobox On the Notes tab of the dialog box you can edit the document notes see Error Reference source not found 18 Conex User s manual The Details tab contains a detailed table view of all metadata properties of this document You can edit those marked with To assign a property value for a number of documents at once select multiple documents and choose Document Edit document properties In the dialog box fill the fields for the properties you want to as sign the values to For empty fields the original val
25. ocation To add new directory location select Collection Add documents gt Add directory menu item Add new directory location Filter Settings Location title A Base directory ai _ Include subdirectories Language Eaton Illustration 3 Add new directory location dialog e Select a directory with the Browse button or enter the valid directory path into the Base directory field e The value of Location title will be set automatically to the base directory name You can change it as you wish e Select Include subdirectories checkbox to add the documents recursively from all subdirectories e Optionally select a language of documents in this location and set other location options e Click Add button to start adding the documents 2 2 Common location options The location options described in this chapter are independent from specific location provider and can be set for any type of a location 2 2 1 Language The Language selector of the location dialog box allows to set a language for the documents in the loca tion Language settings affect the way the documents are indexed specifically a language specific algo rithm of stemming the words and stop word filtering By default the language is set to one defined in the global configuration dialog see 2 5 With the Lan guage Selector of the location dialog box you can redefine it on per location basis Select Mixed autodetect option if
26. olbar and also in the documents list con text menu 4 1 1 Customize the detailed view The detailed view can be customized by choosing the document properties to display as the table columns in View Columns menu This menu can be called also by right clicking a column header To re order the columns of the detailed view drag the column headers Change the columns width by dragging the header edges 4 1 2 Sort the list To sort the documents list select a document property in View Sort by menu The documents will be or dered by values of the selected property Alternatively in the Detailed view mode you can click on a property column header to get the list sorted by that property To sort the list in ascending order select View Sort by Ascending order option View Sort by No sorting option means that the documents are ordered by relevancy to the query in the case of a search result or listed in no specific order in all other cases 4 1 3 Filter the list Filter search panel on the right side of the documents list toolbar allow to restrict the list content by docu ments that match a specified search query E N Illustration 11 Filter search panel To filter the documents list enter a query string into the text field and press 5 button To specify the document properties which the filtering will be applied to press LE button and select the properties in a pop up menu box 16 Conex
27. omati cally after the server configuration 3 1 is done Otherwise the dialog can be called by pressing Manage contexts button in the main PCN Client configuration dialog 13 Conex User s manual PEN contexts Set up one or more document contexts of your PCN profile Context name Laboranoval A work Illustration 9 Context management dialog e To add new context enter its name in Context name field and press Add button e To delete a context select its name in the list and press Delete e Press Close to exit 3 3 Adding locations to contexts In the main PCN Client configuration dialog you can select the locations to be included into the profile on the PCN server and assign them to the contexts Configure PCN client Context Locations _ research _ Laboranova C ICT 2009 Update profile every sH minutes PCN Server connection PCN connection is configured Configure connection Ok Cancel Illustration 10 PCN Client configuration dialog Select a context in the drop down list and mark the locations to include into this context Repeat this procedure for as many contexts as you wish Leave Update profile every minutes checkbox selected to turn on the automatic update of the profile on the server Change the updating interval if necessary The updates will be send to the server in a specified time interval e Press Manage contexts button to add or remove the contexts
28. on Advanced search or 3 button It calls the special visual query constructor Advanced search Search rules AND fon x ec ao inca Save search as Illustration 21 Advanced search dialog A query contains of one or more search rules grouped and connected with boolean operators And Or And connector means that the search results must be relevant to all rules in a group Or con nector that they must be relevant to at least one rule in a group Each rule consists of a document property condition and a test value Available conditions predicates and their negations are Contains Does not contain Searches for any listed term space separated Wildcard characters and are allowed in the terms Contains phrase Does not contain phrase Searches for an exact phrase including space characters Equals to Does not equal to Searches for an exact value of a given data property Wildcard characters are allowed After Before For date properties searches for values of dates after or before the given date To add new rule to the query click on And or Or button for an existing rule To delete a rule click on Delete button To save a query for repeating use enter a name for this query in Save search as field 5 3 3 Saved searches Advanced search queries saved for repeatable use are available in the list on the Search tab panel To execute a quer
29. ot symbol must be escaped by a backslash Desktop All files with names containing Desktop in the middle the backslashes must be escaped http www All URLs starting with http www doc Desktop http www Combination of these three rules the character acts as OR operator 2 2 3 Custom update interval 1 http Awww javaworld com javaworld jw 07 2001 jw 0713 regex html Conex User s manual Add new directory location Location Filter _ Use a custom update interval _ Apply autotagging for new documents Illustration 5 Settings tab of Location dialog By default all locations are checked for updates in a time interval defined in global application prefer ences On the Settings tab of a location dialog you can define a custom update period for specific loca tion To enable it select Use a custom update interval checkbox and set the interval in minutes or hours Zero value will disable automatic update for this location 2 2 4 Autotagging If Apply autotagging for new documents options is set all new documents of this location will be automat ically tagged after adding to the repository By clicking Autotagging options button you can call the dia log to change default tagging options for this location 2 3 Manage locations 2 3 1 Edit location properties To edit the properties of a specific location select in the locations lis
30. port or price operator prefix indicates a prohibited term The documents containing this term are excluded from the search results Olive oil Searches the documents that contain oil but not olive NOT operator If two terms are combined with NOT operator the documents that contain the term after NOT are ex cluded from the search result oil NOT olive Is equivalent to the example above Grouping To group the boolean clauses in complex search queries parentheses are used Examples title oil OR oil AND price author John Mary Jane AND import NOT date 2007 oil AND price OR brent AND urals 29 Conex User s manual Escaping Special Characters Some characters are reserved by query syntax and must not be used unescaped amp amp C tTEL IA 22 To escape these character use the backslash before the character Examples Washington city W Exactly Wi he said 30 Conex User s manual Appendix B Troubleshooting If you experience problems with Conex l Check the messages in the application log window or in a log file see Logging section below 2 Check the program debug output in the system console see Logging section below oF Read the Common issues section of this appendix 4 If no solution is found and you think the problem is caused by a bug in the application please submit a bug report Pleas
31. ptions Default language selector defines a default language of the repository The default language has two pur poses 11 Conex User s manual A language used by default for new locations Location specific language settings 2 2 1 over ride this A language used for parsing the search queries It is recommended to set this parameter to a primary language of your document collection If it is set to Mixed autodetect Conex will try to identify the language in every case Apply stemming option turns on the lexical analysis when indexing new documents If this option is set the stems of the words will be extracted and indexed instead of the words themselves For instance the words work worker and working will be indexed as a single term work so that the search for this term would return documents containing all variations of the lexeme work If Filter stopwords option is turned on the words listed as the stopwords for a selected language will be excluded from the index Stopword lists are used to filter so called common words such as the and this etc in English out of indexing thus improving search quality and efficiency Lists of stop words for each language are available as the plain text files in conf stopwords subdirectory of the Conex installation Note that changes of the indexing options will affect newly indexed documents only To apply changes to
32. ser s manual Appendix C Keyboard shortcuts Shortcut Action Enter Open a document Ctrl Enter Select a program to open a document Alt Enter Edit document properties F2 Edit a property in a selected cell F4 Edit document tags Shift F4 Edit document notes F5 Update a selected location Ctri F5 Update all locations Up Down Navigate the document list Ctrl Home Ctri End Go to the top bottom of the list Shift Up ShifttDown Select the list items Ctri A Select all Ctrl D Unselect all 33
33. ser than specified or no tags may be assigned at all It indicates that there are no existing tags relevant to those documents 4 6 1 Global autotagging preferences Depending on size and semantic nature of your document collection you may want to adjust global pa rameters to fine tune the autotagging process and to achieve the best results These parameters also affect the tag suggestion algorithm To set up the autotagging parameters select the Autotagging tab of the global Configuration dialog Tools Configure 20 Conex User s manual Tags specificity is an autotagging parameter which defines whether the terms extracted from a document directly must have higher priority than ones picked from a document context a cluster of simi lar documents High specificity leads to large and granulated taxonomies with small numbers of docu ments sharing the same tags while low specificity increases the value of general terms thus producing lesser tags number but with more documents per single tag o Tags novelty parameter controls a tendency of autotagging to invent new tags instead of re using existing ones When set to maximum existing tags takes no priority of new candidates Otherwise if ev erything else is equal existing tags have more chances to be selected for new documents Minimal nov elty means absolute priority of existing tags Tags autopopulation relevancy threshold defines a minimal value of documen
34. t and select Collection Edit location menu item Note that some basic properties such as a base directory etc are not editable To change these properties you have to re create the location 2 3 2 Remove a location To remove a location from the repository select it in the locations list and select Collection Remove lo cation menu item The documents will be removed from the repository only The operation does not affect any original files 2 4 Update the repository Updating is an operation of synchronizing the repository with actual state of the documents It checks the locations for new modified or deleted documents New documents will be added to the repository e Modified documents will be re indexed and updated in the repository e Deleted documents which are no longer exist in their original locations will be removed from the repository Updating is started in background mode either manually or automatically after a specified time interval is passed 10 Conex User s manual 2 4 1 Updating a single location manually To check a single location for new modified or deleted documents and update it in the repository select the location in the locations list and select Collection Update location menu item 2 4 2 Rescanning a single location While usual updating performs an incremental check of a location ignoring unmodified documents res canning do full re indexing of the location by removin
35. t properties dialog box By quick editing of the Tags property in the Detailed view mode To edit tags of multiple documents at once select the documents and choose Document Edit document tags menu Select Preserve existing tags checkbox to add tags from the dialog box to existing tags of the selected documents Otherwise tags from the dialog box will replace tags of those documents 4 6 Documents autotagging Autotagging is a process of automated assigning the relevant tags to documents Autotagging can be ap plied to all documents of a specific location or to individual documents selected in the document list To autotag the selected documents choose Document Autotagging menu item and set autotagging op tions in a dialog box Autotag 9 documents Preserve existing tags _ Do not create new tags Illustration 15 Autotagging options Set a desired number of tags to be assigned for each document with the Tags per document slider Select Preserve existing tags checkbox to add new tags to existing ones If this option is not set new tags will replace all tags assigned to the documents before Selected Do not create new tags checkbox will cause selection of tags from the existing tag set new tag creation is disabled Use this option if you want to control your taxonomy manually Note that if Do not create new tags option is set an actual number of tags assigned to some docu ments may be les
36. t relevancy required by Tag autopopulation algorithm see to assign the tag 21 Conex User s manual o Navigation and search 9 1 Faceted browsing Browse tab provides the widgets to navigate the documents collection by selecting the values of spe cific metadata properties facets Facet value selection causes navigating to a subset of the documents matching the specified value 5 1 1 Path widget Language Ea home la alex E Documents o 3 media A htipo 4store ora CA http acmqueue com o CA htto ai nip info uniramae it o A htto alias i com o CA htipoalistapart com o A htipy alvit de Illustration 16 Path navigation The path widget provides browsing the documents by paths of the original files or URLs represented as tree like structures Selection of a tree item causes navigation to a subset of documents in the selected path including all descendant items 5 1 2 Author widget a Jeroen Waster A Johann Riedel a Johann Riedel Marc Pallot Al a Johann Riedel Marc Pallot Allen hang A John van Wyhe a Keith van Aijsbargan a Lab User Copyright e IBM Corporation and othars 2000 Illustration 17 Authors navigation The authors widget contains an alphabetically sorted list of all author names found in the Author meta data properties of the documents Selection of an author in the list causes navigation to a subset of docu ments matching the selected value To find
37. tance of a term in the query so that the documents matching the boosted terms will have higher rel evancy in the search results To boost a term use the caret symbol with a boost factor a number at the end of the term you are searching 28 Conex User s manual 0i144 import documents containing an oil or oil and import will be 4 times more relevant than those containing import only Boolean operators Few term expressions can be combined in a single query through boolean logic operators OR AND NOT By default if no operator is defined boolean OR is assumed For instance author John Mary Jane is equivalent to author John OR Mary OR Jane or author John OR author Mary OR author Jane OR conjunction means that the result may contain at least one of the specified terms either John Mary or Jane OR AND and NOT operators must contain UPPERCASE letters only They can also be substituted with the following symbols OR AND amp amp NOT AND operator If the terms are combined with AND amp amp operator all terms must be presented in the search result title oil AND price Searches the documents that contain oil in the title and price in the text operator prefix indicates a required term The result must contain the term after symbol oil import price Searches the documents that contain oil and optionally im
38. ues will be preserved in the documents Press Ok after editing is done to close the dialog box and apply changes 44 Edit document notes A text note can be attached to a document to describe and annotate it The Notes property differs from others as Conex makes no attempt to fill it automatically at indexing stage it is completely user edited and provides a special user interface to view and edit it To view and edit the document notes select the document in the list and choose Document gt Edit docu ment notes menu item Then edit the text note in a pop up dialog box and press Ok Alternatively the notes can be edited On the Notes tab of the document properties dialog box By quick editing of the Notes property in the Detailed view mode 4 5 Edit document tags Tags are keywords or text labels freely associated with the documents in the repository to provide tag navigation facet There 1s no limits on a number of tags assigned to a single document The tags can contain letters numbers and punctuation characters except for quote marks and spaces Tags are case insensitive so that the tags cats Cats and CATS are interpreted as the same tag To edit the document tags select a document and choose Document Edit document tags menu item The Edit tags dialog box will appear Edit tags Iceberg_frs_UMay_updated doc Tags blog blogger weblog Existing tags ana Sd Conce Doc I
39. y double click it in the list or select it and press 4 button on the list toolbar To edit a saved query before execution select it in the list and press button To delete a query select it and press 3 button 25 Conex User s manual 5 3 4 Associative search On each search query the results are analyzed to identify valuable terms related to the subject of the query These words may give you some hints about what to search also on a subject and are suggested for the next query To add a suggested term into the quick search field press More gt button on the search panel and select a term in the pop up list 5 3 5 Finding similar documents A special type of search 1s finding a set of documents conceptually similar to a given one This allows to identify a set of documents on a specific subject taking a known document as a pattern This function is available from the documents list To find the similar documents select a pattern docu ment in the list and choose Document Find similar documents menu item The search results are displayed in new window as a document list sorted by relevancy to the pattern document Moving Less documents More documents slider you can adjust the relevancy threshold to populate the list To close the documents list window press Close button 26 Appendix A Search query special syntax The special query expression syntax allows to perform complex queries like with Adv

Download Pdf Manuals

image

Related Search

Related Contents

CALENDARME USER MANUAL March 16, 2007 LOGGING IN    IPI, Logicmaster 90-30 Software Package Version 6.50  Memory Stick Duo - Instructions Manuals  Optoma Technology WhiteWolf DF-WW9106F User's Manual  SC-M_11.17  Philips SHH1110 Headphones to phone connector  HS11 Manual - Thermopatch  Ewent EW9172 SATA cable  Philips MCL701 DVD Micro Theater  

Copyright © All rights reserved.
Failed to retrieve file