Home

Intera Deliverable D2.2 Integrated Resource Domain: d22_report

image

Contents

1. Tag value the location jon Unknown Unknown Unspecified Unspecified Edit Africa Africa Asia Asia Remove Europe Europe Australia Australia Oceania Oceania North America North America Middle America Middle America South America South America Click on a column header to sort Note The IMDI tools can only make use of controlled vocabularies that are saved to one of the subfolders contained under user directory IMDI TOOLS CVCACHE Files that are saved anywhere else on the computer cannot be accessed by the IMDI tools Note It 1s recommended that all controlled vocabularies are stored on the Web and are saved in one of the subfolders contained under user directory IMDI TOOLS CVCACHE In this case the IMDI tools will be able to check for newer versions of the corresponding controlled vocabularies and to update the cache accordingly Print Click on Print to print the controlled vocabulary Exit Click on Exit to exit the IMDI CV Editor 184 of 196 Intera D2 2 1 9 2 The metadata category The IMDI CV Editor asks you to specify the following kind of information about the metadata category name tag and title of the category mil E ini x Continents ger Language German w SITUE URL of the file Die Kontinente language URL hitp jwww mpinllMDU Schema Contimentsxm gt ofthe rure A list of li
2. RoleConsultant NameJan Full NameJan Smit Coded Social Family Role Ethnic GroupGoemal Age24 SexMale Educationsecondary schoo Anonymizedfaise Actor Information Role Consultant M ow Name Jan Full Name Jan Smit Code Social Family Role Ethnic Group Goemal Age 24 Sex Male Education secondary school Anonymized false Role The function of the participant in the session Choose a value from the pull down menu or type in an alternative value For example value comments Annotator Author Collector Consultant Computer Depositor Editor Filmer Illustrator Interviewer Photographer Publisher Recorder Referent Researcher Speaker Signer Translator 9 80 Name The name of the actor 1 e the name that other actors in the session use to identify him her It is usually not the same as his her full name See also the field Anonymized below Full Name The full name of the actor See also the field Anonymized below Code Short unique code to identify the actor It usually corresponds to the code that 1s used in transcriptions and annotations to identify parts that were uttered by him her See also the field Anonymized below Social Family Role The social or family role of the actor 1 e his her relationship to other actors participating in the session Choose a value from the pull
3. sek muk 00 00 22 840 00 00 23 660 m ep shin en 00 00 23 780 00 00 25 180 vim muk 00 00 26 090 00 00 26 900 eng nd yemnn e Taglist Viewer la muk 00 00 15 350 00 00 16 170 moe d e moe s oe yi 00 00 16 570 00 00 17 670 TreesSM_1 jr xi 00 00 00 00 00 01 00 00 02 00 00 03 00 00 04 BOW dT t BRONCE Um DEL IU 1 d I FUR Lr UN 1 I8 4 3 l Rts Mag ali Timeline Viewer qoend enno ail 00 00 02 00 00 03 00 00 04 available Viewers taglist timeline textviewe vi vi None i E None E a 4 E E T E Ei eng nd yemnnoe s em muk a t eng d in la m ee CEE saja d in yi sek muk m SST x en nni amma s een g Text Viewer d ennoe a shyep d in toe shyep d in moe d e moe s oe la muk yi o 00 01 620 00 00 02 470 20 00 05 940 mdep shin en 00 00 26 990 00 00 28 240 amma s een mi 00 00 28 920 00 00 29 990 m ep shin Gen 00 00 31 870 00 00 33 350 153 c Right click on a selected annotation file node to open the pull down menu then choose Show from the menu The file content is displayed in the Info Content panel e g Es IMDI BCBrowser for MPI on M03402985 File Options Search Help Metadata Descriptions Tree mi MyCorpus imdi Q MyCorpus e Descriptive
4. Weamg Sarg Link 7EB IMO Fi z sion1 Session1 mdi Ae Session2 Session2 imdi F TKB IMLDI Fi LT specify the links to the IMDI r 134GB 7 Session files b Specify the IMDI Corpus files that belong to the corpus or subcorpus E g in the IMDI Corpus file Goemai imdi 1 e Goemai corpus preliminary version specify that it contains the IMDI Corpus files Natural imdi 1 e Natural data and Elicited imdi 1 e Elicited data HI Exploring D AVYINNTSXPersonalsMDI Metadata files File Edit Miew Tools Help Ef Goemai imdi IMDI Metadata Editor v0 11 Schema P Ea U c eee JEJE x rs Ba ees He Viow Window Hep Contents of D Sal SSE MSIE files Conon cmm Type 3 Elicited immedi 3KB IMDI Fi Corpus IMD Les OKE amp SD Fi Corpus Name Goemai atural imdi 4KB IMDIFi z 3 Session imi FEB IMD Fi Corpus Title Goemai corpus preliminary version 8 Session 2 imdi FRB IMD Fi 3 Session3 imei FEB IKIDI Fi Corpus Links Descriptions Keys Natural imdi Elicited imdi ssion4 imdi 7KB IMDIFi specify the links ssians imai 7KB IMDIFi to the IMDI ir 134GB Z Corpus files 118 To specify a link do the following 1 Click on the Add button The CorpusLink dialog window appears Ef CorpusLink Hame Session 1 Corpus Link Session owe ew emen 2 Enter the name of the session or corpus in the f
5. EE choose Add to Bookmarks from the pull down menu s Tor Bookmar enter a name for the Natural bookmark Cancel The new corpus node is saved as a bookmark in the Bookmarks panel Every time you restart the IMDI Browser you can access your corpus via the Bookmarks panel see the separate manual IMDI Browser for details the bookmark is added to the Bookmarks panel Bookmarks IMEI Corpora HTTP DaBes Corpora CON Corpus CON Corpus impi local 120 APPENDICES Appendix 1 Recurring schemata This appendix gives information about the recurring schemata Descriptions Keys and Access Appendix 1 Recurring Schemata Descriptions A Descriptions schema contains a prose description that could serve as a reminder to you or others of the circumstances of data collection Descriptions schemata are not searchable The Descriptions schema is structured as follows o the language in which the Descriptions Location KES description is written Descriptions e j e g in English anguage English English Goemai Description of the uses oftrees We looked attrees in four different sites all in tf vicinity of Kwande beyond the football field The four media files correspond to these four sites The recording took place in early evening Because ofthe twilig the available descriptions the video recordings are nottoo clear e g there are 2
6. Root file D BirgitiData MyCorpus imdi Basket functions ioi xj E s 3 Rem Cl eee ES ES ES ES 4l G 1 Al None Ej 1 All None media i All None synchronized o ofa E Ini All MESE J 1 AI None o 000002269 g ees By default the Media Synchronized Viewer option displays the video image together with a dialog window This dialog window allows you to select the annotation tier that you want to view and the Viewer in which you want to view it To select an option click 1n the corresponding box to add a checkmark or click on the button All to select all Viewers for one tier Then click on OK to activate the corresponding viewer To deselect an option click in the corresponding box to delete a checkmark or use the buttons None to deselect all Viewers for a tier or Clear to deselect all Viewers Then click on OK to deactivate the corresponding viewer Click on Undo to undo the last selection All Viewers are synchronized in time 1 e when you play the video the other Viewers will display the corresponding annotation highlighted in red color 152 For example Lo D Hrgt Data Trees _Lmpg Subtitle Viewer pori goend ennoe grid E LER P n amp ort Grid View B ort List View goend ennoe t enq nd yemnr 00 00 04 860 00 00 05 940 s em muk a 00 00 08 020 00 00 09 250 f
7. circle of dirt Metadata asec2001May S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi 1 asec2001May18a Jjcircle of dirt Metadata asec2001May S Eisenbeiss corpus 1 1 imdi circle of d 1 01 imdi 11 asec2001May30a Jjcircle of dirt Metadata asec2001May S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi 12 asec2001Sepi7a Jjcircle of dirt Metadata asec2001Sep S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi 13 asec2002Feb27a jcircle of dirt Metadata asec20U02F eb S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi 14 asec2002Mar06a circle of dirt Metadata asec2002Mar S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi 15 asec2002Mar Be _ circle_of_dirt Metadata asec2002Mar S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi 1B asec2001Aug21b Jclothing game Metadata asec2001A S Eisenbeiss corpus 1 1 imdi clothing 9 1 02 imdi 17 asec2002Jan09b j clothing game Metadata asec2002Ja S Eisenbeiss corpus 1 1 imdi clothing 9 1 02 imdi 18 asec2002JanD09d J clothing game Metadata asec2002Ja S Eisenbeiss corpus 1 1 imdi clothing 9 1 02 imdi 18 asec2002Jan10b clothing game Metadata asec2002Ja S Eisenbeiss corpus 1 1 imdi clothing g 1 02 imdi 20 asec2002Jan10d clothing game Metadata asec2002Ja S Eisenbeiss corpus 1 1 imdi clothing g 1 02 imdi 21 asec2002Mar22a endpoint Metadata asec2002Mar22a S Eisenbeiss corpus 1 1 imdi endpoint 1 03 imdi 22 asec2002Mar22b endpoint Metadata asec2002M
8. Resource Link Size 7 000 KB Type jaudio section 2 5 2 4 Format audio section 2 5 2 MED rding Condit st ey N Al as eae 4 section section d 2 5 2 5 2 212 9 End 00 00 43 93 2 5 2 1 Media File This screen contains general information about the media file 1 e Resource ID The IMDI Editor automatically assigns a unique resource identifier to each media file This assignment is automatic and cannot be influenced by the user see section 2 5 Resource Link The link to the corresponding media file 1 e its name and location Note for researchers working at the MPI for Psycholinguistics Nijmegen Please specify only the name of the file do not make use of the open folder icon to the right of the Link field The name of the media file should be the same as the name of the IMDI file see section 1 2 1 11 and as the name entered in the field Session Name see section 2 1 1 Furthermore the file name has to be Unix compatible do not use file names longer than 14 characters do not use non letter or non number characters except for the underscore and do not use blank spaces Size The approximate size of the media file in bytes Note that this field is not standardized it is meant to be read by humans to give them an indication of the size before they e g download 1t not to be processed through the computer Typ
9. a prose description descriptions available one in English one in Goemai a link to another file click here to e g to a pdf file containing the session ea add another Descriptions schema or to remove the highlighted schema Language The language in which the description is written Either type in the language or select the language from the pull down menu Note In earlier versions of the IMDI Editor a code was entered into this field Whenever you open an IMDI file created with such an earlier version this code is displayed e g RFC1766 x sil eng instead of English In newer versions of the Editor the code is still stored in the file itself e g RFCI766 x sil eng but the user interface will display the code as a name e g as English 121 Text A prose description Link A link to an info file that contains further relevant information e g a Session protocol Note for researchers working at the MPI for Psycholinguistics Nijmegen Please specify only the name of the file do not make use of the open folder icon to the right of the Link field And please save the info file as either a txt file a pdf file or an html file You can create descriptions in different languages e g one description in English for the research community and one in the language under investigation for the language community In this case you have
10. but not in resource N10198 Media File In this case the actor should only be linked to N10155 Media File To link a resource click into the empty box to its left A checkmark will appear there 83 2 4 1 4 Descriptions A general description of the individual actor independent of his her Role in the session For example Ea IMDI Metadata Editor File View Options Help D eso o HEU E 3 Actor Summary Actor Information RoleConsultant NameJan Full NameJan Smit Coded Social Family Role Ethnic GroupGoemeal Age24 Seale Education secondar schoo Anonymizedfalse Descriptions description of the actor Emih O dw is one of my main collaborators in Kwande since winter 1998 He acts as a consultant but also helps to establish and organize new contacts He was born in Kwande ca 1877 and he also grew up there Both his parents are Goemai He has never left the area He is the older brothe of L nglish See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 84 2 4 1 5 Languages Information about all the languages that the actor is familiar with independent of whether or not the actor uses them in the session 7 IMDI Metada
11. displays the internal structure of screen headers an IMDI file Repository stores recurring information templates panel 58 2 The IMDI Session Editor This part of the manual guides you through the metadata categories of the IMDI Session Editor as defined in the IMDI 3 0 standard Note that the IMDI Editor also allows you to create project specific profiles containing additional categories However these additional categories are entered into Keys schemata and they do not form part of the IMDI 3 0 standard see section 1 2 3 1 for details on profiles and see Appendix 1 Recurring Schemata Keys for details on Keys schemata They are thus not discussed in this section The IMDI Session Editor creates IMDI Session files imdi that describe audio video and written resources Each file describes a single unit termed session or resource bundle This unit usually corresponds to a meaningful unit of analysis e g to a piece of data having the same overall content the same set of actors and the same location and time e g one elicitation session on topic X or one folktale or one matching game or one conversation between several speakers Every session or resource bundle consists of one metadata file 1 e an IMDI Session file plus an unlimited number of other resources 1 e media files annotation or written resource files and or info files The metadata file contains all the information that is ente
12. h Or use the shortcut key ALT plus the underlined letter s e g to access the item File New press first the keys ALT F and then N 1 Or use the shortcut key that 1s displayed to the right of the corresponding menu item e g to access the item New press the keys CTRL N only available for some items The following menu items are available New Open Click on New to create a new controlled vocabulary Click on Open to open an existing controlled vocabulary Note The IMDI tools make use of a number of predefined controlled vocabularies These vocabularies are stored on the Web and the links to them are stored in the subfolders contained under user directory IMDI TOOLS CVCACHE Any such vocabulary can be opened in the IMDI CV Editor and can then be taken as the basis for creating a new controlled vocabulary in a language other than English see section 9 1 However the predefined vocabularies themselves cannot be modified Show Available CV s a Yocabs overview Click on Show Available CV s to view a list of all controlled vocabularies that are stored in the subfolders contained under user directory IMDI TOOLS CVCACHE The Vocabs overview window opens and displays information on the Name the Tag the Link To Canonical CV the Description and the Language of a controlled vocabulary see section 9 1 for details e g AGN activity Conti
13. Content see section 2 3 Actors see section 2 4 Resources see section 2 5 References see section 2 6 When entering information into the IMDI Editor please keep the following points in mind 1 What kind of information would you want to search for Make sure that such information is entered into standardized fields or Key schemata 2 All information entered on a screen is only relevant to that screen This point is especially important for recurring schemata such as Descriptions Keys Language or Access These schemata occur on a number of different screens and although they always look the same they ask for different information information that is relevant to the corresponding main screen 3 Remember to make use of the Repository panel to store recurring information see section 1 4 Note for researchers working at the MPI for Psycholinguistics Nijmegen The metadata information entered into the IMDI Editor 1s meant to be visible to the outside world via the Web This only concerns the metadata information not the audio video and annotation files But please be aware that other people will have access to your metadata files Keep this in mind when you enter metadata information into the Editor and make sure to exclude all sensitive information 60 2 1 Session This screen contains general information about the session or resource bundle All the information that is entered into the Se
14. GOEMAI There are two options for navigating through the Content panel 1 You can use the mouse click with the mouse into a field to enter information on a sub screen header to activate the corresponding sub screen or on a button to execute the corresponding command 2 Alternatively you can use the following shortcut keys Press TAB to move to the next field or button Press SHIFT TAB to move to the previous field or button When a button is highlighted press SPACE to execute the corresponding command Press the left or right arrow key to move from sub screen to sub screen header 53 1 5 2 Schemata and fields Screens are made up of different units so called schemata Each schema contains a number of boxes into which the actual information is entered These are called fields e g IMDI Metadata Editor Tei es en 9 E TreesSM Project Collector D TreesSM 9 3 Liit Project Collector Content Content Gl Participa 5 Language EJ Access Projects File View Options E Collectors of E session MTreessm mesm Yia Biel ES Descriptions Text Description ofthe uses oftrees We lookg in four different sites all in the vicinity off beyond the football field The four m dia fi X correspond to these four sites The re
15. http www mpi nl IJMDI Schema Continents xml Continents ger Continents ger CGN locName http veww mpi nliCGN Schema CGN locName xml A list of Annotation Unit formats http veww mpi nlidDl SchemaiAnnotationUnit F armat x ml AnnotationUnit F ormat http veww let kun nl sign Iang IMDI vocabs F amily Partn Hearing status ofthe actor s ISO639 eng er xml partner Family Partner Family Partner A list of values for the Type ascpect of Participant http veww mpi nli Dl SchemaiParticipant Type xml Participant Type The length ofthe transcription file in bytes http vwww mpi nl CGN SchemarCGN byteCount xml Number of words in transcription CGN wordCots http veww mpi nl CGN Schema CGN wordCount xml The id s of languages http veww mpi nlillldDI SchemaiLanguagesID xml LanguagesID Cickonacoumeneadel Options for managing controlled vocabularies 190 of 196 Intera D2 2 1 The Vocabs overview window offers you the following options for managing the controlled vocabularies Add File Add URL Remove Read all Refresh selected Refresh all Click on Add File to add a controlled vocabulary file that is stored locally The Open dialog window appears and asks you to navigate to the controlled vocabulary file Click on Add URL to add a controlled vocabulary file that i
16. 2 Click on Metadata Search 2 Use the Metadata Descriptions Tree panel see section 6 3 1 Go to the Metadata Descriptions Tree panel 2 Double click on a node to open it 3 Click on it to select it It will be highlighted in blue color 4 Right click on it to open a pull down menu 5 Select MetaData Search from the pull down menu The following screenshot illustrates the IMDI Metadata Search window Query specification and results search parameters 583 Sessia r MDoGraup Participants Participant s F ullMamae a IMDI Metadata Search El x menu E ese acer T je Timm Fn oe Saul idl actor v 99 Fumane UN Print button search al hits 56 progress http www nni nl warldy corpura mendia ig Natural Conversation Metadarta jest indi search results http www mpi nl world corpora media ig Natural Descriptive Metadata T ees2M imdi u mpi nl world corpora media archive Search Service nts mgpgi nl warldicarparailWDl metadata imci dbf Browse search file Sm p averesultsin results search le damt saving the search results progress data hellwig Watural Descriptive r Metadata buttons to initiate and stop searches This section introduces the search options The following options are available 1 Specifying the corpus to be searched section 7 1 2 Specifying the search parameters section 0 158 Cha
17. It is possible to use project specific controlled vocabularies in a Keys schema 1 e to have a Keys schema display a pull down menu containing predefined values See Appendix 1 Recurring Schemata Keys for details In addition to the standardized fields and the keyword fields there are Descriptions schemata see Appendix 1 Recurring Schemata Descriptions on instructions of how to fill in a Descriptions schema Such a schema contains a prose description that could serve as a reminder to you or others of the circumstances of data collection 56 1 5 3 Links In addition to the information that you enter directly into an IMDI file you can create links to other files 1 e to media files see section 2 5 2 written resource files see section 2 5 3 lexicon files see section 3 1 info files see Appendix 1 Recurring Schemata Descriptions and other IMDI files see section 4 Such links are specified in the IMDI file itself and the corresponding files can then be accessed through the IMDI Browser e g IMDI BCBrowser for MPI on m00402404 7 p em was the only participant in the session The collector does not appear in ES Options execording And there were no other observers present Poft a t E t P uh E E P t ELL EPPS UE P P E E P P P ae Info Con Py Participant Information ici i ion ype Itant Name SM Full Name SM infi Varticipant in
18. bundles see section 3 2 7 2 Conversion of old IMDI files to the 3 0 standard The IMDI Editor version 3 0 supports the IMDI 3 0 standard Files that were created with an older version of the Editor can still be opened but they need to be converted to the new standard The conversion process is as follows 106 1 Open the old file in the IMDI Editor 3 0 see section 1 2 1 6 for how to open a file A warning of the following kind will appear Ea XSL Transformer x A The document has an old format Please transform the document to the correct format From IMDI 1 8 To IMDI 3 0 2 Click on OK to convert the file Otherwise click on Cancel to not convert it The file will be converted to the 3 0 standard using the default profile for that type of file see section 1 2 3 1 for details on profiles A backup copy of the file in its old standard will be saved with the extension old imdi A window appears informing you about this process e g D BirghesSoftwareManualsIMDI Editor Version3 Bl xl E 3 Warnings x avorites Tools Help search Z 4Foldes lt 4 WS GE X A Ea v Go Mame Size Type a IMDI_3 0 xsd 66KB XSD File BH IMDI Editor Manu 6 270KB Microsoft Word D tandard ion cf 2KB CFGFil the file in the old standard has been cpi LAE ers saved with the extension old imdi 2Tress5 2d imdi 17KB IMDIFile Copied unconverted IMDI 1 8 file to
19. language paf N Publications pd I Natora e conversation Dialect corpus information Dialect2 Youth session nodes session data TA destriptive B reese 2 Jfieaking amp Metadata m Speaking wav T Speaking tet 49 annon Info 4 So e LEH List Save Add Remove 6 3 1 Navigating through the corpus tree structure Whenever the IMDI Browser window opens up the node IMDI Corpora is displayed in the Metadata Descriptions Tree panel If you want to access any other predefined part of the corpus do the following 1 Go to the Bookmarks panel see section 6 2 2 Double click on one of the bookmarks The corresponding node is displayed in the Metadata Descriptions Tree panel In the Metadata Descriptions Tree panel make use of the following options to navigate through the corpus e Double click on any node to open it and display the next level in the hierarchy Note The Metadata Descriptions Tree panel distinguishes between open and closed corpus and session nodes These are represented through the following icons Some program commands do not work when a node is closed If any of the icon of an open corpus or session o icon of a closed corpus session commands do not seem to work make sure that the node 1s open Note Because of the large amount of data that 1s loaded it may take some time
20. longer Together with the EMELD project the MPI created a new mapping version which was discussed and except one small adaptation in the mapping approved at the recent E Meld conference in Michigan in July First tests from OLAC as service provider resulted in the fact that they had problems with the many metadata records that the IMDI domain now contains A discussion between the responsible IMDI and OLAC persons resulted in the conclusion that the OLAC service provider has to adapt its services to be able to cope with the IMDI records in a useful way Given very general queries some users may formulate such as give me all resources for German there would be thousands of records like one can expect for very unspecific web queries These thousands of hits would dominate the hits generated from small repositories for example It was agreed that it is a task for the service provider to find a balance The OLAC search engine will be adapted to find this balance in a way defined by the OLAC designers When these adaptations have been carried out all IMDI descriptions will be searchable again via the OLAC interface as well Still it should be noticed that the IMDI metadata descriptions are much richer and that some relevant data will just be ignored As background information we should add that IMDI and OLAC slightly operate from different foci OLAC was designed as an interface layer with semantics close to Dublin Core where many different language reso
21. m TreesSM_1 wav E S G e m TreesSM 2 wav 5 ge M TreesSM 4 wav m TreesSM 5 wav information is 8 rreessW 1 eat s Editor HTML 5 Links ES Ref di d TreesSM 1 bt TE Ispiaye TreesSM 2 bd 4 Descriptions TreesSM 4 bt under the node eei BEES Bie P References Annon Info he sessions TreesLL and TreesAS have a similar content The fieldnotes 25 6 2001 contain drawings the location of the various trees and the location ofthe Source LBHGVD1 SJun01 References 4 speaker and the camera The idea for collecting this pe of data is based on an original idea by David Wilkins Root URL Inttoiwww mpi nlworldicorpora IMDUmetadata IMDLim ho collected video material about a grub species and he trees it affects forthe analysis of demonstratives Basket functions sf List Save Add Remove Clear Type in a description of the references See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema 2 7 Changes to the metadata categories IMDI 3 0 standard The most important change implemented in the IMDI Editor version 3 0 concerns the supported metadata categories the Editor now supports the IMDI 3 0 standard and all IMDI files created with this version of the Editor will automatically conform to this new standard IMDI files created with an older versio
22. to hide a panel Or sms A arrstBenositor tried ta k to make it visible T Projects Hf are now again E Collectors amp r Content 2 35 Participants amp z Languages 2 E Access Note The structure of the IMDI Editor window has changed considerably in version 2 0 see section 1 6 for a summary of the implemented changes The structure of version 3 0 has only changed marginally but there have been a number of changes to the metadata categories see section 2 7 for details 1 2 The menu items The menu items are displayed in the top left corner of the IMDI Editor window To access any item do one of the following a Click on the item b Or use the shortcut key ALT plus the underlined letter s e g to access the item File New Session press first the keys ALT F and then N c Or use the shortcut key that 1s displayed to the right of the corresponding menu item e g to access the item New Session press the keys CTRL N This option 1s only available for some items d Or click on the corresponding icon displayed below the menu bar The following four icons are available New Session Information File View 1 23 1 2 1 File menu The File menu is used to create open close save print and exit IMDI file s It contains the following options Ini xj File View Options Help New AltN Ey New Session Ctrl N f New Le
23. 2 Removing users From the main index page click the remove a user link E Urossable Lorpus Access Kights lanepement Meornceoft Intenet Laorer proweded by MPI Mjmegen In the opened page all users are EEEE EE listed with username and creator as feet bese sono hs Access Rights Management If you want to list only the users Rene se that you have created check the Creator s only box User lastname You can select how you want the list to be ordered from the pull down menu The list can be ordered by user first name user last name user username creator first name or creator last name Click the Del button in front of the user you want to remove You can only remove a user that you have created 2 Adding and removing access rights managers Definers can create access rights managers also definers and give persons the permission to assign access rights to other users for a sub corpus that belongs to your domain of authority The created access right manager inherits your access rights for the specified sub corpus 1 e both can overwrite settings the other made but only the creator has the right to withdraw the given authority 2 1 Add a new access rights manager from top node From the main index window click the add a new access right manager from top node link Select the correct top node from the Top Node pull down menu Next select the person an existing user of the
24. 3 6 2 5 3 3 section EE 2 5 3 5 Derivation ATITIOTaTIO Content Encoding Character Encodi Language ID RFC1766 x sil ank 2 5 3 1 Written Resource This screen contains general information about the written resource 1 e Resource ID The IMDI Editor automatically assigns a unique resource identifier to each written resource This assignment is automatic and cannot be influenced by the user see section 2 5 Date The date when the written resource was created Please enter the date in the following format YYYY MM DD e g 2000 12 30 Resource Link The link to the file containing the written resource 1 e its name and location Note for researchers working at the MPI for Psycholinguistics Nijmegen Please specify only the name of the file do not make use of the open folder icon to the right of the Link field The name of the written resource should be the same as the name of the IMDI file see section 1 2 1 11 and as the name entered in the field Session Name see section 2 1 1 Furthermore the file name has to be Unix compatible do not use file names longer than 14 characters do not use non letter or non number characters except for the underscore and do not use blank spaces 97 Media Resource The link to the media file from which the written resource originated if applicable Anonymous Specify wh
25. 9 E Animals displayed in I ran the I Anmastt Imfo Content content 1s 4 Annon Info Source LBHGA References Kj TreesSM Kwande Metadata FrogStory Metadata Root URL file D BirgitiData MyCaorpus imdi Basket functions Description shv3 0 601 econv H ref ANIMALS 000 i EUDICOTO 0 0 EUDICOt 7 624 H EUDICOp J trs To moe shin aram J imbto moe shin aram I figl okay 1PI do story psinterj prony n H ft Okay we tell a story i s0u J 30 11 2000 ef ANIMALS 001 hEubIcoto 7 624 hEubicott 31 669 Note There are no restrictions on the format of your annotation files ELAN Media Tagger Shoebox CHAT etc Note You can constantly update your annotation files provided you do not change their file names which allows you to reflect changes in your analysis But note that for the moment only ELAN can be directly accessed via the IMDI Browser If you want to edit annotation files of a different format please navigate to the physical location of that file by using e g Windows Explorer 154 6 3 4 3 Info files Both corpus and session nodes can contain links to info files These files provide general background information about the corpus or the individual session If such a file is an HTML file click on it its content will automatically be displayed in the Info Content panel If the file is a PDF file
26. Bookmarks Description Worldmap HTTP global Worldmap global if The Goemai projectis part of a PhD project begun in August 1998 There are two aims to this project 1 To investigate the E semantics and pragmatics of locative verbs and derived elements Most ofthe collected IMDI corpora global IMDI corpora HTTP global MPI corpora global MPI Corpora HTTP global Metadata Descriptions Tree 9 Language amp Cognition Project information e africa Name Goemai e Goemai Title Documentation of Goemai Hellwig 2 Fa IMDI Metadata Editor inl x i Project paf i language pdf Q Publications paf 9 8 Natural conversation Q descriptive 9 E Treessm Q Metadata Location File View Options Help MACS HTML E Links 4 b L9 me Documenistoncfoomal information under Project is displayed under the node Project Root URL httpveww mpi nl world corpara llMDl metada egen The Netherlands Basket functions Cus Cee ae Cm Organisation Max Planck Institut f r Psycholinguistik Descriptions Language English locative verbs and derived elements Most of the collected experimental data is concerned with this zs opic 2 To document the Goemai language in form of a reference qrammar a dictionary and an annotated 65 Enter t
27. Click on No to overwrite the existing file with the new language Click on the cross icon in the upper right corner to return to the IMDI CV Editor without making any changes 186 of 196 Intera D2 2 1 If you have selected either Yes or No the IMDI CV Editor will display the underlying category plus the new language e g ga Controlled Vocabulary Editor Continents ger xml Continents ger Language ini x the German language version of the underlying category URL http viwww mpi nilMDWSch Description i af linguistic continents underlying category You can now edit the file e g give it a German Tag Title and Description see above and enter the appropriate German translation of the values see section 9 3 Note Please do not change the Name and the file name as it appears in the URL e The Name contains the name of the underlying metadata category plus the appropriate language extension e g Continents ger e The file name in the URL only contains the underlying metadata category e g Continents e The file will then automatically be saved under the name of the category plus the appropriate language extensions e g Continents ger This procedure is necessary for the IMDI tools to understand that the different language versions go back to the same canonical vocabulary The relationships between different language ve
28. Corpora stored at other archives or institutes follow different rules Notation Conventions The following notation conventions are used Screens schemata and fields of the Editor are written in the font MS Sans Serif SHORTCUT KEYS ARE WRITTEN IN SMALL CAPS Text in double quotes refers to menu items and screen displays Information on troubleshooting starts as follows 17 Table of Contents 1 Navigating through the IMDI Editor 22 1 4 The IMDI Editor window e eese eee nee rennen nnn nn nnn 22 1 2 THe menu HEMS uie caer ree eMe UR DRE EE NE RED R METRE ERE 23 1 21 SFe euin e De 24 BATANG N o aceite ot E ES 24 Meo led NEW SESSION iscusdease ades exceeded eaae a te Ede pdud ede dusud eue 24 1 2 1 3 New Lexicon Resource BUNle c ccccceceseeceeeeeeeseeeseeeeseeeseeees 24 1 22 4 NCW CONUS d ae p MEE EEUU 25 1 2 21 9 INOW AS alalogble ah dedere ei PEE GECERECE ERR DE ERI 25 EARO F c m 25 15217 Oper UR e duse att ce Sate Se ce Scie Gece ta Edd ada rectae itd due de 25 1 2 1 9 OMEM RECON iiid duties ines uds pet E 25 QR Ae ONO SC c M M 25 1 2 DOCS lt JA Adoos a is eee reste Mt rete pL MED ee NE 25 IRURE TR t 26 LZ Fs Aa gt Su x M CRE nO AAPA AM ANN AE 26 UE ou o SWEA ict 26 1 2 1 14 Page Setup uusssssssssssssseseeeeennnen mener nnn nnne nns 26 122
29. Domain has to be delivered in two steps as a prototype and demonstrator The implementation of the prototype required much technology preparation work to adapt metadata technology to the new IMDI version and to validate incoming metadata descriptions to integrate the metadata descriptions and to offer the integrated metadata domain via a well defined portal So all technology work is seen as preparatory to realize the demonstrator Therefore this report will touch the development work carried out and briefly describe the prototype 1 Technology Work The technology work is based on the new IMDI version that was discussed at the International IMDI Workshop in Nijmegen in November 2002 It was decided that the IMDI set will be changed and simplified slightly and that it will cover also metadata elements for written resources and lexicons For details we refer to the appendix C where the new IMDI set is listed The discussions with the international experts took much longer then expected until the new standard was accepted by the IMDI community In July 2003 the new version of the IMDI standard could be finished as IMDI 3 0 3 A complete set of XML schemas was developed to define the set and the needed controlled vocabularies see appendix D Further the IMDI infrastructure was influenced by the experience of the users with the old tool versions The new tools were adapted and new functionality was added One essential extension was made to the IMDI
30. Entry x i be used ifthe continentis Africa Description information to be entered OK a Value contains the underlying value Tag contains the form that is displayed in the user interface of the IMDI tools and Description contains a prose description of the Tag e The Value is obligatory e The Tag is optional It is only necessary if you have a single underlying controlled vocabulary a so called canonical vocabulary that corresponds to vocabularies in different languages For example in the above illustration the underlying value is Africa the Value and the Tag is Afrika 1 e the German word for Africa That is the IMDI tools will display the underlying value Africa as Afrika in the user interface e The Description is optional It contains descriptive information that is displayed in the IMDI tools whenever the tool tip is activated That 1s 1f you point with the mouse to a value a blue box containing the Description will be displayed 189 of 196 Intera D2 2 1 10 Working with controlled vocabularies in the IMDI Metadata Editor The IMDI Metadata Editor is a tool that allows you to create metadata information As such it makes frequent use of controlled vocabularies both predefined vocabularies and user defined vocabularies This section of the manual describes how the IMDI Metadata Editor uses the controlled vocabularies that were created in the IMDI CV Edi
31. IMDI Editor Country The country where the session data was collected Choose the country from the pull down menu or type it in It is a closed vocabulary 1 e only values listed in the pull down menu are accepted by the IMDI Editor Address The address where the session data was collected e g at school XY Region The region province town suburb etc where the session data was collected To add a region click on the Add button to remove a region click on it to highlight it and then click on the Remove button 63 2 1 4 Keys Keywords that are relevant to a the collection of the data and b the creation of the IMDI file For example Descriptions Location Keys screens of the IMDI person Keys file that are not yet who created gt us filled in the IMDI file Missing_information annotation unit TreesSM_3 mpq is lost things that need to be checked e g a lost file 2 Clear Session See Appendix 1 Recurring Schemata Keys for instructions on how to fill in a Keys schema 64 2 2 Project This screen contains information about the project within which the session data was collected All the information that is entered into the Project screen will be displayed in the IMDI Browser in the following way see the IMDI Browser manual for details IMDI BCBrowser for MPI on M03402985 Ioj x File Options Search Help
32. Kwande wav Worldmap HTTP global C mems Selected file Mead n standard sessios tf IMDI corpora HTTP global 3 TreesSm imi E Lue conor HTTP l soba i Filename MyCorpus imdi Open Metadata Desc Descri Files of type All Files Cancel E file is displayed in the eime IMD Browser window Basket functions List Save Add Remove Clear Exit Click on Exit to exit the IMDI Browser window 135 6 1 2 Options menu The Options menu supports the following four options Preferences Update Policy Note that the Exit item turns into a Close item when you open a second IMDI Browser window either through the option Clone Node see section 6 3 1 or through accessing your search results see section 7 4 If you click on Close only the current IMDI Browser window will be closed O x Options Search Help Ee IMDI BCBrowser for Preferences Configuration info i Check CV s 1 Show Hide bookmarks Click on Preferences to select the update policy and the fonts The Preferences dialog window appears offering you the following two options Click on Update Policy to determine the policy for updating controlled vocabularies stored in the cache The following policies are available E Preferences c Update Policy click here to select a policy x The user cache containing used Controlled Vocabularies can be updated automatically ollowin
33. Metadata Descriptions Tree panel 2 In the Metadata Descriptions Tree panel click on the info file world html The world map is displayed Info Content panel 3 To navigate you have the following options e Click on any part of the world map to enlarge the corresponding region e Make use of the Backward and Forward buttons to move to previous and subsequent views e Click on any language to open the corresponding node in the Metadata Descriptions Tree panel 144 For example B IMDI BCBrowser for MPI on M03402985 File Options Search Help click here to Inti move Bi No description d LL 4 back forward Bookmarks Description 1 click here to gain access to MPI corpora the world map MPI Corpora HTTP 2 click here to open the world map ESF corpora to enlarge the corresponding region e g MPI Nijmegen Acquisitid MPI Language and Cog LUND Demo DoBes corpora Root URL lhttpwww mpi nl world corpora media archive maps world Basket functions List Save Add Remove Clear udin V ers i P oj ay 4 e M LS Meta Descriptions Tree E d sg eS e JEN E F J Xwrfiellwig corpus im puta u G i H Hellwig E zm a S Lo TEM 4 click ona m language to open the corresponding corpus node 6 3 3 Selecting parts of the corpus for purposes of searching basket f
34. Of Metadata 4 Location o Keys 2 0 o Project display of metadata Collector Content information 4 Participants transcription pdf m TreesSM_1 mpg m TreesSM_1 wav metadata T TreesSM_1 eaf Annon Info nodes Source LBHGYD15Jun01 2 DV Referentes Root URL ffile DBirgit Data MyCorpus imdi Basket functions Name TreesSM Title The uses oftrees Date 2001 06 25 Recording date 2001 06 148 Furthermore you can access all metadata information through the session node Double click on a session node to open it then click on it to select it and right click to open a pull down menu The pull down menu offers you the following options e Choose Show As HTML from the menu The metadata information is displayed in an HTML format in the Info Content panel e g IMDI BCBrowser for MPI on M03402985 E x File Options Search Help select Show AS Metadata Descriptions Tree H TM L from the Xm MyCorpus imdi e MyCorpus pull down menu 9 f 8 Deserfti double click click 7 ELAN and right click on a mier session node AB aceiodeum Session Count TreesSM The uses of trees 2001 06 25 List Sessions escription of the uses of Metadata Add to Bookmarks the HTML version 1S ees We looked at trees in FrogStory Remove fro
35. Repository Participants panel gt 0 9 nu 1 4 5 Importing nodes into the repository You can import a node into the repository In this case the IMDI Editor creates a copy of the original file and stores this copy in the folder user directory IMDI TOOLS SDRCache Note If you have old templates that were created with an earlier version of the IMDI Editor 1 e earlier than version 2 0 you need to import them into the Repository panel To import a node do the following 1 Make use of the Import options 9 amp 3 Local Repository Collectors a Either click on the Import button e g E Content E Participants Languages amp Access Projects 7 New Folder button 47 b Or right click somewhere in the gt Put Data Repository panel and then 4 Get Data click on the Import item of the pull down menu e g 1 New Group Rename Group q Import The Import Repository dialog window appears 2 Browse to the file that contains the repository and click on OKR f Import Repository x ication vINNT PersonaltestMyParticipantsirepository xm Browse Enter the name and location of the repository The file 1s displayed as an additional node in the Repository panel e g 9 Local Repository Collectors Content Ej E Participants c Languages
36. WANG Wa GCS esscscsceeecoeucees cota ee ee eate apes a dea e een aus 85 LEN ME e d 89 243 2 JACIOISDeSCHDLHIOFLIS eei cina uium DD 90 29 S queer 91 ox MEN eio mt 92 2 9 2 Media CS ei d conim pi eme MM MEME EUM NM 93 2 9 2 1 Media Elle sse inito head adu se s SAN ESSEN ERRARE UDUUME 94 19 2922 TANS Posion ecoute cee a eue acce oreo nca devo acer eda a a sea ii 95 2 92 9 JAOCOSS hee rites sats ca ML MEM M 95 2O24 IKOVS 95 ZOO WWSSCIDUONS ecse e a 95 2 5 3 Written Resources ccccccecccsecseeecseeceeeceueccaeeceecseeceueceueesaeesseeseessees 96 ZO Witten ROSOUCE 97 2992 MOMAN enen HERO eas aceasta 99 2933 Valdagno NANE eee 101 295 34 ACCESS oie ni HIR aan aaa Aa 101 DAR SS eS S1 TE 101 2 9 9 0 JDOSOFIDIHOD eee eee teen ee eee ee ee een ce ce ee ee ee ee eee 101 2o SOUCO S da eden deasdetedenc dates 102 AOAI DOUC a a OnE EU UE OR UE 103 ZZ POSON P M 103 2 5 4 3 Resource References ccccecccseccceeeeceeseceeecsaeeceeessueessueeesseeess 104 20 44 PCCCSS ciiciiiee iHi hes 104 RS cc LM 104 94D DOSCLIDIOIIS seese e ste tete pedit iE 104 2 6 References eeceeeeeeeeeieee eene nene nnne h nnn nnns ann s anna ans ennnen nenanem 105 2 7 Changes to the metadata categories IMDI 3 0 standard 105 2 7 1 Changes t
37. a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 101 2 5 4 Sources Each source e g video or audio tapes or books associated with the session has to be listed on the Sources screen To add a source see also section 1 3 do the following Es IMDI Metadata Editor File View Options Help 5 aeo Links 4 HTML unknown the list of sources click on Add to add another source to edit or remove a source click on the source to highlight it then click on Edit or Remove The Add and Edit buttons give you access to the Source screen that contains the following kind of information IMDI Metadata Editor File View Options Help section M e Cee ae h1 section 2 5 4 6 102 2 5 4 1 This screen contains general information about the source 1 e Source ID A short code to identify the source Note for researchers working at the MPI for Psycholinguistics Nijmegen Please consult the tape labeling conventions Format The physical storage format of the source Choose a value from the pull down menu or type in an alternative value value comments OQ Compact Cassette 1 e normal audio cassette CD Compact Disc CDR
38. at the MPI for Psycholinguistics gt um eine frei verf gbare Test Resource zur Verf gung stellen zu stik umfasst Die Annotationen wurden von Peter und Kita dem Gestik n fot Submit Quer ee sili e e T e ro ELE y wl C SURU CEE Q kleve ri n amp o dep Internet P Figure 8 indicates the Google like search field allowing the user to do unstructured search on the structured IMDI files Version Converter A converter program that converts IMDI 2 x into IMDI 3 03 was created and was successfully tested The IMDI conversion was carried out in March 2004 This conversion has many aspects that have to be considered carefully and many institutions that make use of IMDI have to be informed So the step over to IMDI 3 03 is a major step and has consequences also to many management and index creation programs that have been developed during the past years The adaptation of all programs is almost finished so that the move to the new IMDI version can be made soon The converter is available also on the IMDI site The editor was extended such that it can read in old IMDI files and transforms them to new ones The user is warned that transformations will be done so that he she can manually intervene if necessary IMDI to OLAC Gateway Both relevant metadata initiatives for language resources have created new versions that can be seen as major changes So the old mappings between OLAC and IMDI were not valid any
39. box appears 5 Specify a name for the bookmark and click on OK The new bookmark is added to the Bookmarks panel and will be marked as personal 139 E g inx Et x File Options Search Help Bookmarks Name for Bookmark L amp C Cancel specify a name for the bookmark and click on OK Worldmap HTTP global Worldmap global IMDI corpora global IMDI corpora HTTP global MPI corpora global MPI Corpora HTTP global Metadata Descriptions Tree dt MPLimdi e MPI corpora i index html Acquisitios o etanguage amp Cognition MetaData Search Show Services Session Count List Sessions kA northamerica Add to Bookmarks kJ soOnthecerica Dna y r navigate to a node select it right click on it and select Add to MPI Corpora HTTH Bookmarks from the pull down tont personal menu Z CGN eu HTT Search australia fareast mesoamerica E IMDI BCBrowser fi File Options Sear the bookmark 1s Bookmarks added and MPI corpora marked as ee etd n O x gion ai L amp C personal file Im The bookmarks are saved permanently by the IMDI Browser 1 e they remain available every time you restart the Browser To remove a bookmark do the following l In the Bookmarks panel click on the bookmark that you want to remove It will be highlighted in blue color 2 Click with the right mouse butto
40. click on it and then right click on it to open the pull down menu Choose Open with Acrobat Reader 5 0 from the pull down menu Acrobat Reader will open up to display the file For example oN IMDI BCBrowser for MPI on M03402985 File Options Search Help Metadata Descriptions Tree Description 9 N IMDI omen indexh PI corpora i index html e Language amp Cognition Q africa Q Goemai Ene 9 8 Hellwig n sd i mit display of html file C anguege pdt i Public tions pdf j home introduction research people facilities e N Natural display of 2 Root URL file pdf file INata IMDLimdi N Browsable Corpus on the web Basket function Max Rlanck offers access to a large amount of linguistic4da B Acrobat Reader Project pill LIEN of data you can have acet oust s Sa Bis Edit Document yiew Wijde Eili Slax SS nea HRA BAO Hos OO A Lac tis verbis deictic elassifiers amd noni classification in a number of comexts Gormal xpeakers are forced io pay atienneon io the enia c inis Lg Ten iboe an sin oe Spatsal ui fonat is obligaborily essposded in slave ve verb ia the lacie nist in deae ele fci ot the decere word nd i xaliars sh i n the popen eonsmnacti n Furhenmorne i d exstuonadlh adedi i the kem of pihaa vers origina an serial conetnactiom and m iha forma of copia in vquaive amd possessive
41. corpus If you have selected a part the List button at the bottom of the Metadata Descriptions Tree panel is highlighted in red color 160 3 For searches through selected parts of a corpus make sure that the list displayed under the List button at the bottom of the Metadata Descriptions Tree panel contains all relevant nodes 7 1 2 Creating a new data file Many corpora provide you with predefined data files Such corpora are visualized in the Metadata Descriptions Tree panel by means of a white C on a black background see section 7 1 1 above However it 1s also possible to create your own data file To do so do the following 2 In the pull down menu click on Create MD search DB No Descriptions zi select an Option Ei IMDI BCBrowser for MPI on m03403023 File Options Search Help Jelx Bookmarks Worldmap HTTP Worldmap IMDI corpora IMDI corpora HTTP global global global global Info Content Re creating the Search DB L D i li k u E can take a some time Continue ouble click on a corpus mem Dye mo cancer node to open 1t click on it to select 1t and then right click to open a pull down menu a nS A 3 A window pops up warning you that the creation of data files takes time For large corpora it may take up to 30 minutes If you want to create the file click Open Otherwi
42. ep ore em Gr eleg n Arn bn pil e dab equo poiim Uerum drjrern pls eri em eret mera perl pon len pe lr gem i abren pn hla etl rp and font here Armenian gample from The WorkTe Writing roa select the unicode block X Rem tene we the selected unicode block and font are displayed click on OK a 36 1 2 4 Help menu Click on Help to view information about the IMDI Editor e g _ Build 80 2003 10 01 16 0 1 3 The Browse panel The Browse panel allows you to browse through one or more IMDI Session Lexicon or Corpus files It displays all currently open IMDI files e g Bi IMDI Metadata Editor EE _ inci x File View Options Help IMDI Session files Project 4 Content currently active GoEMAI IMDI file gt Actors internal structure of an highlighted in Resources IMDI Session file m kwande wav Kwande tet G LBHGACOTDecUD D1 Access References e Mycorpu Descriptive blue color IMDI Corpus file and its internal structure FrogStory 3 The Browse panel makes use of the following icons m 2 session node corresponding to an IMDI Session or Lexicon file see sections 2 and 3 corpus node corresponding to an IMDI Corpus file see section 4 metadata node media file node written resource or transcription file node lexicon node corpus link node Under each session or corpus
43. identical to the participant of query 2 e g X Please ignore these variables if you specify one query only 7 2 Changing the profile Section 0 above has shown that the categories to be searched are displayed in form of pull down menus These categories are specified in a so called profile stored in the folder user directory IMDI TOOLS Profiles local To change the profile do the following 1 Click on File menu 2 Click on Change Profile The Open dialog window appears and displays all available profiles e g xi al gt cal ca eR 3 SESSION Profile xml 3 SignLanguage Profile xml Loo available profiles in the folder user directory IMDI TOOLS Profiles local File name open Files of type All Files x Cancel 3 Double click on the profile you want to use The IMDI Metadata Search window will display the chosen profile E g the following illustration displays some categories that are specific to sign language research and that are defined in the profile SignLanguage Profile xml ioi x Session Speaker Signer X Key Deafness Status Deafness Status hearing Note For the moment profiles have to be programmed by hand In future versions of the IMDI tools it will be possible to create profiles with the help of the IMDI Metadata Editor 7 3 Initiating and stopping the search Click the Search button at
44. larger searchable corpus the whole corpus will be searched not just the subcorpus E g F IMDI BCBrowser for MPI on M3402985 i m x File Options Search Help Metadata Descri blue icon IMDI corpora cannot be searched xm IMDLimdi gt i IMDI pal meer tine PAR MPI corpora l isiti ieee black icon MPI Acquisition and Language M Acquisition amp N africa amp H australia l amp N fareast blue icon below black icon 1 e below amp H mesoamerica eod Language amp Cognition corpus N northamerica H southamerica the data file of the whole Language amp amp N oceania Cognition corpus will be searched T Sign Language A Ey By defa Root URL file IIsun 3rCorporallMDlimetadata MDI imd corpus However it is possible Basket functions this case you have to manuall EN sme EUN Feme ww fwser To select nodes make u S9 __ Remove EM ttom of the Metadata Descrip sx ave selected node s metadata search is automatically Tested to the lege node s Again you can only search parts of corpora that have a corresponding data file Note If the search process does not yield the expected results please check the following possibilities 1 Make sure that the data file displayed next to Search Service is the correct file 2 For searches through an entire corpus make sure that you have not accidentally selected a part of the
45. number 14 Vved Feb 12 2003 Generated Corpus 15 A generated corpus with sequence number 15 ved Feb 12 2003 Generated Corpus 16 A generated corpus with sequence number 16 wed Feb 12 2003 Generated Corpus 17 A generated corpus with sequence number 17 wed Feb 12 2003 Generated Corpus 18 A generated corpus with sequence number 18 ved Feb 12 2003 Generated Corpus 19 A generated corpus with sequence number 19 wed Feb 12 2003 Generated Corpus 2 A generated corpus with sequence number 2 wied Feb 12 2003 Generated Corpus 20 A generated corpus with sequence number 20 wed Feb 12 2003 Generated Corpus 3 A generated corpus with sequence number 3 wyedFeb12 2003 Generated Corpus 4 A generated corpus with sequence number 4 Wied Feb 12 2003 Generated Corpus 5 A generated corpus with sequence number 5 vred Feb 12 2003 Generated Corpus 6 A generated corpus with sequence number 6 Vved Feb 12 2003 Generated Corpus 7 A generated corpus with sequence number 7 wed Feb 12 2003 Generated Corpus 8 A generated corpus with sequence number 8 wed Feb 12 2003 Generated Corpus 9 A generated corpus with sequence number 9 Vved Feb 12 2003 D generated imdi imdi file Eh IMDIxsd lt xsd schema imp 3 0xsd xsd schema O folders 24 files fs e g 2 2 e e s s 2 2 e e s 2 22 2 2 e 3 3 Due to the necessary debugging of the editor code and to the lost of the programmer of the editor further work
46. on its values These constraints are visualized in the form of pull down menus of controlled vocabulary items e g hearing hard of hearing etc Q 4 Actors Actor 1 Resources References Deafness AidType LanguageExperience ExposureAge lageExperience AcquisitionLocation anguageExperience SignTeaching Family Mother Deafness Unspecified fother PrimaryCommunicationForm Unspecified Family Father Deafness Unspecified ather PrimaryCommunicationForm Unspecified Family Partner Deafness Unspecified artner PrimaryCommunicationForm Unspecified Education Age Unspecified Education SchoolType Unspecified Education ClassKind Unspecified Education EducationModel Unspecified cation Location EducaWqn BoardingSchool Unspecified Unspecified 29 To manage types and profiles click on Profile Manager The following options are available Add a new profile B Profile Manager x Click on the type where you dair ea ae want to add the profile Then nee UE Bundle Add URL click on Add to add a 3 corpora Delete locally stored file or on Add S URL to add a file stored on ane the Web A dialog window appears allowing you
47. on the Tree Copier had to be postponed Browser The metadata browser is another true IMDI tool operating in distributed IMDI environments and directly on the IMDI XML files It can be used to easily navigate in the IMDI domain with the help of browsing and searching It also allows to set bookmarks to nodes such that users can easily go to sub corpora The browser is also the basis of all advanced options such as the domain interlinking proposed for the INTERA project to immediately start tools when having found suitable resources Of course the browser has facilities to support special project files The details are described in the manual which is attached as appendix B Figure 5 gives an idea about the user interface Ee IMDI BCBrowser for MPI on m03403001 File Options Search Help Bookmarks Worldmap HTTP Worldmap IMDI corpora IMDI corpora HTTP MPI corpora MPI Corpora HTTP a aa global global global global global global Metadata Descriptions Tree Xml IMDI imdi Q IMDI corpora i index html 9 MPI corpora i HTMLcorpusContent html 9 Acquisition i Generallnfo htrl N L1 Acquisition Max Miller P Brown Caroline corpus S Eisenbeiss Bhuvana Narasimhan Gaby Cablitz Q Stoll corpora 9 8 is 9 E o1 10810 Metadata m 4001_10810 wav T 01 10810 cha References 03 10803 0 Root URL Ihtto iNww mpi nllwarldicorpora IMD metadata IMDLirndi Ba
48. resource N10155 Media File but not in resource N10198 Media File In this case the language should only be linked to N10155 Media File To link a resource click into the empty box to its left A checkmark will appear there 2 3 3 2 Languages Descriptions Enter a description of the role of each language as it is used in the session e g language of elicitation main language code switching etc Note that the description should not contain background information about the language in general reserve this for the Language sub screen see section 2 3 3 1 See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 76 2 3 4 Keys This sub screen contains keywords that are relevant to the content of the session For example Schema vIiMDI 1 8 keywords that are relevant to the Resources References content e g peech pointing emi spontaneous Involvement non elicited Value Il Add topic E goal demonstratiye space pes the topic the linguistic interest in the session 2 Clear Content See Appendix 1 Recurring Schemata Keys for instructions on how to fill in a Keys schema T1 2 4 Actors This screen contains information about the actors involved in the s
49. rights for groups 176 4 1 Defining access rights for groups 176 4 2 Remove access rights for a group 177 5 Listaccess FIGIIES uiuo ni Des se rataiie a ui see e eR c 177 171 1 Adding and removing users This part of the manual describes how authorized persons can add potential new users and remove users There are two kinds of users e definers e normal users Only definers can give access rights to other normal users and can create other definers Definers are always connected to a certain level 1n the hierarchical tree structure Therefore definers can only lend access rights for the current level in the tree or any level below 1 1 Adding a new users From the main index page click the add a user link In the opened form the definer has to provide the following date concerning the potential user e first name or initials e prefix optional e username e password the user can later modify this password e user class class is normal for normal users or definer The created definer can create new users but is not able to set access rights To solve this a Node must be assigned additionally to the definer see chapter Adding and removing access rights managers e postal address optional e email address potential new users have to accept an agreement sent to them by email see chapter add a new group 172 1
50. section 7 1 4 3 Adding or deleting a search query section 7 1 5 7 1 3 Selecting the category to be searched The categories that can be searched are displayed in form of pull down menus By default Session and Name are displayed These two options allow you to search for all sessions with a particular name as in the illustration above Click on the pull down menus to select other categories e The first category is always Session i e you will always search for a session e The second category is chosen from the following list Name Date Location Key Participant or Content 1 e you will search for a session that has a certain name or a certain recording date etc e Depending on your choice for the second category further pull down menus will appear to narrow down the category Note If too many categories and pull down menus are added the IMDI Metadata Search window cannot display them all To increase its size click on the full screen icon in the top right corner of the window ni 162 Some examples of metadata searches are given below 1 The screenshot below illustrates a search for all sessions that were recorded in the Netherlands In this case Session and Location were chosen as the first two categories A third pull down menu appeared subdividing the category Location into Continent Country and Region When Country was chosen a fourth pull down menu appeared displaying the names of all countr
51. the Content screen Please remove this keyword manually and fill in the Genre and Subgenre fields by hand 2 Errors concerning then usage of wrong values e g the usage of a wrong format such as a wrong format for a date or an age or the usage of a value that 1s not specified in a closed controlled vocabulary see section 1 2 3 3 for controlled vocabularies In all these cases you will receive the following message informing you that such errors were detected Ey 3 Warnings x A Some CViconstraints errors were found ees 108 activate the tool tip In the IMDI file itself such errors are highlighted in orange color and a description of the error is added to the tool tip which is activated by pointing with the mouse to the category label You can navigate to these errors by using the two arrow buttons at the top of the Content panel e g IMDI Metadata Editor loj x File View Options Help HTML Links use these buttons to move to the previous next error the error is highlighted in orange color Continent METATRANS CRIPT Session MD Group Lacation rror Description q The value is not a valid entry for the closed controlled vocabulary Continents at http ev mpi nlAMDl Schema Continents xml A point here to a description of the error is added to the too
52. the involvement of the researcher s the social context the event structure and the channel All six are closed vocabularies 1 e a value from the pull down menu must be chosen other values will not be accepted Enter the following information e Interactivity The level of participant interaction 1 e value comments interactive an interaction between at least two participants It may or may not include an investigator e g e conversation e many narratives e matching game non interactive a monologue produced without expecting extended verbal responses from the hearer s e g e many oratory texts and songs e some narratives semi interactive primarily a monologue but punctuated by repeated interjections from the hearer s e g e achild interrupting a narrative e hearer s repeatedly prompting a narrator 70 e Planning Type The degree of planning through the consultant 1 e value Spontaneous 9 semi spontaneous planned e Involvement comments an unprompted speech whose topic is not determined by the investigator or an observer e g e conversation e chatting e joke telling e singing while harvesting a prompted speech whose topic is determined in some way by an investigator or a community member but whose participants speak freely within this context e g e interview e queries e g Tell me about the history of your village Show me how to make to
53. to browse to the file or to enter its URL Delete a profile Click on the profile that you want to delete Then click on Delete Select the default profile for a type Click on the profile that you want to make the default profile for a type Then click on Make Default From now on this profile will automatically be used whenever you create a new IMDI file based on this type 1 e for new Session files see section 1 2 1 2 new Lexicon files see section 1 2 1 3 or new Corpus files see section 1 2 1 4 The non defaults can still be accessed through the menu item File New see section 1 2 1 1 Click on OK to exit the Profile Manager window again Note Profiles are stored in the folder user directory IMDI TOOLS Profiles local For the moment profiles have to be programmed by hand In future versions of the IMDI tools it will be possible to create profiles with the help of the IMDI Editor 1 2 3 2 Anonyms Anonyms is used whenever you have chosen to make the name of an actor anonymous 1 e whenever you have chosen to enter a code instead of the full name into the field Full Name on the screen Actor see section 2 4 1 1 If you have done so the option Anonyms allows you to create a file that maps this code to a full name You and everybody whom you grant access to this file will be able to see the full name while everybody else will only see
54. to add another Descriptions schema Do the following e Click on the Add button to add a new schema e All schemata are displayed at the right of the Descriptions schema They are listed under the language name as defined in the Language field e To access a schema highlight it by clicking on it It appears in blue color and its content 1s displayed The content of all other schemata 1s hidden e To delete a schema highlight it by clicking on it Then click on the Remove button to delete it 122 Appendix 1 Recurring Schemata Keys Keys contain user defined information that is a project specific b not taken care of in the standardized fields and c meant to be searched For example Bj IMDI Metadata Editor Oj x File View Options Help E 8 project specific keywords for the actor TreesSM Project Content Q9 Actors E Meter J i Summary Actor Informat 5 Editor HTML Links 4 Resources RoleUnspecif References Code AgeUnspecitied AnonymizedUnspecified Key Name dialect Hausa English 9 amp Local Repository Projects Content Gl Actors c Languages E Access amp Local Old IMDI Version g To add a key do the following 1 Click on the Add button A new key will be added 2 Specify a Key name The name has to be a single wor
55. was stated in the INTERA proposal the delivered IMDI domain is not just a demonstrator but will be a service that will be further supported by the participating institutions The availability of metadata is a must not only for resource discovery purposes but is also a basis for efficient management The IMDI infrastructure is set up in a way that all interested institutions such as ELRA for example could easily create their own portal and offer the total or a selected domain Of course all IMDI tools are Open Source and free 15 II II IV Appendix A IMDI Editor version 3 0 Manual This manual was last updated 22 Oct 2003 The latest version of the manual can be downloaded from the following webpage http www mpi nl IMDI tools Author Birgit Hellwig Introduction The IMDI ISLE Metadata Initiative Editor was developed at the Max Planck Institute for Psycholinguistics Nijmegen The Netherlands It 1s used to create metadata descriptions both for sessions or resource bundles and for corpora As such it complements the other IMDI tools At the moment the following IMDI tools exist tools and manuals can be downloaded tom http www mpi nl IMDI tools e IMDI CV Editor a tool for creating lists of controlled vocabulary items e MDI Browser a tool for displaying and accessing data and metadata e IMDI Metadata Search Tool a tool for searching through metadata information e IMDI Tree Builder a tool for c
56. will remain open If the file to be closed has not been saved yet a warning appears Click on Yes to save the file before closing click on No to not save the file click on Cancel to return to the file 1 2 1 10 Close All Click on Close All to close all IMDI file see also section 1 2 1 9 25 1 2 1 11 Save Click on Save to save the currently active IMDI file 1 e the file that is highlighted in the Browse panel see section 1 3 Note The Save option is only enabled if a file node 1 e a session lexicon or corpus node is highlighted in the Browse panel If any other node 1s highlighted the following error message appears E Save Warning X 1 You need to select a Session Corpus or Catalogue root node in the Browse If you save an IMDI file for the first time the Save dialog window appears Browse to the folder where you want to save the file assign a name to it and then click on Save Note When you save an IMDI file the IMDI Editor will check 1f the folder contains a file named imdi xsd If not it will automatically create this file Note for researchers working at the MPI for Psycholinguistics Nijmegen The name of the IMDI file should be the same as the name of the corresponding annotation and media files see section 2 5 The same name should also be entered in the field Session name see section 2 1 1 Furthermore the file
57. 1319 e ee stans used ete 26 152 1 10 EXIT Sasdudtse dee dicetis aed aeta ee a ad to idt 27 M22 AVEN NOD detti x md ma M M M S A TM E 2f 1 2 2 1 MOL DetlOlh Soe acusududadedudndunedunndtdudust equ ucc cans asddanc amd dunes cada aaa undE aes 28 1222 NGS OUI Se urne duit SII NO INO ORDRE 28 1223 WANG ALG ING oe aieo e eis sac p uad sinu past ra sb ntu sanded Ens Edad iid 28 T 2 9 ODUODS Menis oceodctoeccu nn 20 1 22 3 1 Prole Manage sjean 29 123 2 VATIODVITIS ee a a N N 30 1 2 3 3 Vocabularies s nsennennenesennrrorrrsrrsrrnrrrnrrrrrnrrnrrrrrsrrsresrrrsrrnrrnresnene 32 1 2 9 4 PICISTON CCS uos e ten dtes eco dado ud oagduegdvend usd cadis cad uasdN ond add unde 35 L24 CHO meni eden ue 37 to Ihe Browse Dahel siio a ese de su cb secs ecb poe uet ues a Emu pice 37 T4 The Repository panel isi eie e ea iiem rnt ose cune nme ai 42 1 4 1 Storing metadata information in the repository 43 1 4 2 Deleting metadata information from the repository 44 18 1 4 8 Managing the structure of the repository sseeeseeessse 45 1 4 4 Exporting nodes from the repository eeeeeeeeeeeeeess 46 1 4 5 Importing nodes into the repository eeeeeeeeeeeeeeeeeeeeeee 4T 1 4 6 Updating the information in the repository usse 48 1 4 7 Using metadata informat
58. 2 2 1 1 Create a controlled vocabulary by means of the IMDI CV Editor see section 9 2 Click on the Add Vocab Link button e g alo File View Help E BE a 9 amp New Session v x 9 E New Session Project Session Collector Session Name New Session e Content Participants Session Title o 4 Resources 7 References Recording Date unspecified amp Keys ___KeyName Key Value Add Vocab Link _ a al click on the Add Vocab Link button 9 Local Repository E Collectors m Content E Participants cf Languages amp Access Projects The Vocabs overview window appears and displays the available controlled vocabularies 3 Click on the controlled vocabulary that you want to add It will be highlighted in blue color e g Ee Vocabs overview EN xj Used vocabularies until now Link To Cannonical CV Description Language hility xml LanguagesID http veww mpi nllDI amp 8chema LanguagesID xml The id s of languages Lists of languages and language abbreviations used atthe http veww mpi nl MDI SchemaiMediaFile Format xml A list if media formats AddURL Cancel http www mpi nl D D Schema MPlI Languages xml click on the controlled vocabulary sme that you want to add Note If the required c
59. 5 Language RFC1766 x sil eng Link 2 Clear Project 39 b Some nodes in the Browse panel allow you to add further sub nodes which are then displayed as screens in the Content panel For example the node Actors allows you to add a sub node for each actor To add a sub node do one of the following Click on a node in the Browse panel It is highlighted in blue color 10 xi Then click with the right mouse button on the highlighted node A pull down menu appears Select Add from the pull down menu Full Name um Alternatively Close click on an Add button in the Content panel Actors Description Language The added sub node lol xi is displayed in the Browse panel 3 IMDI Metadata Editor He Vew Option Hep 7 A uw Standard Boslony Resource k E Eater Project Actors Full Name Actor 1 Unspecified i mo References Actors Description IMDI Metadata Editor File View Options Help A act Jj Standard Sessfon Resource B gt Project p dq A Content ctor Summary Actor Information RoleUnspecified Name Full Name Code Social Family RoleU nspecifieN Ethnic Group AgeUnspecified SexUnspecified EducationUnspecified AnonymizedUnspecified on the added sub node t
60. CDRoot icdrom useCDname true cacheDir DAWINNT ProfilesibirghewMDI Tools ItConfigFileName LocalTools cfg configURL http J veww mpi nl MDI Config mpi mdDefSearchFile globalBookmarkFileName mpiBookmarks xml selectedCorporaFileName SelectedCorpora xml defaultCharset IS0 8859 1 showURLfield true nodeSelectionEnabled true corpusStructurePresent true imdiMaode false corexMode false mpiMode true dobesMode false debugMode false Click in the box to the left of Check CV s to enable the IMDI Browser to check 1f the values used in a metadata file are correct 1 e are specified in the controlled vocabulary Click in the box to the left of Show Hide bookmarks to show or hide the Bookmarks panel in the IMDI Browser window Ea IMDI BCBrowser for MPI oaa EM X File Options Search Help Metadata Search Click on Metadata Search to access the IMDI Metadata Search tool see section 7 Ea IMDI BCBrowser for MPI ec File Options Search Help Click on About to view the copyright and version information 138 6 2 Bookmarks panel In the Bookmarks panel you can save shortcuts 1 e bookmarks to parts of the corpus Such bookmarks have the advantage that you do not need to navigate through the entire corpus hierarchy in order to access that part of the corpus Depending on the version and the configuration of the IMDI Browser a number of bookmarks are displayed by default For example the following d
61. Communication Cay information under Languages and Keys is displayed under the nodes Languages and Subgenre Description Task Modalities speech poi Subject plants i Root URL httpweww mpi nl world corporallMDl metadata lMDI imdi Basket functions Communication Context Interactivity non interac Keys Planning Type semi spont VE cv Involvement non elicited ww cv Social Context private Event Structure monologue Channel face to face 67 2 3 1 Content Type Information about the genre the task the modalities the subject and the communication context For example 3 IMDI Metadata Editor File View Options Help B as 9 exe ome Sue 5 Content pos Content Information Genre Discourse Subgenre Description T ask Modalitiesspeech pointin Social Contextprivate Event Structure monologue Channelface to face Subjectplants Content Type Descriptions Languages Keys Genre Discourse ic Subgenre Description Task Modalities speech pointing gestures Subject plants Communication Context Interactivity non interactive M cv Planning Type semi spontaneous M cv Involvement non elicited cv Social Context private Event Structure monologue Channel jface to face Many of the fields under Content Type offer predefined values tha
62. Content ummary Content Information GenreDiscourse SubgenreNarrative Taskfrog story Modalitiesspeech Social ContextUnspecified Event StructureUnspecified ChannelUnspecified Subject Content Type Descriptions Languages Keys Genre Discourse Subgenre Narrative Task frog story ina cv Modalities speech ina Subject Communication Context Interactivity semHinteractive M cv Planning Type semi spontenecus M cv Involvement inon elicited cv Social Context Unspecified M cv Event Structure Unspecified cv Channel Unspecified cv Vie cleer Content Figure 1 shows a screenshot of the new editor version PIS x amp iBlx File Name Content Genre Language English x Tag OSOS Title Ps URL Aton mpi nlllibl schema Cantent Genre xml Description list af values forthe genre aspect Tag Value Description Add Unknown Unknown nspecified Unspecified eC iscourse Discourse itual religinus texts Ritualireligioustets nemeve iction Fiction 000000 o Je ewspaper article Newspaper article C RadinTVfeature RadipTVfeatue eee Drama Drama oZ Singing Sing Je Instrumental music Instrumental musie E Poety Poetry o 4 literature terre S E oe a ES eee Click on a column header to sort Figure 2 shows
63. Corpus Dutch Spoken Corpus MPI Copora ESF Corpus TI at es es that are part of the archives stored at the MPI This its managemer em is d version sion is based on centralized m lik efficiency re s can be ociated with nodes in the co structure The specifications are isibility exists for certain corpus resulting from a specific project team has a responsible researcher These will get the rights to modify for that sub corpus Central co vanagement at the MPI has to take care that these top privil vill be handled Current point in the tree IMDl corpora Authorized persons can as potential new user add or remove users rename groups ights for a gro lf you want to access one of these links you will be asked to identify your self Users can modify their password by clicking here about the Content LL fe wet Z Figure 9 shows a screenshot of the new access rights management system that is tightly connected with the IMDI metadata domain 11 Browsable Corpus Access Rights Management Grant rights to a group nr l O x Go Links a Fie Edit view Favorites Tools Help Back amp Z Search Favorites Meda 4 Eye 3 A 8 2 Address http corpust mpi nl arm ver2 arm rights group html Access Rights Management Grant or deny rights to a group This form allows you to associate read rights to a group with all resources that can be found under the selected node Therefore you
64. D Birghe SoftwareanualsdMDI Editorversio n3_OTreesSM old imdi LJ My Computer A SS click here to go to the next message Furthermore the converter informs you about any errors that took place during the conversion process Click on Next to read this information 107 The following two types of errors can occur 1 Errors concerning the Genre element In earlier versions Genre consisted of three separate subtypes These were removed and were mapped onto one Genre and one Subgenre instead see section 2 7 1 While some of these mappings are known to the converter others are not If the mapping is not known the converter will generate an error message informing you that the old Genre has been mapped onto the Keys schema of the Content screen instead see section 2 3 4 e g Ay The Conversion generated a warning Ho mapping was available for Genre and SubGenre in Content The original Genre information in included as Keys in Content Please fill in the correct values for Genre and SubGenre This warning will continue to appear until you generated have removed the warning conversion warning Key in Session Recording Date con Keys Key Name Key Value onversion IMDI 1 9to3 0 warning Unknown mapping of Genre narrative gt 7 IMDI file creator AddCV the old Genre has been mapped onto the Keys schema of
65. E Access z Projects ode Se EZ MyParticipants The imported file 1s added to the Repository panel 1 4 6 Updating the information in the repository This option allows you to update an imported repository 1 e the copy of the file see section 1 4 5 after the original file has been modified Do the following I Click on the imported repository It will be highlighted in blue color 2 Right click on that repository A pull down menu appears gt Click on Update The information in the repository is updated 48 1 4 7 Using metadata information from the repository To use metadata information from the repository do one of the following e Drag and drop a sub node from the Repository panel to the appropriate top node in the Browse panel For example IMDI Metadata Editor File View Options EAE Q TreesSM Project Collector O cake e Participants e 2 Drag the sub node to the appropriate top node in the Browse panel and drop it 9 Local Repository E Collectors mf Content Q Panicinan 1 Click on the sub node in the Repository panel Type consultant Code J Age24 Anonymous true Bm x v x rSummary Participant Information Role Sex Male Full Name Code Role Ethyic Group 3 The information 1s entered into the Content panel Education Ano
66. Editor has some major changes to its user interface The most 2A 4 impprtant change is that everything is now contained within one window If you open an IMDI file or create a new 7 ong the file will not appear in a separate window as it used to do in previous versions We have tried to keep 2 7 mist of the interface elements you are used to from the previous versions These elements are now presented in A Hii thg Content Panel Browse panel A E Content G5 Actors Languages E Access i P amp Local Old IMDI Version Repository panel The IMDI Editor window has the following structure e amenu bar see section 1 2 e a Browse panel allowing you to browse through one or more IMDI files see section 1 3 e a Repository panel allowing you to work with recurring information see section 1 4 e a Content panel displaying screens schemata and fields for entering metadata information see section 1 5 22 The size of any of the three panels Browse Repository and Content can be changed relative to the size of the other panels To change its size do one of the following File View Options Et Use the split pane Go with the mouse to any split pane The mouse will turn into a double headed arrow Click and move that arrow to decrease or Use the arrows increase the size of the panel Click on an arrow
67. F amily F ather Deafn Hearing status ofthe actor s IS80538 eng ess xml father Content SocialContext http Jwww mpi nl lMDI Schema Content SocialContext xml A list of values for icationc Add File AdduRL Read cache Refresh selected Refresh all Show entries Close Click on a column header to sort The Vocabs overview window offers you the following options e You can sort the available controlled vocabularies Click on any of the column headers to sort the corresponding column alphabetically e Add File Click on Add File to add a controlled vocabulary file that is stored locally The Browse window appears allowing you to browse to the folder and to select the file The vocabulary will then be added to the Vocabs overview window e Add URL Click on Add URL to add a controlled vocabulary file that 1s stored on the Web The Vocab add window appears Enter the URL of the vocabulary click on Apply and then click on OK The vocabulary will be added to the Vocabs overview window Vocab add Fa URL http tfwww mpi nlIMDI SchemarCantent Genre Interactionalxml om j oem amy 33 e Read cache Click on Read cache to view all available controlled vocabularies 1 e all vocabularies that are stored in the folder user directory IMDI TOOLS CVCACHE e Refresh selected Click on a vocabulary to select it then click
68. If an actor occurs in two different Roles e g once as collector and once as annotator s he has to be listed twice To add an actor see also section 1 3 do the following Ee IMDI Metadata Editor lal xl ile View Options Help A ag e HTML Links Full Name irgi ig Birgit Hellwig Collector EM A Jan Smit Consultant CNN Birgit Hellwig Annotator Edit Remove the list of actors Ai click on Add to add another anguage English actor Text jJ was the only participant in the sessid does not appear in the recording And t to edit or remove an actor click observers present on the actor to highlight it then click on Edit or Remove The Add and Edit buttons give you access to the Actor screen that contains the following kind of information E IMDI Metadata Editor File View Options Help A ap o section section Actor Summary Acto section section 2 4 1 1 liName Jan Smit Code J Social Family Role Ethnic Group Goemai Age 24 Sex Male Education secondary school Anonymized false 79 2 4 1 1 Actor Information Information about the individual actor For example IMDI Metadata Editor PRISES File View Options Help A aps 9 Deme o mw Sue 5 Actor Summary Actor Information
69. Intera CCouteut Intera Deliverable D2 2 Integrated Resource Domain Project reference number Project acronym Project full title Project contact points Project web site EC project officer Document title Deliverable ID Document type Dissemination level Contractual date of delivery Actual date of delivery Status amp version Work package amp task ID Work package task amp deliverable responsible Number of pages Author s amp affiliation s Additional contributor s Keywords Abstract Additional notes amp remarks e content EDC 22076 INTERA 27924 Intera Integrated European language data Repository Area Khalid Choukri Evaluations amp Language Resources Distribution Agency S A 55 57 Rue Brillat Savarin 75013 Paris France Phone 33 1 43 13 33 33 Fax 33 1 43 13 33 30 Email choukri elda fr http www elda fr intera Philippe Gelin Integrated Resource Domain D2 2 1 Second Version Prototype Demonstrator PP M9 01 010 2003 01 11 2003 indication to ELRA that the prototype is operating first version next and last version to come in summer WP2 all WP2 Tasks except 2 1 MPI USAAR LORIA ELRA 13 except appendixes Daan Broeder Peter Wittenburg MPI Meta data Language Resources 1 It was not clear to MPI that in addition to the working prototype also a report has to be written This was created in February 2004 Integrated Resource Domain The Integrated Resource
70. MDI Metadata Search 2ini x IMDI BCBrowser for MPI on M03402985 loj x File Options Search Help X Session Participant e Ful Name J Bookmarks me IMDI corpora HTTP global MPI corpora global MPI Corpora HTTP CGN Corpu nca il GN Corpus HTTP earch Results Query specification and results Session MDGroup Participants Participant X FullNamez J Search Results hits 28 progress http www mpi nl world corpora media archive lac data hellwig Natural Conver Quest imdi http www mpi nl world corpor34 lwig Natural Descri save the c di reesSM imdi i z eran http www mpi nl world corpord lwig Natural Dg criptive Metadata A gt 3 ouest pistes results here S TreesSM 9 S Animal Search Service http vww mpi nl worl d a Petac dbt Browse Metadata m Animal2 wav T nimal2 tt 49 Annon Info e E Source LBHGAC25Nov00 01 CC References result3 result 4 results resuts results of the metadata search cms Scorch sw Save results in speakerJ Basket functions ouis Save l Add Remove Clear 2 The new corpus node is treated like any other corpus node 1 e it contains sessions and the sessions contain metadata media and annotation files 166 You can remove the saved corpus node again Do
71. OM Compact Disc Read Only Memory DAT Digital Audio Tape MD Mini Disc Reel Reel to reel tape DVD Digital Video Disc DVDROM Digital Video Disc Read Only Memory Hi8 Hi8 Video Tape VHS VHS Video Tape DV Digital Video U matic U matic Tape Book A publication on paper Microfiche A film card Quality The quality of the tape Choose an option from the pull down menu 1 stands for low and 5 for high quality 2 5 4 2 Position The start end position of the session in the corresponding source Please enter this position in the following formats e In the case of digital audio tapes use the field Time Position enter hh mm ss 1 e hours minutes seconds e In the case of digital video tapes use the field Time Position enter hh mm ss f 1 e hours minutes seconds frames e In the case of non digital tapes and books use the field Counter Position enter a sequence of digits to represent the counter position of a tape or the page numbers of a book Note for researchers working at the MPI for Psycholinguistics Nijmegen If you have asked the digitization team to digitize your tape the team will create a digital master file DMF from it You then need to go through this DMF using either Windows Media Player or ELAN to identify the relevant 103 session enter the start and end position of this session as it appears on this DMF under Position The digitization t
72. Session Descriptions under htt WWW mM l nl IMDI for updated lists of controlled vocabularies their values and their definitions 55 For all fields there are tool tips available Point with the mouse to the label of the metadata category and leave it there for one second The tool tip appears giving you information about this category e g In addition to these standardized fields the IMDI Editor allows for the possibility to enter project specific information Such information is entered into a Keys schema see Appendix l Recurring Schemata Keys on instructions of how to fill in a Keys schema e g E IMDI Metadata Editor CERT Ls E Editor point with the mouse to the category label and leave it there for one second Ses ion Session Name JreesSM Session M ame METATRANSCRIPT Session Name Recg Content Description Desc iptions amp short name to identify the session The name of the session can be considered as shorthand of the session title Desciptio Encoding String aage O x Metadata Litor TrepaSh Project E Cone db Acton bess Diay factor iri Rescues ade References project specific keywords for the actor J F 5 Gator TO nma 5 ur Enalish Bd Local Repectory 2 E Projects 7 Content B Actors d Ei Languages S E Access gt gp Local Cd IM Version Note
73. They are used to allow authorized persons to define access rights for language resources that are part of the archives stored at the MPI 1 e DOBES Corpus Dutch Spoken Corpus MPI Corpora and ESF Corpus At the moment only the right to read resources can be assigned The corpora are arranged in a hierarchical tree structure and access rights can be assigned to nodes in that structure All Nodes below a certain Node in the hierarchy inherit the rights of this Node Access rights assigned to nodes higher in hierarchy are overruled by access rights assigned to nodes lower in hierarchy The Access Rights Management web pages support the following features e Adding and removing users e Adding and removing access rights managers e Adding and removing groups e Modifying groups e Defining and removing access rights for a group e List access rights The Access Rights Management web pages can be accessed from the Browsable Corpus web site http corpusl mpi nl BC IMDI corpora At the bottom of the page is a link set access rights for this node Clicking this link opens the Access Rights Management main index page The current node in the corpus tree 1s the starting point for the Access Rights Management Note you need username and password to login 168 The main index page display is as follows E Browsable Corpus Access Rights Management Microsoft Internet Explorer provided by MPI Nijmegen o x File Edit Vie
74. U Utrecht U Leiden U Helsinki Kotus Project Helsinki U Stockholm U London U Nijmegen U Kansas U Victoria ca EMU Michigan U Sydney U Melbourne U Kiel U Bochum U Bonn U Leipzig U Frankfurt U Bielefeld U Erfurt Federal U of Rio de Janeiro U Buenos Aires Phonogramm Archive Vienna MPI Nijmegen European Sign Language Communities in Se UK NL and D several others we don t know Also these resources will become available under the IMDI portal when they are ready and registered Partly they are already integrated as sub parts of other projects Given this list we can hope that an increasing number of institutions will join For every metadata provider the following steps are being carried out at the MPI e he metadata files provided are being validated since most of them are created at least semi automatically with the help of scripts So a validation step is necessary e Some indexes have to be created to make the metadata searchable e he metadata hierarchies have to be created and integrated to offer browsing capability For all these steps the tools have been created such that they can be carried out efficiently Often some interaction has to be done to carry out the necessary corrections in particular on the semantic level When downloading and installing the IMDI browser everyone could start looking at this data 14 3 Prototype and Demonstrator The prototype includes the software that was d
75. Youth imdi Multilingualism file S amp SUN O3 MediaArchivellac data hellwig Natural Conversation Metadata Guest imdi e Demo file SSUNO3 MediaArchive lac data hellwigiNatural Conversation Metadata HandLLAS imdi file SUNOS3 MediaArchivellac data hellwig Natural Descriptive Metadata TreesSM imdi file SUNO3 MediaArchive lac_datas hellwig Natural Descriptive Metadata Speaking imdi fille SUNO3 MediaArchive lac_datas hellwig Natural Descriptive Metadata Kwande imdi file SUNO3 MediaArchive lac dataihellwigiNatural Descriptive Metadata vvitch2 imdi file J SUNOS3 MediaArchiverlac dataihellwig Natural Descriptive Metadata Animal2 imdi file J SUNOS3 MediaArchivellac dataihellwig Natural Descriptive Metadata Animal3 imdi file SUNO3 MediaArchive lac_datashellwig Natural Descriptive Metadata Animal4 imdi file J SUNO3 MediaArchivellac data hellwig Natural Descriptive Metadata Animal5 imdi file J SUNOS3 MediaArchivellac data hellwig Natural Descriptive Metadata F amily imdi Search Service fleiISUND3IMediaArchiverCorpusstr cture mpi dbf Browse Search Stop Save results in resutts Root URL ffilev SUNO3 MediaArchive Corpusstructure MPI imdi 33 Mediafirchive Z wwu mpi nl vorld cor Basket functions A Figure 6 shows a screenshot of the search component that is integrated with the browser It allows the user to enter queries supporting all structured elements of IMDI and also supports profiles ED HTML Based Browsi
76. a Participants sT r 3 atceRIU449495 mE mI r utie nrst o3 Tenge pang 2995 ml type Participant s gt folder pites Languages types Languages 3 cfolder namea Access typea Access 24 Clolder pames Projects typee Projecta 3 if repository gt io a OOGA a A See ae Ser 1 Uu PE t1 ci fe ps LOL EL EIU x als Tai rid emt Mgmt uh fane rtm He j O D D m m e Ow GOGH a4 4 object s ES Tei Cover Nee U Ery ENTE Z IL 1 d 3 i A Taml wersione l1 0 encoeding 150 B859 1 754 Project Hameocoesmalc Namse d Tirle umsentatiomofscooesmns F Tirles3 1 4 erontact 2 xNamexBirgit Hellwigc Namesd tc Mddgess Poa thua 310 6500 A4H Nijsegen T 8 2 Email Birgit Helluig mpi nlc Emailsd Opganisation Max Planck Institut f r Pay c Centacrsm Descriprieon Languagelds RFC1766 x sil eng L part of a FPhD project begun in August 1998 The1 project i1l To inwestigate the memantics and pra dgerlwved elegents Host of the collected expetimgent t pic 2 To d ocument the Gosmai language in forag dicticonarjy and am annotated text corpus c Descrip I ciFProject 1 aaj joe bo n SOOM Se A cr Pa wl 11 aF ini toll F 42 The Repository panel displays a number of buttons and a pull down menu which 1s accessed by clicking on a node first with the left mouse button to select it and then with the right mouse button to ope
77. a Search Tool section 7 Note This manual contains a number of references to corpora housed at the Max Planck Institute for Psycholinguistics MPI It uses these corpora as a means to illustrate the functionality of the IMDI Browser Please note that any remark e g about the structure of the corpora their availability the contact Please note throughout this manual the terms session and resource bundle are used interchangeably see also section 1 2 127 person etc only pertain to these corpora Corpora stored at other archives or institutes follow different rules Notation Conventions The following notation conventions are used e Menu items icons and screen displays are written in the font MS Sans Serif e SHORTCUT KEYS ARE WRITTEN IN SMALL CAPS e Information on troubleshooting starts as follows 128 Table of Contents T Basic Intotmatioli oii ee o Eo ee 130 MEME DU ss 130 1 2 Sessions or resource bundles eeeeeeeeeereeenere 132 2 Displaying and accessing data e eese eee rennen nnn 134 NEN JU Nn Let 135 2 T He scatti eet pi du aM MU 135 2 12 OPIONS MENU seii dee RFESRUC KENN HIN MI M SE bEmPI at nt xbbda a Usu adus 136 21 9 Saen Tello eacus op EINAD EU Ma EM Desde Mann MEME EE 138 2 14 Telpquelizcxmudiduis cu x dM E M ME IM M UE 138 2 2 Bookmarks Da
78. a screenshot of the CV Editor Corpus Structure Builder Ed Microsoft Excel corpus Structures corpman xls Bl x Fie Edit View Insert Format Tools Data Window Help Acrobat x Fi igure 3 shows a OSE S6RY xum c amp e rr iiddm 50 amp c gt eius NS Aria O0 B 7z UES SHO ORE 5 5 A 2 gt T from the Excel Al asec2000Dec01b program This can be nsi mec lt a RNC S REG ET n DL used to specify the 1 asec2000DecO1b I circle of dirt Metadata asec20 0Decl S Eisenbeiss corpus 1 1 imdi circle of d 1 01 imdi 2 lasec2000Dec02a circle of dirt Metadata asec2D0DDeclS Eisenbeiss corpus 1 1 imdi circle_of_d1 01 imdi IMDI hierarchy in an 3 Jasec2001Api02a circle of dirt Metadata asec2001AprC S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi easy Way The output 4 Jasec2001Api02d circle of dirt Metadata asec2001ApiC S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi E CAMS circle of dirt Metadata asec2001Apr2 S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi is transformed by a asec2001Apr25b jcircle of dirt Metadata asec2001Aprz S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi i 7 jasec2001Apr26i circle of dirt Metadata asec2001Apr2 S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi program into proper B lasec2001MayO3e circle of dirt Metadata asec2001May S Eisenbeiss corpus 1 1 imdi circle of d1 01 imdi IMDI XML files 9 lasec2001MayO3f
79. acquisition context this field specifies whether or not this language 1s the source language From the pull down menu select either true 1f it is the source language or false if it is not Target Language In a second language acquisition context this field specifies whether or not this language is the target language From the pull down menu select either true if it is the target language or false if it is not Descriptions A description that gives background information about the language in general Note that the description is not about the role of the language in that particular session reserve this information for the Languages Descriptions sub screen see section 2 3 3 2 See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation Resource References The IMDI Editor automatically assigns a unique resource identifier to each media file and written resource see section 2 5 The sub screen Resource References lists all available resource identifiers it displays the identifier together with the type of resource e g N10155 Media File in the screenshot above and it displays an empty box to the left This sub screen allows now you to link the language to a specific resource E g a language may only be used in
80. als BAS 2 News I IMDI corpora I C Oral Rom 2 New IMDI document IMDI Metadata Elements for Session Descriptions Version 3 0 4 MPI Nijmegen C Oral Rom Corpus i C Oral Rom Demo Spanish IMDI continues as part of the ECHO and INTERA projects French l 4 ECHO European Cultural Heritage Online italian ITI U UEM eret ema PE ESS E WV INTERA Integrated European language data repository Area e Portuguese Informal Q Formal e Media Projects using IMDI Formal in natural context i Telephone CGN The Spoken Dutch Corpus Project DOBES Documentation of Endangered Languages ECHO Case Study 4 Sign Languages i ECHO WP2 Infrastructure and Technology Root URL Ihttp Iwww mpi ntiwarldicorpora iIMDUimetadata IMDI irndi D IMTERA WIS s Intedrated Gurcoeen snausus datanineciiw Ame Basket functions 13 In the left frame called Metadata Description Tree one can find the contributions that are marked as being integrated such as from CGN BAS IFA GTRP Meertens C ORAL Florence and all the resources covered by the MPI archive Other resources will be added when they become available In addition to this list we can state that the following institutions are currently creating IMDI metadata descriptions mostly on own money or via other funding sources Univ Lund Sweden s National Dialect Archive U Uppsala U Stavanger U Groningen
81. ame directly into the Link field of the IMDI session Lexicon or Corpus Editor Do not make use of the browsing option 1 e the open folder icon to the right of the Link field Please make sure that no prefixes are defined in the View Preferences Prefixes menu see section 4 1 2 112 However if you manage your own corpus you need to enter both the file name and the directory location You can enter this information either directly into the Link field see section 4 1 1 or you can make use of prefixes see section 4 1 2 4 1 1 Link fields You can enter the directory information directly into the Link field Do the following 1 Make sure that the browsing option is set correctly Do the following 1 Inthe IMDI Editor click on Options menu 2 Click on Preferences 3 In the Preferences dialog window click on Resources 4 In the Resources dialog window make sure that there is no checkmark in the box next to Retain only the filename when browsing Note If there is a checkmark in this box the directory information will not be saved e g amp Preferences EG This preferrences panel gives the user access to certain fork properties which define how resources are handled in the editor Retain only the filename when browsing Controlled vocabularies fg Prefixes o Resources f Natural imdi IMDI Metadata Editor v0 11 Schema vIMDI 1 8 Pi
82. an machine dialogue 2 3 2 Descriptions A description of the content of the session For example E IMDI Metadata Editor File View Options Help 5 ace JP Ector HTML Ls E Content Summary Content Information Genre Discourse Subgenre Description T ask Modalitiesspeech pointin Social Contextprivate Event Structure monologue Channelface to face Subjectplants Content Type Descriptions Languages Keys Descriptions nglish Langy4ge English e cime describes the various trees in the vicinity He nam amp s he trees explains their uses and compares different pes oftrees to each other The topic was prompted b me Immediately before the session had asked him to show me typical trees and to explain their uses to me e then went into the bush to look for suitable trees henever such trees were found prepared the equipment And during this time he had a few minutes fo prepare his explanations My interest in this task was Jo prompt the use of demonstratives in a semi natural a description of the content See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 72 2 3 3 Languages Information about the language s used in the session e g Ea IMDI Metadata Editor File View Op
83. anel e g K E e looked atthe trees in four different sites all in the Ay vicinity af Kwande beyond the football field The four media um ening Because ofthe s taa clear y i ERE m ep d yarm tae k us information TLI n mto m orld data metadata lMDI imdi kA Speaking Kwande Note The Description panel displays prose information For searchable keywords look at the Info Content panel see section 6 4 The Description panel can display additional information To view this information do the following 1 In the Metadata Descriptions Tree panel click on the node whose information you want to view It will be highlighted in blue color 2 Right click on the highlighted node 3 Select one of the following options from the pull down menu see section 6 3 1 e Show Content Type Displays the file format e Show Description Displays the file description e Show URL Displays the directory information for the metadata file The information is displayed in the Description panel The Description panel also displays a Print button that allows you to print the information displayed in the panel 157 7 Searching data IMDI Metadata Search Tool To start the IMDI Metadata Search Tool do one of the following a Double click on the IMDI Metadata Search Tool icon b Or use the IMDI Browser interface 1 Use the menu items see section 6 1 3 1 Click on Search
84. anel see section 6 4 Displays the file format in the Info Content panel see section 6 4 Displays the file content in the Info Content panel see section 6 4 Displays the directory information for all files contained under this node in the Info Content panel see section 6 4 Displays the name and directory information of the file needed for searching through the metadata in the Info Content panel see section 7 1 Displays the directory information for the metadata file in the Description panel see section 6 5 Opens the media file in Windows Media Player see section 6 3 4 2 Note Depending on the configuration of your IMDI Browser other tools may be available Furthermore under Windows the default option for starting media files of specific types is also supported e g if Windows is set to open 143 wav files in Windows Media Player the option Windows Media Player will be available Note For some nodes one of the options is marked with the symbol In this case the corresponding option will start automatically whenever you double click on the open node Note Not all options are available for all nodes 6 3 2 Navigating through the corpus world map The IMDI Browser displays a world map with the locations of all documented languages To access the world map do the following 1 In the Bookmarks panel double click on the bookmark Worldmap The info file world html is displayed in the
85. ar22b S Eisenbeiss corpus 1 1 imdi endpoint 1 03 imdi PIENE corpus Structures corpman B 1s mmm l Draw ls G Ready E ps pes on m EESTI The IMDI metadata infrastructure is not only focussing on searchable metadata descriptions but also on browsable hierarchies that can be used for organizational and management purpose Altogether the IMDI infrastructure is intended to form a corpus management framework to manage large collections of language resources In this respect the users confronted us with the need of an efficient way to generate corpus structures We allow people to specify corpus structures with the help of for example Excel which allows fast and easy column operations These Excel tables are then processed by a program to generate validated IMDI hierarchies existing of IMDI XML files See a screenshot of a typical Excel Sheet with a corpus structure in figure 3 Also the editor was extended to allow people to generate corpus structure nodes Tree Builder The Tree Builder is a tool that allows users to create their own virtual archive within the overall IMDI domain Users can create arbitrary nodes and link metadata descriptions with them All work can be done graphically by connecting existing nodes with others In doing so researchers can create their own private working domain for a special study to be carried out for some time This working domain much more easily allows the user to find and combin
86. ate 2001 06 25 61 session Name A short name or abbreviation that uniquely identifies the session Note for researchers working at the MPI for Psycholinguistics Nijmegen This name should be the same as the name of the IMDI file see section 1 2 1 11 and as the name of the corresponding annotation and media files see section 2 5 Furthermore the file name has to be Unix compatible do not use file names longer than 14 characters do not use non letter or non number characters except for the underscore and do not use blank spaces Session Title The complete title of the session Usually it 1s the spelled out version of the abbreviated Session Name Recording Date The date at which the session data was collected The date has to be entered in the format YYY Y MM DD Once you start typing into this field the IMDI Editor will automatically display the required format highlighted in blue color Type over it to enter the date 2 1 2 Descriptions A description of the l xl File View Options Help circumstances under which the data was collected E e e D Editor G HTML 3 Links Whenever you access a Session session through the IMDI Browser this description is language in which the first piece of the description is information that you will written e g see It should therefore Pese English contain all the information 1 Goemai tha
87. ation 5 Creating controlled vocabularies with the IMDI CV Editor 6 Working with controlled vocabularies in the IMDI Metadata Editor Notation Conventions The following notation conventions are used e Menu items icons and screen displays are written in the font MS Sans Serif e SHORTCUT KEYS ARE WRITTEN IN SMALL CAPS e Information on troubleshooting starts as follows 178 of 196 Intera D2 2 1 Table of Contents 1 Basic Information imd croc terier a a 180 2 Creating controlled vocabularies with the IMDI CV Editor181 2 1 Managing controlled vocabularies the File menu 181 2 2 The metadata category eeeeeeeee cesse nennen 185 2 3 The Values a AEE UEM EUER S 188 3 Working with controlled vocabularies in the IMDI Metadata J e iqo gent 190 3 1 Viewing the available controlled vocabularies 190 3 2 Working with predefined controlled vocabularies 193 3 3 Working with user defined controlled vocabularies 193 179 of 196 Intera 8 Basic information D2 2 1 All the IMDI tools are concerned with metadata information As such they make use of so called controlled vocabularies Each vocabulary consists of a metadata category e g a category labeled languages and of a list of predefined values to be entered into this category e g all the names of the world s languages The IMDI CV E
88. ave to delete the list again see below 4 Repeat this process to add other nodes to your selection Note Please select only nodes that are listed below a corpus node for which there 1s a search file available Such corpus nodes are visualized in the IMDI Browser by means of a white C on a black background see section 7 1 As indicated in the illustration to the right you can select and search nodes contained under MPI corpora or under DoBeS archive But you cannot select and search one node from MPI corpora and one from DoBeS archive since they do not have a common search file E IMDI BCBrowser for MPI on M0340298 File Options Search Help Metadata Descriptions Tree O x dt IMDI imdi Q IMDI corpora i index html N MPI corpora index html Acquisition Language amp Cognition Multilingualism Q DoBeS archive i introduction html Aweti team Kkuikuro team Lacandon team Salar Monguor team nnteam Root fleJisun 3 Corpora IMDUimetadata IMDI Basket functions ESI ERE EIE To view a list of the selected nodes click the List button at the bottom of the Metadata Descriptions Tree panel It will display all selected nodes e g Ei Selected Nodes olx rinl_01 file d COREX4Corporaimetadataimlirinl_01 imdi rini_02 file d JCOREX4 Corpora metadata ml r1 nl 02 imdi Dismiss 146 To remove a node from the list do t
89. constructions For example ihe demmoensirairve werd contains an oblipstery deictiz classifier ili eng read ame nietos l ng Marne ine ALIVZ CT cbwuK pl EAL IS FAX ima magm Thosz standing trees arc mango irora The thesis presenis m comprehensive anahsis of all contewts where such spatial sso i ch cubical It dha the ijh mal p La i ply E LM lisa Mis i th Hheir amp cuantiex and pragmascs and thet alixanan patha m addit ar mcludes a grammatical sketch of the oma Gaas and vllustraics ths ds i locate cheers 5 plane im thee creme ef the lagu age as a who uem dui is linked ta hs thecretizal literature om ino desenpiens pute um fig ox t the fille iwere apu s i a Ti 155 6 4 Info Content panel The Info Content panel displays information about corpus session and file nodes To read the information click on the node in the Metadata Descriptions Tree panel e g Meta Descriptions Tree kJ Lillive sa Q 8 de XE eS sion information Name TreesSh Title the uses oftrees described by SM session node le TIN Date 2 eredina di Root URL hip awww mpiniworldidatametadata lMDL imdi IF b M information List f Save f Add Remove f Clear m ng Note The information is displayed in form of searchable keywords For prose information look at the Description panel see section 6 5 The Info Content panel can display additional information T
90. contact in order EREE ioir mpi niworldidatatmetadata tO gain access Responsible researcher s by contrast have unrestricted access to all files This includes the following possibilities e Access to metadata and info files including a all suppressed information and b the possibility to update the information See sections 6 3 4 1 and 6 3 4 3 e Access to media files See section 6 3 4 2 e Access to annotation files including the possibility to continuously update them See section 6 3 4 2 133 6 Displaying and accessing data Starting the IMDI Browser opens up the window IMDI BCBrowser for MPI referred to in this manual as the IMDI Browser window In the IMDI Browser window you can view and access the corpora 1 e you can read information about the available data access the files and initiate searches The IMDI Browser window contains the following information 1 Menu items see section 6 1 Bookmarks panel see section 6 2 Metadata Descriptions Tree panel see section 6 3 Info Content panel see section 6 4 29 uv Description panel see section 6 5 IMDI BCBrowser for MPI on M03402985 0 x m Description panel Pao aad E Description http global file global file global http MPI corpora global file MPI Corpora HTTP global http m ma Metadata Description 1222 gt Info Content panel Metadata Descriptions Tree pa
91. cording took place in early evening Because ofthe twilight the ideo recordings are nottoo clear A No Language RFC1766 x sil eng 54 Most of the fields are more or less standardized The following symbols and options are available closed controlled vocabulary evi closed controlled vocabulary list mug c CM open controlled vocabulary my aE 2 open controlled vocabulary list You can enter only one value and this value must be selected from the pull down menu You can enter more than one value separated by commas but all values must be selected from the pull down menu You can enter only one value You can either select this value from the pull down menu or type in an alternative value You can enter more than one value separated by commas You can either select these values from the pull down menu or type in alternative values In all four cases above you can choose a value directly from the pull down menu Alternatively you can start typing in which case the pull down menu will automatically open to display the available values constrained format The value must be entered in a certain format e g a date must be entered in the format YYYY MM DD As soon as you start typing the format is displayed in the field highlighted in blue color Please type over this format Please see the document IMDI Metadata Elements for
92. ct more members at a time to add or to remove Only If you want to list only the group S that you have created check the creator s only box Andreas Claus Kees van der Veer Remove one or more members from the group elg donn the Clem n For the selected group all members NO are listed here Select the John Deer member s you want to remove and Omne click the Remove button of Tanya Stivers All users are listed here Select the user s you want to add to the selected group and click the Add button Note Holding down the Ctrl key makes it possible to select more names one by one If you press the Shift key you can select a group of names blocked 4 Defining and removing access rights for groups 4 1 Defining access rights for groups From the main index page click the define access rights for a group link The opened page allows you to associate read rights to a group or to all users for all resources that can be found under the selected node Therefore please make sure that you are at the right node in the corpus tree The current starting node has to be within your domain of authority All nodes and all resources that will be affected by your operation are listed at the bottom of the web page 176 Intera D2 2 1 To define access rights for a group or for all users do the following 7 select the group whose access rights you l want to define The group members are listed in the field on the r
93. d e g MetaDescriptionCreator Do not use blank spaces 3 Specify a Key value Multiple words are allowed e g Student Assistant XY To delete a key again highlight it by clicking on it Then click Remove It is furthermore possible to add project specific controlled vocabularies to a Keys schema i e to have a Keys schema display a pull down menu containing predefined values see also section 1 2 3 3 and see the IMDI CV Editor manual for instructions on how to create controlled vocabularies 123 To add a controlled vocabulary to a Keys schema do the following Ee IMDI Metadata Editor Bl x File View Options Help T Session Session Name gtandara Session Resource Bundle Session Title Recording Date Unspecified 7 The controlled vocabulary is added to the Keys schema 2 The Vocabs overview window opens and displays all available vocabularies If the needed vocabulary is not displayed add it by clicking on B Vocals nag Add File for a locally stored vocabulary or NENNEN 3 Click ona Add URL for a vocabulary stored on the vocabulary to select it 4 Click on OK to add the selected SSS Ee ae vocabulary to the Please neis T of controlled vocabulan te secn e key value http liwww mpi nllMDI Schema Content Genre xml list of val es ferthe d Closed Controlled Vocabulary O Open Contro
94. d exemplifies the features of the Access Rights Management web pages It is organized around the following 5 parts VI Adding and removing users chapter 1 VII Adding and removing access rights managers chapter 2 VIII Adding removing and modifying groups chapter 3 IX Defining and removing access rights for groups chapter 4 X List access rights chapter 5 169 Notation Conventions The following notation conventions are used Screens schemata and fields of the web pages are written in the font MS Sans Serif SHORTCUT KEYS ARE WRITTEN IN SMALL CAPS Text in double quotes refers to menu items and screen displays Information on troubleshooting starts as follows 170 Table of Contents 1 Adding and removing USCS GS 1 cc1sccccsssccessnccesnnssnssnscenansesannsnens 172 1 1 Adding d NOW USCIS eae ooo ver esce eoe EA 172 1 2 Removing USCIS uere etl a sc ossa uen xe xr unu UcM V VE EAEN 173 2 Adding and removing access rights managers 173 2 1 Add a new access rights manager from top node 173 2 2 Add a new access rights manager from the current point in tree 173 2 3 Removing access rights managers 174 3 Adding removing and modifying groups 174 3 1 Add anew group oo 80u6 a 174 3 2 Remove d gro p ern eer unas a arses ners 175 3 3 M dify GEOU DS e 176 4 Defining and removing access
95. d to the A Ia repository E 43 e Alternatively make use of the Get Data option in the Repository panel For example to add the actor J to the repository do the following IMDI Metadata Editor File View Options P Lm 1 Click on the sub node in dica Browse panel Collector Content ibd n 2 Click on the appropriate top As node in the Repository panel 49 Resources 3 Make use of the Get Data References option l a Either click on the Get a Data butt Coles a a a E o E Content 2 P M Participants 3 b Or right click on the ld d e p highlighted node and then e click on the Get Data Q eomm 775 item in the pull down ae Gu menu dno In both cases the node is added to the repository Note You can only add a sub node from the Browse panel to an appropriate corresponding top node in the Repository panel e g a language can only be added to a language type node and an actor can only be added to an actor type node If the two nodes of the Browse and Repository panels do not match the program will automatically disable the Drag and drop and the Get Data functions 1 4 2 Deleting metadata information from the repository To delete metadata information do the following l In the Repository panel click on the node that you want to delete The node w
96. de from the Repository panel into the Browse panel e g the node Actors Note You can overwrite an exiting node by dragging a node from the Repository panel onto a node of the Browse panel e g by dragging actor X onto the node actor Y In this case the following warning will appear i Confirm Replacement E You are replacing an existing node This action cannot be undone all information in the existing node will be lost Do you want to continue Tes Hon Click Yes 1f you want to replace the node otherwise click No 50 1 5 The Content panel The Content panel displays the screens see section 1 5 1 and the schemata and fields see section 1 5 2 which are used for entering metadata information And it allows you to specify links to media written resources lexicon info and metadata files see section 1 5 3 After you have entered metadata information you can choose to view this information either in the IMDI Editor format the default view or as an HTML document Click on the buttons at the top of the Content panel to switch between the two views IMDI Editor view HTML view 1 Session Name TreesSM Session litle The uses of trees Recording Date 2001 06 24 Descriptions Location Keys Descriptions Language Engish Ie Description of the uses of trees We looked al trees in four difterent ees all in the vicinity of Kwande beyond the
97. definition Special Profiles are now supported which allow projects or institutions to create their own special extensions to IMDI in addition to what is defined in the core set Profiles are possible since the IMDI set from the beginning was designed as a set that is extendible For example the Dutch Spoken Corpus project is using a number of additional elements taken from the TEI that should be searchable For this extension a special profile was created In general we can state that there is no other metadata infrastructure world wide that has such a level of support by a complete set of tools The IMDI infrastructure is now even more interesting also for industry since more and more management tools are being built around it All tools are offered under Open Source licence and can be downloaded from the official IMDI site including manuals For some of the tools the manuals have to be finished Editor The most important tool is the editor that allows people to enter correct metadata descriptions The new editor is a professional tool and certainly one of the best available world wide The functions are described in detail in the manual that is available via the IMDI web site see appendix A The IMDI editor was completely revised to make it even more user friendly and more efficient The editor supports a conversion from the old IMDI version to the new one i e a user will be notified and asked whether a new version should be generated At this
98. ditor allows you to create such controlled vocabularies 1 e to define a metadata catgeory and its values For example in the illustration below the category MPI Languages 1s specified and its values are defined as consisting of the names of all those languages that researchers at the MPI for Psycholinguistics in Nijmegen work on 1 e a subset of the world s languages Controlled ocabulary _ _ _ mi x File metadata category Name MPl Lanquages Language English v TaN 0 7 Title URL http Avwww mpi nl lMDI Schema MPI Languages xml Description Lists of languages and language abbreviations used atthe Max Planck Institute for Psycholinguistics This appendix lists some ofthe language names and identifiers as they are used in the Ethnologue Please use these standardized Value jeeordian Ghodoberi Goemai Guguyimidjir KK Guugu Yimithirr Gunwinggu Hausa Hebrew values Hindi jHinukh GIN Hocak TWIN iHunzib HUZ alia talian ur Indonesian Indonesian MZ Click on a column header to S In other IMDI tools e g 1n the IMDI Metadata Editor such values are then displayed in form of pull down menus For example in the illustration below all values of the metadata category MPI Languages are displayed E IMDI Metadata Editor Bl x File View Help i 4 New Se
99. dow does not appear in the Anonyms input dialog window This column is automatically filled 1n by the program 31 e To edit a mapping click on it to highlight it Then click on Edit The Anonyms input dialog window appears and displays the highlighted mapping Type in your changes Ef Anonyms input Click on OK to save the changes Institute Project Goemai type in the changes e To remove a mapping click on it to highlight it Then click on Remove The mapping 1s removed without further warning 3 Click on Close to exit the Anonyms overview dialog window All mappings are saved in the file user name ano in the folder user directory IMDI TOOLS Anonyms e g File Edit View Tools Help ay erm birghe ano Notepad olx E Anonyms H TEE t Ele Edit Search Help gt of Email UserId birghe ProjectId test InstituteId MPI lt Anonym InstituteId MPI ProjectId Goemai UserId birghe Code Code X Name Birgit Hellwig gt lt Anonym InstituteId MPI ProjectId Goemai UserId birghe gt gt lt Anony e UserId birghe birghe ano file lt AnonymList gt Contents DAWINPKanoenunList Fullname D WINNT Profiles birghe 3 IMDI TOOLS a SCHEMACACHE Gi Search gt 1 objectis 406 bytes Disk free space If you want other people to have access to these mappings send them a copy of t
100. dow Options Help General Project Collector Content Participant Enter the chosen code into the field Full Name Ef Participant Participant instead of the full name Summary Participant Information Note The code entered m i into the field Full Name Participant Information Descriptions Languages Keys Type should not be the same as ns EZ enter the the code entered into the O pare code field Code 2 Clear Participant S Open Participant Save Participant The code that you have entered into the field Full miono PAET ES File Edit Search Help Name is saved in the NalosSin n Mama IMDI file as follows Code J Code lt Anonymous gt true lt Anonym the code is saved in Institute Project User Code the IMDI file The IMDI Editor accesses the mappings that you have defined in the Anonyms overview dialog window see section 1 2 3 2 and displays the full name Ef Anonyms overview All people who Institute Project User Code Name N have access to the MPI Goemai birghe Code_J Simon Miller file that specifies these mappings will be able to see the IMDI Editor the full name all fal xi accesses the others will see the mappings between code f iWm w full names and codes and displays Participants Participants Name Simon the full names Note Prev
101. down menu or type in an alternative value For example value comments Father Mother Sibling Boss Partner Student Teacher Shaman Priest Mayor Doctor Ethnic Group The ethnic group of the actor Age The age of the actor Please enter the age in the following format YY or YY MM or YY MM DD Note If the exact age is not known it is nevertheless useful to enter an approximate age This will allow you later to conduct searches on all actors who are in the age range between e g 20 and 30 years of age Sex The sex of the actor It has to be chosen from the pull down menu Education Type in the education or literacy level of the actor For example primary school 29 66 secondary school literate illiterate etc There are no constraints on this field 81 Anonymized Specifies whether or not the actor s name was made anonymous Choose true if you have entered a code in the field Full Name Choose false if you have entered the full name in the field Full name Note If you have chosen the option true you can create a conversion file that maps the code onto the full name Make use of the menu item Options Anonyms see section 1 2 3 2 If you have created such a file the actor information 1s entered saved and displayed as follows l Ef xy imdi IMDI BCEditor v1 5 Schema vIMOL1 e File View Win
102. e Name the Address the E mail address and the Organisation he she belongs to Descriptions A description of the scope and goals of the project See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description 1s written not to the language under investigation 66 2 3 Content This screen contains information about the content of the session All the information that is entered into the Content screen will be displayed in the IMDI Browser in the following way see the IMDI Browser manual for details s IMDI BCBrowser for MPI on M03402985 File Options Search Help Bookmarks 2 Description Worldmap HTTP global global global J describes the various trees in the vicinity He names the trees explains their uses and information under Descriptions is displayed under the Content node itself MPI Corpora HTTP CL Metadata Descriptions Tree Q Publications pdf e Natural conversation 9 descriptive Q TreesSM Q Metadata Location Subgenre Description Task 2 TRSA eech pointin Social Contextprivate Event Structure monologue y information under Content eo mure Type is displayed under the nodes of each category e g Task Modalities etc Modalities
103. e The type of the media file Choose a value from the pull down menu or type in an alternative value value comments audio video image document drawing text Format The format of the media file Choose a value from the pull down menu or type in an alternative value value comments video mpeg1 video mpeg2 video mpeg4 video quicktime audio wav audio aiff image jpeg text pdf text html 94 e Quality The quality of the recording Choose an option from the pull down menu 1 stands for low and 5 for high quality e Recording Conditions The technical conditions under which the media file was recorded e g the equipment used in the recording e g microphone type amplifier type mono stereo recording etc There are no constraints on this field 2 5 2 2 Time Position The start end position of the session on the media file Please enter the start end position in the following format hh mm ss f 1 e hours minutes seconds frames 2 5 2 9 Access Information about the access rights to the media file See Appendix 1 Recurring schemata Access for instructions on how to fill in an Access schema 2 5 2 4 Keys Keywords that are relevant to the media file See Appendix 1 Recurring Schemata Keys for instructions on how to fill in a Keys schema 2 5 2 5 Descriptions A description of the media file See Appendix 1 Recurring Schema
104. e possible for the user to select a different language version e g German That is all vocabularies will then be displayed in the appropriate language 2 Language Select the language of the controlled vocabulary from the pull down menu e g German in the illustration above Note that you can open an already existing controlled vocabulary file and create a different language version of it see above Do the following 185 of 196 Intera D2 2 1 1 Open the file that contains the underlying controlled vocabulary 1 e the so called canonical vocabulary e g the canonical Continents file ie Controlled vocabulary Editor EContinenEs zml E ioj xl Filg o N Hame Continents canonical or Language English i TAG M dec predefined ee URNS http mpi nliMDlSchemancontinents xml controlled Descriptiom as vocabulary in list of linguistic cantinern English m 2 Go to Language and select the appropriate language from the pull down menu e g German pos Controlled Yocabulary Editor Continents xml File Name Continents Langugfe English Tag J Title select a language from the pull down menu URL httg iiaa mpi nMOS chern atc d Description list af linguistic continents N 3 The following window appears x 9 Do you want to edit a new file Ye No Click on Yes to create a new file for the new language
105. e the needed resources Figure 4 gives a screenshot of the Tree Builder File Help TE Gm N Geserzed Corpas 127 5 Gewerated Corpas S Generated Corpus 13 Generated Corpus 16 0 oU ER o Generated Corpus 9 E GewrzedCormss 177 9 Generated Corpus 17 a 9 Generated Corpus 15 m zu eredi Fig u re 4 shows a Generated Corpus 20 MEN Ge reratd cope 15 uunc j B Sere aea cope m screenshot of the Tree Generated Corpus 8 E cererseacome 8 GewrsEdCornss 15 8 Generated Corpus 15 Gereraea Corpus 2n j TE oenerateg comus 15 T Mioma Builder that can be used to Generated Corpus 8 ae create new virtual 2 Generated Corpus 15 E Gexerzea Corpis amp 8 Generated Corpus 20 O collections from the whole E 9 Generated Corpus 17 IM IMDI domain by simple graphical means This Eisteu allows users to easily A T create temporary working Generated Corpus 0 A generated corpus with sequence number D wied Feb 12 2003 Generated Corpus 1 A generated corpus with sequence number 1 ved Feb 12 2003 2 Spaces which leads to Generated Corpus 10 A generated corpus with sequence number 10 wed Feb 12 2003 7 i Generated Corpus 11 A generated corpus with sequence number 11 wed Feb 12 2003 great efficiency Generated Corpus 12 A generated corpus with sequence number 12 wed Feb 12 2003 Generated Corpus 13 A generated corpus with sequence number 13 wied Feb 12 2003 Generated Corpus 14 A generated corpus with sequence
106. e Language Target Language Resource References Descriptions Descriptions Language Emil e Cv nglish Text Goemaiis a Chadic language Vest Chadic A ngas Goemai Southern Branch that is spoken by 150 000 to 200 000 speakers in Plateau State Central Migeria e Name The name of the language This name is standardized Choose it from the pull down menu or type it in Please use capital letters e ID The identifier of the language based on the Ethnologue identifiers see http www ethnoloque com web as The IMDI Editor automatically enters the correct identifier once you have filled in the Name field above e Mother Tongue Specifies whether or not this language is the mother tongue of the actor From the pull down menu select either true if it is the mother tongue or false if it is not e Primary Language Specifies whether or not this language is the language that the speaker is most fluent in From the pull down menu select either true if it is the most fluent language or false 1f it 1s not 87 e Dominant Language Please ignore this field It has been disabled for Actor Language and is only available for Content Language see section 2 3 3 1 e Source Language Please ignore this field It has been disabled for Actor Language and is only available for Content Language see section 2 3 3 1 e Target Language Please igno
107. e created check the creator s only box You can select how you want the list to be ordered by group name creator first name creator last name or path from the pull down menu Next click the Del button in front of the group you want to remove You can only remove groups that you have created 5 List access rights This feature will be added soon 177 of 196 Intera D2 2 1 Appendix D IMDI CV Editor version 1 0 Manual This manual was last updated 14 Jul 2003 The latest version of the manual can be downloaded from the following webpage www mpi nl tools Author Birgit Hellwig Introduction The IMDI CV Editor SLE Metadata Initiative Controlled Vocabulary Editor was developed at the Max Planck Institute for Psycholinguistics Nijmegen The Netherlands The CV Editor is used to create lists of controlled vocabulary items As such it complements the other IMDI tools notably the IMDI Editor also known as the IMDI Metadata Editor At the moment the following IMDI tools exist tools and manuals can be downloaded tom WWW mpi nl IMDI tools e IMDI Metadata Editor a tool for creating metadata descriptions e MDI Browser a tool for displaying and accessing data and metadata e MDI Metadata Search Tool a tool for searching through metadata information This manual explains and exemplifies the features of the IMDI CV Editor It 1s organized around the following three chapters 4 Basic inform
108. e deselected files will not be saved Click on Cancel to return to the Editor window 1 2 2 View menu The View menu displays information about the currently active IMDI file 1 e the file that is highlighted in the Browse panel see section 1 3 It contains the following options Fe IMDI Metadata Edits ml x File View Options Help Information At File Prefixes Validate File 2 1 2 2 1 Information Click on Information to view information about the currently active IMDI file The following kind of information 1s displayed Sj Ntiimdi information about properties of the Pa l IMDI file such as D WINNTiPersonalSoftware ManualsilM whether the file 1s valid with respect to the XML location size last Information Prefixes Validation Schena modified IMDI File Size 12 kB standard etc ast Modified Dec 05 2002 09 17 Creation Date Nov08 2002 Originator MDI 1 8 the prefixes that are used in the SESSION IMDI file see section 1 2 3 4 ju 2 1 2 2 2 File Prefixes Click on File Prefixes to view the prefixes that are used in the IMDI file see section 1 2 3 4 1 2 2 3 Validate File Click on Validate File to view information about whether the file is valid with respect to the XML schema 1 2 3 Options menu The Options menu allows you to set a number of preferences It co
109. eam will then segment the DMF further according to these specifications and create the appropriate media file 2 5 4 3 Resource References The IMDI Editor automatically assigns a unique resource identifier to each media file and written resource see section 2 5 The sub screen Resource References lists all available resource identifiers it displays the identifier together with the type of resource e g N10155 Media File in the screenshot below and it displays an empty box to the left Ej IMDI Metadata Editor all xd File View Options Help B a e 4 am Editor 9 HTML 3 Links ES E Vv N10155 Media File v N10198 Media File C N101DD Written Resource The sub screen Resource References now allows you to link the source to a specific media file or written resource E g a source may only be connected to the media files e g N10155 Media File and N10198 Media File but not to a written resource To link a resource click into the empty box to its left A checkmark will appear there 2 5 4 4 Access Information about the access rights to the source See Appendix 1 Recurring schemata Access for instructions on how to fill in an Access schema 2 5 4 5 Keys Keywords that are relevant to the source See Appendix 1 Recurring Schemata Keys for instructions on how to fill in a Keys schema 2 5 4 6 Descriptions A description of the source S
110. earch window the data file associated with the selected corpus node 1s automatically entered into the box e g Lescription This i amp fe LAC Language and Cognition corpus which contains sportarseous ecd ard experiment uade d EET kamar Ross roma iHTTE intrans a Fe ora IMDI Metadata Search Jof x 5 5 J epee a Language 9 HCH tirara 1 index iml a m Ex 0 2E file containing OO LAC pint woraataimatad bsp metadata information corpus Su Pees of the LAC corpus Search E Browse Close Search Stop Save results in resutts 159 Note The name and location of a data file can be seen in the IMDI Browser In the Metadata Descriptions Tree panel double click on a corpus node to open it click on it to select it and then click with the right mouse button to open a pull down menu From the pull down menu select Show Services The name and location of the corresponding data file is displayed in the Info Content panel see section 6 3 1 Note Corpora can only be searched if there 1s a data file available Such corpora are visualized in the Metadata Descriptions Tree panel by means of a white C on a black background Corpora that have no corresponding data file are visualized by means of a white C on a blue background If you click on a non searchable subcorpus that is part of a
111. eck the Creator s only box You can also select how you want the list to be ordered by manager first name manager last name manager login name creator first name creator last name or path from the pull down menu Next click the Del button in front of the manager you want to remove You can only remove an access rights manager that you have created 3 Adding removing and modifying groups 3 1 Add a new group From the main index window click the add a new group link A new page 1s opened with a form that allows you to create a new group and add users to that group Only the creator of a group can modify that group 174 To create a new group do the following E Browsable Corpus Access Rights Management Microsoft Internet Explorer provided by MPI Nijmegen Mam Add the name for the new group File Edit View Favorites Tools Help Bak gt Q A Qsearch Favorites dUMeda 55 GSI EV here Address 4 amp http lux02 mpi nlicgi bin arm arm add group pl m Go Access Rights Management Select the users you want to add to this new group Hold down the ctrl noc pc c 1lsilA key to select more than one user or hold down the shift key to select a block of users Add a new group Please enter the following fields Select one or more users to If yo e Shit Select an agreement that the user pn E has to accept The agreement is sent to the user by email This fea
112. ed by S EBF tomus you or by other archiving initiatives see section 6 1 1 corpora http veww mpi nl warld carpo CGN corpus IFA corpus Basket functions Currently the following corpora are a housed at the MPI MPI corpora DoBeS archive ESF corpus CGN IFA corpus Corpora collected by researchers affiliated with the Max Planck Institute for Psycholinguistics Nijmegen The Netherlands Corpora collected by researchers affiliated with the DoBeS Dokumentation Bedrohter Sprachen project funded by the Volkswagen foundation European Science Foundation Second Language Acquisition Database Corpus Gesproken Nederlands Corpora collected by researchers affiliated with the Instituut voor Fonetische Wetenschappen Institute of Phonetic Sciences Amsterdam The Netherlands Note Depending on the version of the IMDI Browser it may not always be possible to access all corpora If you access the Corpus Gesproken Nederlands via the IMDI Browser content searches are not possible To conduct content searches in the CGN corpus you need the corresponding CDs see the separate manual Corpus Gesproken Nederlands COREX for details 130 Each corpus contains further subcorpora collected by individual researchers or project teams The internal structure of each subcorpus varies according to the purposes and needs of the project You can access information about their conte
113. ed from the following webpage http www mpi nl TIMDI tools Author Birgit Hellwig Updates Dieter Van Uytvanck Introduction The IMDI ISLE Metadata Initiative Browser was developed at the Max Planck Institute for Psycholinguistics Niymegen The Netherlands It is used to manage metadata and data files As such it complements the other IMDI tools notably the IMDI Editor At the moment the following IMDI tools exist tools and manuals can be downloaded tom http www mpi nl IMDI tools e IMDI Metadata Editor a tool for creating metadata descriptions both for so called sessions or resource bundles and for subcorpus nodes e IMDI CV Editor a tool for creating lists of controlled vocabulary items e MDI Metadata Search Tool a tool for searching through metadata information e IMDI Tree Builder a tool for creating browsable trees based on IMDI metadata descriptions beta version The IMDI Browser supports the following features e displaying and accessing data and metadata through a hierarchical tree structure and a world map e direct access to media annotation and metadata files e searching of metadata files by means of the IMDI Metadata Search Tool This manual explains and exemplifies the features of the IMDI Browser It is organized around the following three chapters 1 Basic information about the IMDI Browser section 5 2 Displaying and accessing data section 6 3 Searching data IMDI Metadat
114. ee Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 104 2 6 References This schema contains cross references to other sessions fieldnotes or publications that are relevant to the content of the session All the information that 1s entered into the References screen will be displayed in the IMDI Browser in the following way see the IMDI Browser manual for details Es IMDI BCBrowser for MPI on M03402985 File Options Search Help Bookmarks Description Worldmap HTTP Worldmap IMDI corpora IMDI corpora HTTP MPI corpora MPI Corpora HTTP infoiContent References The sessions TreesLL and TreesAS have a sim ilar content The fieldnotes 25 6 2001 contain drawings the location of the various trees and the location of the speaker a nd the camera The idea for collecting this type of data is bas ed on an original idea by David Wilkins who collected video m aterial about a grub species and the trees it affects for the an alysis of demonstratives Metadata Descriptions Tree TreesSM Metadata i transcription paf e m TreesSM_1 mpg m TreesSM_2 mpg m TreesSM_4 mpg Es IMDI Metadata Editor m TreesSM 5 mpg File View Options Help
115. efault bookmarks give access to the corpora housed at the Max Planck Institute for Psycholinguistics World Map HTTP World Map IMDI Corpora IMDI Corpora HTTP MPI Corpora MPI Corpora HTTP CGN Corpus mpi local CGN Corpus HTTP Search Results Access via the web server to a world map that displays the locations of documented languages Access via the local MPI network to a world map that displays the locations of documented languages Access via the local MPI network to the MPI DoBeS ESF CGN and IFA corpora Access via the web server to the MPI DoBeS ESF CGN and IFA corpora Access via the local MPI network to the MPI corpora Access via the web server to the MPI corpora Access via the local MPI network to the CGN corpus Access via the web server to the CGN corpus Access to your search results Whenever you double click on a bookmark the corresponding node 1s displayed in the Metadata Descriptions Tree panel see section 6 3 In addition to the predefined bookmarks you can create your own bookmarks Do the following 1 In the Metadata Descriptions Tree panel see section 6 3 navigate to the corpus or session node for which you want to create a bookmark 2 Click on the node to select it It will be highlighted in blue color Click with the right mouse button on the node to open a pull down menu 4 Select Add to Bookmarks from the pull down menu The Input dialog
116. ere defined in the original IMD file E LJ Anonymous files Cum O Always use the prefixes defined in this panel dir e ct Ory eases Use Prefixes defined in the IMDI file itself Use Prefixes defined in an external configuration file information of files Global DAWINNTPersonaliMDI Session Files Metadata files Corpus Files Metadata files Info Files Info files Media Files Media files Annotation Units Annotation files Anonymous Info A nonymous filest 115 2 Use Prefixes defined in an external configuration file Choose this option if the prefixes are defined in an external configuration file This option is more flexible than option 1 above because the prefixes are not saved in the IMDI file itself I e if you decide to move files to a different directory at a later stage you need to change the prefixes only once in the external configuration file not in each IMDI file It is therefore recommended that you make use of this option whenever you manage a large corpus If you choose this option you are prompted to enter the name and directory of the configuration file It should be in the same folder as your IMDI Session files fy Preferences Ej E vim this nrafarrencee panel usar gs administrab r can d ine the pretis that AR mauld Exi bed amd ato whin mir Should be used x cea
117. escribed in chapter 1 and the metadata domain where we indicate the state in chapter 2 The prototype is running for some months already including all tools All major tools are downloadable and can be used by others Currently some optimizations and debugging is carried out This will lead to new releases that will be deployed from time to time The growing metadata domain can be browsed and searched by going to the following web site http www mpi nl corpus At LREC 2004 a demonstration was given that covers already many of the resources that are mentioned in the matrix and that have to be integrated in the final deliverable The intention was to create a sufficiently large metadata domain that makes it interesting for others to use it and hopefully to join This would be in the line of the ENABLER declaration ENABLER meeting in PARIS which stated clearly that e most language resources are still not visible on the Internet e even less language resources are accessible via the Internet e for all initiatives that are funded by governmental money it should be obligatory to create open metadata descriptions and to make the resources accessible via the Internet The IMDI infrastructure as further developed in INTERA allows people to create metadata in an easy way to have a low threshold for participation The list given on the previous page shows that there is an increasing number of institutions that understand the relevance of metadata As
118. ese files contain the audio or video recordings They are usually digitized in one of the following formats MPEG mpg Cinepak Quicktime Movies mov WAVE wav 3 Annotation files see section 6 3 4 2 These files contain the transcripts codings and annotations Their format varies e g ELAN Media Tagger Shoebox CHAT etc 4 Info files see section 6 3 4 3 These files contain further background information on specific topics They are in PDF or HTML formats With a few exceptions metadata and info files are publicly accessible This policy enables researchers to search metadata and to thereby gain an overview of the available material Annotation and media files are handled differently in the different corpora some corpora allow for general access e g the CGN corpus while others do not e g the MPI corpora In the latter case if you have not been explicitly granted access by the 132 responsible researcher s you will not be able to access the files To gain access ask the responsible researcher s for permission Contact details are displayed in each corpus e g Meta Descriptions Tree TreessM 1 eaf Type a ap Access wldaee Lo 2 TresssM eaf mE Publisher Contact for Access me Birgit Hellwig information dress Postbus 310 6500 AH Nijmege ail Birgit Hellwigigrgpl nl amp annon into about whom to anisatian Max Planck Institut f r Psy S 4 Source LB f
119. ession All the information that is entered into the Actors screen will be displayed in the IMDI Browser in the following way see the IMDI Browser manual for details E IMDI BCBrowser for MPI on M03402985 File Options Search Help Bookmarks Description a J was the only participantin the session The collector does not appear in the recording And IMDI corpora here were no other observers present IMDI corpora HTTP MPI corpora MPI Corpora HTTP Metadata Descriptions Tree Info Content lt gt amp E IMDI Metadata Editor 9 M Goemai File View Options Help 8 Helwig en wew e d i Project paf fi a o i language pdf B Dihlicatianc na 4 gt D Editor HTML 3 Links E ia x information about each Actors Actor is displayed under the TOU Full Name Role ec AY Birgit Hellwig Birgit Hellwig Collector Add node Actors or i Jan Smit Consultant Participants in the older I NBirgit Hellwig Birgit Hellwig Annotator version see section 2 4 1 consultant J T Actors Description Participants cv nglish Root URL Intt Jwww mpi nliworldicorporallMDl metadataill 1 M i Tg See section 2 4 2 tsion The collector Basket functions d there were no other observers present 78 2 4 1 Actor Each actor involved in the session has to be listed on the Actors screen
120. ession Name standard Session Res Session Title Descriptions Language Text the old repository 1s automatically converted and added to the node Local Old IMDI Version 110 3 The IMDI Lexicon Editor The IMDI Lexicon Editor creates IMDI Lexicon files imdi that describe lexicon resources Its structure 1s almost identical to the IMDI Session Editor 1 e it displays the screens Session see section 2 1 Project see section 2 2 Content see section 2 3 Actors see section 2 4 and References see section 2 6 The only difference concerns the screen Resources the IMDI Lexicon Editor does not display Anonymous Media Files Written Resources and Sources instead it displays Lexicon Resource This section of the manual describes only the screen Lexicon Resource For the other screens please consult section 2 3 1 Lexicon Resource The lexicon resource associated with the session is listed under Lexicon Resource which allows you to give general information about the resource about its lexical entries about the metalanguage used and about access rights and restrictions to the resource You can furthermore give a prose description and add keywords E IMDI Metadata Editor l Ioj x File View Options Help A an 9 Standard Lexicon Resource Bu Project Content Actors z L Lexicon Resource 1 References E Editor B HTML 3i L
121. ether or not you have made actors names anonymous in the written resource Choose true if you have used a code Choose false if you have used the real name 98 2 5 3 2 Information e Type and Subtype The type and subtype of the written resource whereby Subtype is dependent on Type I e the value that you enter under Type will determine the values that are available under Subtype Both Type and Subtype are open vocabularies i e you do not have to select a value from the pull down menu but can enter an alternative value instead The following values have been suggested Examples Primary Text any material that is the object of study Documentary Fiction Annotation an annotation e g transcription or grammatical analysis of the material under study gesture orthography phonetic phonology morphology morphosyntax syntax semantics pragmatics typology Lexical analysis a lexical analysis of the material under study dictionary terminology wordlist lexicon Ethnography an ethnographic analysis of the material under study Study the written resource is used for a specific subfield of linguistic science Please see the document IMDI Metadata Elements for Session Descriptions under htt WWW mM l nl IM DI for updated lists of values and their definitions 99 Note If there are several Subtypes you ca
122. first Ras fe to be s sure that you are at the right node in the corpus tree The node has to be within your domain of responsibility You can inform y ourself about the nodes that are children of the z cted ue and about all resources of a certain type would be affected by your operation You can also select E your ae should be e exec cuted recl i only at the le of the selected node In the ssociated with the selected node will be affected In fhe firs st case tH SUBARU Y SEIS be exec cuted for FP resources of the selected type that are in the sub tree spanned by the node After submit the system will carry out some checks with all information To finally activate the input you have to confirm this Current point in the tree which indicates the resources that will FF path be affected Take care that this is what you want and node Selected group lt No group s To allow or deny read access to a specific medium check the box next to it To deny according box all video pat audio INN Done Internet 7 l D L all annotations all information files Showmembers members Show resources Show resources amp Done E o g Internet Z Figure 10 shows a screenshot of the define access rights option of the new access rights management system It is linked with the current node in the metadata hierarchy and allows users to select options At this moment only read access can be
123. football field The four media fes correspond to these four shes The recording took place in earty evening Because of the twilight the video recordings are not too chear 5 1 Screens EU f iBmixj Click here to switch between the two views en n o Date 2001 05 75 Tale The uses of trees Country Nigena Qua an Pan Local Government Ares Kwande beyond the football field ev Address Description Laegoage 5OG39 eng Description of fe uses of trees We looked M frees m four different sites a in the vicinity of Kwande beyond the football field The four media fles correspond to these four sites The recording took place in eanry evening Because of te twiligh the vdeo recordings are not too clear Description Laegoage Goewa J K wal a ka feng Kemkvern goepe m ep dyan ioe Kus noe Moekwo Chack TreesSM J mpg is lost The Content panel displays a number of screens The following main screens are available IMDI Session Editor see section 2 and IMDI Lexicon Editor see section 3 Session Project Content Actors Resources References general information about the session see section 2 1 information about the project within which the session data was collected see section 2 2 information about the content of the session see section 2 3 information about the actors participating in the session see section 2 4 information about written resource fi
124. formation Type consu Meta Descriptions Code SM Role eo FullName Type consultant boni Sex Male Education secondary school us true Anonymo Education secondary school Code SM Metadata information that is entered M directly into an IMDI file is displayed in the Info Content and Description panels of the IMDI Browser s Language amp Cognition RF C1 766 x sil eng KlhnicGroup Goemai i Publications paf Natural conversation 8 descriptive Q TreesSM Q Metadata Location 4 keys 4 Project Collector e Content Add Lik BheakerSM pdf 3 Remove speakersM na I pdf e Links are Kor Tah displaye d as Participant Open Participant Save Participant additional nodes in Een SeA the IMDI Browser and can be accessed through double clicking on them Note for researchers working at the MPI for Psycholinguistics Nijmegen Please specify only the name of such a file do not make use of the open folder icon to the right of the Link field 57 1 6 Changes to the user interface version 2 0 The following changes to the user interface of the IMDI Editor were implemented in version 2 0 Version 2 0 Functionality corresponding item in Version 1 5 and earlier Browse displays all currently open files Windows menu item panel
125. ftranc fhw IY Wile lanked at l Worldmap global hs E IMDI Metadata Editor Mi x IMDI corpora global IMDI corpora HTTP global n File View Options He MPI corpora global file MPI Corpora HTTP global httk S PA l ection 5 2 mb section Metadata Descriptions Tree section I NEN 9 Mj descriptive 2 5 l 9 Ej TreesSM Metadata amp m TreesSM_1 mpg e m TreesSM_2 mpg source Link F m TreesSM 4 mpag m TreesSM_5 mpg m TreesSM_1 way m TreesSM_2 wav gt Meet A wav TreessM 1 eat T TreesSM 1d information on the screen T TreesSM_2 t xx T Teese 4t Resources is displayed under the Anonimo resource nodes Source LBHGVD1 5Jun01 2 DV References j Root URL http iwww mpi nliworidicorpora IMDIimetadata IMDI imdi Basket functions 91 The IMDI Editor automatically assigns a unique resource identifier to each media file and written resource The resource identifiers are then used on the sub screen Resource References to link a source see section 2 5 4 3 an actor see section 2 4 1 3 or a language see section 2 3 3 1 to a specific resource E g it may be the case that an actor only plays a role in one of the resources but not in the others in this case you can use the resource identifiers to link the actor to this one resource but not to the others Click on the b
126. functionality is described in detail in the manual of the browser To support search the browser supports creating a DB file for all IMDI files harvested These are then queried to find hits A user can use this facility to search through his private local corpora Language archives have the possibility the possibility to make the created DB files available for remote querying by running a DB daemon by users through the same search interface Again we just want to restrict ourselves in this document to give a screenshot of the component see figure 6 E IMDI BCBrowser for MPI on delta File Options Search Help O NT eo Bookmarks z Design UML Ref e Tem p IMDI Metadata Search File x session Location x continent z ca vw Worldmap HTTP Worldmap IMDI corpora IMDI corpora HTTP CGN Corpus HTTP Search Results Matna Descriptions Tre hits 401 progress Finished in 79484 ms mt MPl imdi file J SUNOS3 MediaArchivellac data hellwig Natural Conversation Metadata Aram imdi 9 MPI corpora file JSUNO3 MediaArchive lac_data hellwig Natural Conversation Metadata Jos imdi i HTMLcorpusCo file GUNO3 MediaArchivellac data hellwig Natural Conversation Metadata Dialect1 imdi Acquisition file SUNO3 MediaArchive lac_data hellwig Natural Conversation Metadata Dialect2 imdi Language amp Co fileJ SUNO3 MediaArchivellac dataihellwig Natural Conversation Metadata
127. g the path D WINNT Personal IMDI in the illustration below e Session Files the location of your IMDI Session files relative to Global e g the folder Metadata files V in the illustration below e Corpus Files the location of your IMDI Corpus files relative to Global e Info Files the location of your info files relative to Global e Media Files the location of your media files relative to Global e Written Resources the location of your written resource files relative to G obal e Anonymous Info the location of your file that specifies the mapping of codes onto full names relative to Global Note Please add a backslash after each folder Note This option 1s especially useful whenever files of the same type are stored in the same directory because then you only need to specify the directory information once in the Prefixes dialog window fX Exploring D AWINNTYXPersonalMMDI File Edit View Tools Help tu im P F ar lial m la es 5 41 Ei Preferences xc Niel With this preferrence panel a user or administrator can define the prefixes that should be used and also fen when they should be used Consoild Vicabulsdes Prefixes can be defined in the IMDI file itself or in an external configuration file that should be situated in the same folder as the IMDI file Annotation files ig Use these prefixes only when none w
128. g different policies Apo not Update Cache automatically Check for an update once during use of the program heck for an update every time a Resource is requested heck for an update once every Monti I 7 Check for any update when Starting the program M Update Now click here to The schema cache conta configuration files will be updated once every sessi update now Cancel Note The IMDI tools make use of so called controlled vocabularies 1 e lists of values for a specific key to ensure consistency across users These 136 vocabularies are stored on the Web and a copy is stored in the cache of your desktop computer in the folder user directory IMDI TOOLS CVCACHF The Web version is changed sometimes e g new values are added However the IMDI tools will continue to use the locally stored version until the cache 1s updated again By default it is updated whenever you run the IMDI Browser However you can change this policy and determine if and when it should be updated Font This option allows you to display information from metadata files that contains special character sets To select a set and a font do the following 4 Under Unicode Block in the left panel select a unicode block by clicking on it The available fonts for that block are displayed under Font 1n the right panel 5 Under Font select a font by clicking on it 6 Click on OK to implement the new font in the Brow
129. ght click on the corresponding node and select Show 7 The image will be displayed in the right window titled Info Content Fa IMDI BCBrowser for MPI on m03403023 File Options Search Help Bookmarks Worldmap HTTP IMDI corpora HTTP MPI corpora MPI Corpora HTTP Metadata Descriptions Tree DoBeS archive http Q Helsinki e Phonogrammarchiv Wien 9 8 Swedia2000 G taland Svealand 9 N Finland 9 Nyland 9 8 Borg Q borga_om_1 Metadata info_trans html m borga om 1 pd wav borga om 1ajpeg T borga om 1 og gif T borga om 1 ph gif Anon Info References borga_ow_2 borga ym 1 borga yw 3 Root URL http veww mpi nl world corpora lMDl me Basket functions Description Show format Save File Content Clone Node Show Content Type Show Description Show URL Jag kan ber tta om n r jag var byggnadsjobbare och byggde en bat h r t en granne till mig S kommer det en ldre karl in och s sade han s h r Baten r nog annars bra sade han men lite mera skuvagat f rst v skulle jag vilja ha sade han Jag talade ingenting s l nge han var d r men s n r han gick d rifr n s sade jag 3t han som jag byggde b ten t sade jag Vad menar han med skuvagat f rst v Ja sade han jag vet inte sade han men jag skulle tro att den skulle vara mera sned mera lutande Det h r har jag ber ttat f
130. given We are working on a semi automatic ingest system that will also allow users to define write rights however for an archive this leads to complicated issues such as versioning since nothing may be deleted 12 2 Metadata Integration In an earlier note we have described that it was much more difficult to convince data providers to participate in the metadata creation process and that at the beginning too little money was offered by us Things have changed and first metadata is available The following matrix gives information about the state GTPR corpus MPI MPI ESF Second Learner speech text integrated Corpus providers providers According to this matrix the first metadata descriptions were delivered and a number have already been integrated into the domain see figure 11 s IMDI BCBrowser for MPI on m03403068 ES 58 xl File Options Search Help Bookmarks Description Worldmap HTTP global Worldmap global IMDI corpora global IMDI corpora HTTP global Z MPI corpora global Info Content MPI Corpora HTTP global CGN Cornus mni local ilahal T Metadata Descriptions Tree i IMDI imdi IM Q IMDI corpora i IMDI Info MPI corpora T M DoBeS archive gt ESF corpus ec s Home No Descriptions C IFA corpus last update 5 Api 2DD4 IN GTRP Corpus I Bavarian Archive for Speech Sign
131. he Content panel displays the additional screen 40 You can insert the new sub node anywhere under an appropriate top node To select its position do one of the following e Add the sub node see above and click on it in the Browse panel keep the mouse button Ki Jade bes clicked and drag it to its new position A line A will appear at this position Release the mouse Y ME oui button and the sub node will be inserted in its new position e g e ai e Alternatively do the following E IMDI Metadata Editor Dl x File View Options Help 1 In the Content panel click on the sub node before which you B ap o Standard Session Resource CAN Project f D exter want to insert the new sub node It comm actors is highlighted in blue color Birgit Hellwig Daan Broeder bum a Onse Le did enga Freddy Freddy Offenga Unspecified 2 don References i IMDI Metadata Editor mix Add File View Options Help u N a o Standard Session Resource 4 escription p Project f nknown Content angua M A Actor j WEP EP AAA Birgit Hellwig Actor 4 paan broeser 3 The new sub node is Freddy Offenga added before the Resources References selected sub node Note The IMDI Editor makes use of different Add buttons for different purposes and not all of them add
132. he file user name ano They should then save this file in their folder user directory IMDI TOOLS Anonyms 1 2 3 3 Vocabularies The IMDI tools make use of so called controlled vocabularies to ensure consistency across users 1 e they provide lists of values for specific metadata categories or keys These vocabularies are stored on the Web and a copy of them is stored in the cache of your desktop computer in the folder user directory IMDI TOOLS CVCACHE see the IMDI CV Editor manual for details 32 You can view these vocabularies through the menu item Vocabularies Click on it to access the Vocabs overview window e g ES Vocabs overview Used vpedbularies until now Name Tag ent Subject CV URL Local Path Description http www mpi nl MDI Schema Content Subject xml name of the metadata name and URL path of PE language of category controlled vocabulary the and its tag if it is in a file descriptive metadata language other than DEM information category Pnad and an indication of nglish i about the and its whether or not the file is controlled vocabulary stored locally DescriptionLanguages vocabulary Interpreting Visibility Interpreting Visibility http www let kun nlfsign lanqIMDivocabs interpreting Visibility Visibility of the interpreter ISO639 eng xml Family Father Deafness Family Father Deafness http www let kun nl sign Iang IMDlIivocabs
133. he following 1 In the Metadata Descriptions Tree panel double click on the node to open it 2 Click on the open node to select it It will be highlighted in blue color 3 Do one of the following a Click the Remove button at the bottom of the Metadata Descriptions Tree panel b Or click with the right mouse button on the highlighted node then select Remove from basket from the pull down menu 4 Repeat this process to remove other nodes from your selection You can remove all selected nodes from this list by clicking the Clear button at the bottom of the Metadata Descriptions Tree panel You can save the selected list for future uses Click the Save button at the bottom of the Metadata Descriptions Tree panel The following message informs you that your list has been saved E Info message Ed a Your selection has been saved To remove it use Clear and Save Note Once you have saved a selected list you can only remove it by first clicking the Clear button to remove all selected nodes and then the Save button to save the empty list When you are satisfied with your selection you can initiate the search see section 7 6 3 4 Accessing data The IMDI Browser contains links to files or to information from files These links are symbolized through the following icons amp link to information from metadata file s 1 e information from IMDI files link to digitized media file s containing audio
134. he information into the IMDI Editor as follows IMDI Metadata Editor Bl x File View Options Help 5 a 9 short name for the project full title of the project D editor HTML E Links Project Name Goemai Title Documentation of Goemai project identifier if any Contact Informatiet Birgit Hellwig person institution responsible for the project Address Postbus 310 6500 AH Nijmegen The N E mail Birgit Hettwig mpi n Max Planck Institut f r Psycholinquistik Organissti Descriptions Language English cv M he Goemai project is part of a PhD project begun in description of August 1998 There are two aims to this project 1 o investigate the semantics and pragmatics of the proj ect locative verbs and derived elements Most ofthe 2 collected experimental data is concerned with this d 2 To documentthe Goemai language in fa s E Add 2 of a reference terarmmar a dictionanandan annotated A Link a Remove i Name A short name or abbreviation that uniquely identifies the project Title The full title of the project ID A unique identifier for the project if any e g IST 1999 10651 Contact Information Contact information about the person or institution responsible for the project th
135. i xi File E 3 The highlighted vocabulary will be Te displayed in the IMDI CV Editor URL http vwww mpi nl IMDI SchemaiContinents xml Description i of linguistic continents Tag Value Description ada Unspecited Unspecitsd Edi Africa Africa Asia Asia Remove Sav iiis EE ocal Dir to save the controlled Oceania Oceania North America North America he local directory The file is Middle America Middle America South America South America ser directory IMDI TOOLS Click on a column header to sort Save To Cache Click on Save To Cache to save the controlled vocabulary file to the cache The file is saved to the folder user directory IMDI TOOLS CVCACHE Www Tk Save As Click on Save As to save the controlled vocabulary file anywhere on the computer Note The files will be saved as XML files 183 of 196 Intera D2 2 1 Note If you are in doubt where the local directory or the cache is located on your computer point with the mouse to Save To Local Dir or Save To Cache The tool tip 1s activated and displays the corresponding location e g Controlled Vocabulary Editor Continents xml B x Ctrl N Ctrl O nents Language English bal i d Biss date C Save To Local Dir P o Cache Se er m point here 2 As C DWINNTProfilesibirghelMD TooleICVCACHENocal cuis Cti S aint 0 7 es tool tip displays
136. ick on the file to insert its name and directory location into the Link field 4 1 2 Prefixes You can enter the directory information by means of prefixes Do the following 1 In the IMDI Editor click on Options menu 2 Click on Preferences 3 In the Preferences dialog window click on Prefixes The Prefixes dialog window opens 4 In the Prefixes dialog window specify when the prefixes should be used Choose one of the following two options by clicking in the box next to this option 1 Use these prefixes only when none were defined in the original IMDI file I e if the directory in formation is already specified in the original IMDI file the prefixes defined in the Prefixes dialog window will not be used 2 Always use the prefixes defined in this panel I e the prefixes are always used even if a different directory is specified in the original IMDI file 114 5 In the Prefixes dialog window specify where you have defined the prefixes Choose one of the following two options by clicking in the box next to it 1 Use Prefixes defined in the IMDI file itself Choose this option if the prefixes are defined in the Prefixes dialog window These prefixes will then be saved within the IMDI file itself To define prefixes enter the directory information for each type of file e g e Global the directory that is shared by all files e
137. ield Name 3 Specify the directory location of the corresponding IMDI Session Corpus file in the field Corpus Link 4 3 Create a corpus node As the last step you need to create a corpus node in the IMDI Browser To create such a node do the following 1 Open the IMDI Browser 2 At the bottom of the Metadata Descriptions Tree panel enter the location of the IMDI Corpus file into the field labeled Root URL Press ENTER The corpus node together with its subcorpus nodes and sessions is displayed in the Metadata Descriptions Tree panel e g IMDI BCBrowser for MPI on m00402404 Browser Action Bookmarks options MPI corpora SMB World Map MPI Corpora HTTP intranet IMDI Corpora HTTP i Info Content DoBes Corpora MD Search CGN Corpus sessioni new corpus 1S session2 displayed here st dvi Ee i PWINNT Personal lMDUMetadata_files Natural imdi EE 5 enter the directory of the IMDI Corpus file press ENTER 119 3 To save the new corpus node permanently do the following l In the Metadata Descriptions Tree panel click on the corpus node to select it Right click on the selected corpus node A pull down menu appears Choose Add to Bookmarks from the pull down menu The Input dialog window appears Enter a name for the bookmark and click OR right click on the selected corpus node Bscriptions Tree
138. ies The selected categories and subcategories were then automatically added to the box Query specification and results subdivisions within the specified category X x Location b Country vY Netherlands Session Name Continent Nepal num Netherlands Region Netherlands Antilles 1 Pk ETT S e yay Query specification and New Zealand sell E Session MD Group Location Country Netherlands Nicaragua Niger Nigeria selected categories are added to this b0X z c E lt 2 The screenshot below illustrates a search for all sessions that were recorded after 1 1 2000 and before 1 1 2001 e Search Specification Sees seep E RARE ER dA RM after and 1 1 2000 Query specification a recorded before 1 1 2001 pen Date 01 01 2000 amp amp lt 01 01 2001 In this case Date was chosen as the second category 1 e the date of the recording Further pull down menus appeared that allowed to specify the exact time period e g search for a date before the specified date search for the exact date search for a date after the specified date 163 7 1 4 Entering the search item After you have selected the search categories enter the search item into the box in the top right corner and then press the key ENTER e g IMDI Search Specification SZE Query specification and results i Session MDGroup Participa
139. ight If you want to list only the groups that you tu eg have created check the creator s only specify rights for a group j box The current node in the tree is shown here This node and nodes below in the Current point in the tr cates lc a S E tree will be affected by your operation want a E Define the access rights by checking the na Allow for Grou Deny fo boxes Access rights can be defined for a All annotation gt specific type of media i e all annotations all audio all images all all images information files or all videos Access All information files Oo o rights can be allowed denied for the selected group or allowed denied for all n Rights users Note that allowing denying access c rights for all users automatically Affected corpus nodes Cmmmb A overrules access rights defined for the Affected resources group s Idata corpora mediaVat chive lac data kita Info Quantitylnfo htm Y dicated in the pull down menus at the bottom of the page You can navigate through the menus using the scroll wheel on your mouse the PageUp and PageDown keys or the arrow keys Next click the AddRights button to complete the operation 4 2 Remove access rights for a group From the main index window click the remove access rights from a group link The opened page lists all groups with creator and the path to the corpus node If you want to list only the groups that you hav
140. ill be highlighted 1n blue color Click with the right mouse button on the highlighted node A pull down menu appears In the pull down menu click on Delete After a warning the highlighted node is deleted 44 1 4 3 Managing the structure of the repository The following two management options are available e Adda new repository group Do the following IMDI Metadata Editor EE x File View 4 Enter the name of the TU rT SAT EDDIE CUE IEEE SEINE 4 new group The group will Local Repository b automatically be assigned t type node po ES 3 EF Collectors 1 Click on the PE E to the righ node where you wehren m want to add the Gl adults yame new group E Landfanes T P E Accegs peg oo 2 Right click on EdProi tts C CA New Group gt that node 3 Choose New Group from the pull down menu 5 Click on OK The new group is added m to the repository e Rename a repository group Do the following Local Repository irc cdi E 4 Enter the t m new name l Click on Repository Group ame the node MT me Languages that you z je want to rename 3 Choose 2 Right Rename Group from the pull down menu 5 Click on click on OK that node The group 1s renamed 45 1 4 4 Exporting nodes from the repository You can export a node from the repository Do
141. ing the search iteM cccccccccceeceseeeseeeceeeseeeseeeeeeeseeeeseeeeeeeaees 164 3 2 3 Adding or deleting a search query eseeseeeeeeeeeereee 164 3 9 Changing the profile oo orco r ei eee ene eee eue 165 3 4 Initiating and stopping the search 165 3 5 Displaying the search results eee serere 166 3 6 Saving the search results ce pe ossa ve aaa He su ceo m ve E Ren a pU oes 166 S PHBUDO uhduhri010120 4 2 022 02 012 6 7 6 508 5 165 5 1 4 6 LL e E PERR PRAE ERUS 167 3 8 Closing the IMDI Metadata Search window 167 129 5 Basic information This section of the manual introduces you to the basic concepts and possibilities of the IMDI Browser It introduces the concepts of corpora section 5 1 and of sessions or resource bundles section 5 2 For illustration purposes it makes use of the corpora stored at the Max Planck Institute for Psycholinguistics for corpora stored at other institutes different rules may apply 5 1 Corpora By default the IMDI Browser gives access to all corpora that are stored at File Options Search Help E IMDI BCBrowser for ME ial x the Max Planck Institute for XEM Psycholinguistics MPI Nijmegen The Irn Netherlands 8 In addition or alternatively the Browser can display corpora creat
142. inks T Lexicon Resource Resource ID LR 1 Date Unspecified ResourceLink T ere the lexicon resource Access Descriptions Keys Information tma metadata information about the lexicon resource Number Head Ent Unspecified Humber Subentri Unspecified Size Schema Referen B CharacterEncodi Note This part of the IMDI Editor is still under development and the controled vocabularies for the metadata categories have not yet been specified Please consult the web page htt WWW IT l ni IMDI for further information 111 4 The IMDI Corpus Editor This part of the manual guides you through using the IMDI Corpus Editor to create a corpus hierarchy All sessions or resource bundles see section 2 and lexicon resource bundles see section 3 belong to a corpus e g to the corpus of the language XY which in turn is subdivided according to various criteria e g it may be subdivided on the basis of genre or on the basis of the age of the participants etc Note In many cases the corpus manager takes care of structuring your corpus 1 e you will not need the IMDI Corpus Editor You only need the Corpus Editor in case you manage your own corpus Note for researcher working at the MPI for Psycholinguistics Nijmegen If you plan on using the IMDI Corpus Editor please contact corpus manager mpi nl trs
143. ion from the repository 49 1 5 The Content panel c vir nx eie ELE NEL EE UE SE PER RE EE EBEN DEN REA FEE NP u TEES MEOS RESO 51 To 1 0 facta tanec et eee aia eas A 51 1 5 2 Schemata and fieldS xassekesccrsesecetesrsrseseeccrseccouenewstoncenatewanewueiencienctsioed 54 Ro DE DUE of 1 6 Changes to the user interface version 2 0 58 2 The IMDI Session EGIOFEF iiia er Ereu ideas 59 24 329855 000520 9 inge aoc mM UNUM ME De Ma E 61 UN MEME o eee m 61 2 012 IDOSCIIDHOPIS wusste ere ce ee DUE RUE EDI 62 MN MEM Noc P 63 2 1 4 KEYS eson S A aaa 64 AMEN ole 65 23 COMING ono nU E n dA I d AV di dae ee 67 23 1 C ohltehl TVDO earet enan dodidodidud edid ote eto taedet ut 68 2 02 IDOSCIHDHObBSneuaudsueusaue tuse dudes AAA NSA AAT ACA eS 72 2 9 9 JLANQUAGCS sac ec ee eee ee eee eas 73 2 3 3 1 LAMQUAGE essesssssssesseesseee a 14 2 3 3 2 Languages Descriptions ccccceccceecceeeceeeeseeeeeeeseeeseeeeseeeneeesaess 16 2 ie SN SRE ete eee RR OER eRe epee enn een ORR CREEPER EERE CRRY CRE ON dae a RED a EE TT DA PCCORS c T M 78 DAN oo R 19 ZN Actorion 80 Zoe ACOMMAG quoa zie sr 83 2 4 1 3 Resource References eessseessssessseeseee nnne nnne nnne nnn nnns 83 24 14 DeschpllOlS ee rep EEEE EEEN 84 ZNO
144. ious versions of the IMDI Editor supported a different method for rendering full names anonymous If you have old IMDI files and if you have 82 doubts about how the full names are displayed please contact your corpus manager 2 4 1 2 Contact Information Contact information about the actor the Name the Address the E mail address and the Organisation he she belongs to 2 4 1 3 Resource References The IMDI Editor automatically assigns a unique resource identifier to each media file and written resource see section 2 5 The sub screen Resource References lists all available resource identifiers it displays the identifier together with the type of resource e g N10155 Media File in the screenshot below and it displays an empty box to the left IMDI Metadata Editor E Bl x File View Options Help lH ap o D Eater Hm IS Links co SJ Actor Summary Actor Information RoleConsultant NameJan Full NameJan Smit Coded Social Family Role Ethnic GroupGoemal A Age24 Seale Educationsecondary schoo Anonymizedfalse Z Actor Information Contact Information Resource References Descriptions Languages Keys N10155 Media File N10198 Media File N101DD Written Resource The sub screen Resource References now allows you to link the actor to a specific resource E g an actor may only appear in the specified Role in resource N10155 Media File
145. ish Language English Text J s firstlanguage is Goemai Kwo dialect He is also uent in Hausa and has some basic knowledge of English 85 Languages Each language that is spoken by the actor has to be listed on the Language screen To add a language see also section 1 3 do the following Bj IMDI Metadata Editor e xl File View Options Help A ag 8 4 E HTML Links 4 Actor Summary Actor Information RoleUnspecified Named Full Nameg Code Social Family Role Ethnic GroupGoemal ee Age 24 Seale Educationsecondary scho p Anonymizedfajse Actor information Contact information Resource References Descriptions Languages Keys ID RFC17583c sil ank RFC1766 x sil hua RFC17663 sil eng Edit Remove languages click on Add to add another language click on the language to or Remove to edit or to remove languages highlight it then click on Edit 86 The Add and Edit buttons give you access to the Language screen that contains the following kind of information Bj IMDI Metadata Editor loj x File View Options Help A ap e 4 m Editor 9 HTML ei Links Language me GOEMAI ME ID RFC17683csil ANK Mother Tongue true Primary Language true MN Dominant Langua Sourc
146. l functionality Again screenshots are used here to indicate the functionality see figures 9 and 10 Also this tool was tested extensively since a deployment could only be accepted if IPR questions are dealt with correctly It was presented at LREC 2004 and is in operation The manual can be found in Appendix C The ARM solution is based on a Postgres database i e all commands entered via web interfaces are stored in the database Programs are generating appropriate entries in the HT Access file that determines the access rights for web based users and entries in Access Control Lists for internal users All software is available and can be re used by other institutions It is intended to extend these mechanisms to distributed access mechanisms within a coming European project such that the users will be confronted with one transparent domain of web resources and have just one identity in this domain that will cover several language resource archives By having added an ARM component we have reacted on the comments of the reviewers and shown that we take IPR issues very serious 10 E Browsable Corpus Access Rights Management Home page Microsoft Internet Explorer File Edit View Favorites Tools Help Back gt amp at Qsearch Favorites Media A 83 2 Address A http corpusi mpi nlJarm ver2 arm index html Go Links a Access Rights Management for Language Resource Archives at the MPI DOBES
147. l tip Regions pecified Africa Midcdle A amp merica Please fill in the correct values by hand Note Unless you manually change the items that do not conform to the IMDI 3 0 standard the error messages will continue to appear in the case of the Genre errors and the offending items will continue to be highlighted in orange color in the case of the value errors 109 Note If the converter was able to map the old Genre onto the new Genre it will inform you of this fact it will add a description under Content Description and it will additionally display the old Genre as keywords under Content Keys e g IMDI Metadata Editor loj xl ile View Options Help A as 9 HTML E Editor Links E IMDI Metadata Editor File View Options Help Content A ast 9 Summary Content Information Genre Discourse Task Modalitiesspeech pointin Channelface to face Content Type Descriptions Keys Descriptions Subgenre Description Social Contextprivate Subjectplants Event Structure ono i dnguage 180639 eng nglish 2 0639 his file was generated form an IMDI 1 9 file and ransformed to IMDI 3 0 The substructure of Genre is replaced with another single element named Genre he original content of Genre was Interactional i ive e anation Performance hese a descrip
148. layed in form of a metadata category and a pull down menu of its values e g E IMDI Metadata Editor Bl x File View Help E s o ei oe SBwessn ejx 9 E New Session jr Project Session Collector Content Participants Resources References Recording Date Unspecified Session Name New Session Session Title Location metadata t wee Up Country category Continents anes i 9 Local Repository VE E Collectors i ff Content pull down menu E Participants 7 Languages of values EJ Access z Projects Note Currently the IMDI Metadata Editor display the English language version of all predefined metadata categories However in a future version it will be possible for the user to select a different language version e g German That is all metadata categories and controlled vocabularies will then be displayed in the appropriate language 10 3 Working with user defined controlled vocabularies The IMDI Metadata Editor makes use of a number Keys fields These Keys contain user defined metadata information e you can either type in a metadata category and a value for that category e or you can use a user defined controlled vocabulary For this manual only the second option is of interest To add a user defined controlled vocabulary do the following 193 of 196 Intera D
149. le Es ew Window Help Corpus Corpus Name Natural fel Corpus Title Natural data lel Corpus Links file name and directory information 1s retained Name Corpus Link Session ID MWWINNTIPersanaliliDlidetadata files sessiont imdi Session Session imdi Ej Preferences only file name E This preferrences panel gives the user access to q 1S retained properties which define how resources are handle editor Controlled ocabularies Retain only the filename when browsing Pa Prefixes Resources o eae 113 2 Make sure that the preferences are set correctly Do the following 1 In the IMDI Editor click on Options menu 2 Click on Preferences 3 In the Preferences dialog window click on Prefixes 4 In the Prefixes dialog window make sure that there 1s a checkmark in the box next to Use these prefixes only when none were defined in the original IMDI file see section 4 1 2 3 In the IMDI Editor click on the open folder icon to the right of the Link field The Open dialog window appears Link Look in SD i A E B5 nS field l C Corel Link 7 COREX4 cc q Open dialog open ied window folder icon CA nucitas File name Files of type All Files v Cancel 4 In the Open dialog window navigate to the folder that contains the file 5 Double cl
150. le View Options Help E a p o HTML Links E Actor Summary Actor Information role Consultant keys that are relevant to CodeJ Aged the actor e g Anonymizedfafse Remove See Appendix 1 Recurring Schemata Keys for instructions on how to fill in a Keys schema 89 2 4 2 Actors Descriptions A description of the interactions and interrelations among different actors of the session Note that this description concerns the set of actors as a whole information about specific actors should be described by means of Actor Descriptions see section 2 4 1 4 description of the actor constellation does nat appear in the recording And there were amp Clear Participa See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 90 2 5 Resources This screen contains information about the resources belonging to the session All the information that is entered into the Resources screen will be displayed in the IMDI Browser in the following way see the IMDI Browser manual for details BB IMDI BCBrowser for MPI on M03402985 File Options Search Help Bookmarks Description amp Worldmap HTTP global Neecrrintinn ofthe ices n
151. les media files and non digital sources associated with the session see sections 2 5 or about lexicon files see section 3 1 cross references to any material that 1s relevant to the content of the session see section 2 6 51 IMDI Corpus Editor see section 4 e Corpus links to IMDI Session IMDI Lexicon and IMDI Corpus files The Se main screens IMDI Metadata Editor E File View Options are accessed TIE etm E Session Treessm EE 2j through clicking on a season the corresponding Gana Session Namg 1 Participants node in the Browse panel see also section 1 3 Resources References 8 Goemai D TreesSmM Descriptions Language RFC1766 x sil eng a Description ofthe uses oftrees We looked attrees in four different sites all in the vicinity of Kwande beyond the football field The four media files correspond to these four sites The recording took place in early evening Because ofthe twilight the ideo recordings are nottoo clear RFC1766 RFC1766 4 Click on a node in the Browse panel e tse aridiuadege A EJ Access 4 55 Projects r The corresponding screen is displayed in the Content panel The IMDI Editor screens are structured in the following ways e All screens display creer zm schemata and fields into h schema
152. lle lca RUNE RE IU IUD UDIN D IUD IL ND I QUE 139 2 3 The Metadata Descriptions Tree panel 140 2 3 1 Navigating through the corpus tree structure 141 2 3 2 Navigating through the corpus world map sss 144 2 9 9 1 99 parts of the corpus for purposes of searching basket function 2 3 4 Accessing data sssssssssssssssssseseee nennen nennen nnne nnne nna rns 147 2 3 4 1 Metadata information cccccccccssccceeeeceeeeseeeecsueeseusessueeseeeessneeeas 148 2 3 4 2 Digitized media files and transcription annotation files 150 ESI EC Ia eR T n 155 2 4 Info Content panel 1 leeeeeeeeeenieen eee nen n nnn ananas anna 156 2 5 Description panel leeeeeeeeieerieeieen eene nnns n nnn nnn n nnn 157 3 Searching data IMDI Metadata Search Tool 158 3 1 Specifying the corpus to be searched 159 3 1 1 Selecting a data file seem n 159 3 1 2 Creating a new data file eeeseeeseseeeesseeeeeeeeeeree 161 3 2 Specifying the search parameters eee esee 161 3 2 1 Selecting the category to be searched sseuessss 162 3 2 2 Enter
153. lled Vocabulary Closed Controlled Vocabulary List Open Controlled Vocabulary List 5 A dialog window appears asking you to choose the type of 7 M see also section 1 5 2 vocabulary only one value can be entered vs vocabulary list more than one value can be entered 6 Click on OK closed only values from the pull down menu are accepted vs open alternative values can be added 124 Appendix 1 Recurring schemata Access Access schemata group information about access rights to a resource Note The different archives handle access rights differently For example many of the corpora housed at the MPI for Psycholinguistics Nijmegen automatically deny access to media files written resource files lexicon files and media tapes But other archives may follow a different policy Specify the access rights as follows I Access Access specification of the Access Information Access Conta cces cription access rights owner and publisher ofthe resource Availability not available Birgit Hellwig Publisher Date 12002 02 08 date at which the access rights were set 4 Clear Access S Open Policy I Save Policy ES E3 I Access x Access Access Information Access Contact Access Description Access Access Information AcceBS Contact Access Description Contac
154. lligible speech Ritual religious texts a prayer a healing ritual a catechism a detective novel a science fiction story Newspaper article a political essay a scientific report Radio TV feature a political discussion on the radio a documentary on animal life on the TV Drama Film Play Opera Musical Singing Individual Song Chant a psalm a slogan during a demonstration Chorus Fs eens em M Bre A s 69 Note If there are several Subgenres you can enter all of them into the Subgenre field Separate them with a comma Task The name of the experimental or stimuli based task that was carried out if any 99 ee Choose a value from the pull down menu e g info kiosk wizard of oz travel 29 66 planning room reservation or frog story or type in an alternative value Modalities The modalities under investigation Choose a value from the pull down menu e g speech writing gestures pointing gestures signs eye gaze facial 29 66 expressions emotional state haptic or type in an alternative value Subject The subject s or topic s of the session There are no constraints on this field Communication Context Information about the communication context 1 e levels of participant interaction the degree of planning through the speaker s
155. lt Countries Ef http www mpi n 7iMDI Schema Content Genre interactional xml E3 list of c 2 click on Show Source Code 5 Content Task ersionz 1 0 encoding I80 8859 1 lA list of v4 entries Mos edited with XML Spy v4 2 U httpvwawwxmispy com by Daan Broeder Max Planck IN 4 ILLI mdi YVocabularyDef xmlIns imdi httpveww mpi nl MDI Schema lMDI xmlns xsi http imdi Description Languageld SIL ENG A list of values for the Genre Interat imdi Entry Value Unknown s gt imdi Entry Value Unspecified gt re imdi Entry Value conversation imdi Entry Value verbal contest E imdi Entry Value interview imdi Entry Value meeting imdi Entry value riddles Definition A witty question to a respondent who imdi Entry valuez consultation 7 imdi Entry valuez greetings 3 content 1s displayed Show entries imdi Entry Value leavetakings imdi Entry Valuez humoaor i di VocabularyDef Close Click on Close to close the Vocabs overview window 192 of 196 Intera D2 2 1 10 2 Working with predefined controlled vocabularies The IMDI Metadata Editor makes use of a number of predefined controlled vocabularies These vocabularies are stored on the Web and the links to them are stored in the subfolders contained under user directory IMDI TOOLS CVCACHE In the IMDI Metadata Editor these predefined vocabularies are disp
156. m basket jur different sites all in the 4 Metadata Add to basket displayed in the cinity of Kwande beyond e football field The four edia files correspond to ese four sites The recording took place in early evening Because of the twilight the video recorg e Choose IMDI BCEditor from the menu The IMDI Editor opens with the corresponding metadata file e g IMDI BCEditor Info Content panel Clone Node Show File Content Show URL ue See w nemoe ciar Root URL ffile D Birgi D Basket functions Es IMDI BCBrowser for MPI on M03402985 Bl xl A parch Help double click click and right click on a session node the IMDI Editor opens select IMDI BCEditor from Count pull down menu Bookmarks Session e from basket Sarssbon Mame Trin 5M a a d 3 Tris 8M Root URL file D Bi aT ae m E ren eet 3 ONCE a cione Node m es 7 tie E Show File Content Descriptions Location Keys Sa Show URL Descriptions Language English I x SN E Text Descripson of fe uses af ees We looked at trees in four trent tiles all in hie vicinity of arde beyond Lhe football Neki The Tour media files correspond to these tour assez The recording MOOK Mace in m evening Because of he biligi The EO NCOs aee matr tor ear 149 Note You can update the metadata information in the IMDI Editor However i
157. n enter all of them into the Subtype field Separate them with a comma eg orthography morphology Format The file format of the file that contains the written resource Choose a value from the pull down menu or type in an alternative value value comments CHAT Shoebox RDBMS TRS EAF AIF BAS text plain text pdf text html 9 Size The approximate size of the written resource in words Derivation The relation of the written resource to other documents It is a closed vocabulary 1 e the value has to be chosen from the pull down menu 1 e value comments Analysis Translation Commentary Criticism Annotation Content Encoding The name of the encoding scheme used for creating the written resource if applicable For example Eurotype 1 e following the Eurotype guidelines Character Encoding The name of the character encoding used for creating the written resource e g UTF 8 Language ID The identifier of the language that is used in the written resource e g English for an English translation It has to be entered in a standard format Please enter either the ISO identifier 1 e enter ISO639 eng for English or the Ethnologue identifier 1 e enter RFC1766 x sil eng for English please look up the ISO identifiers under http Icweb loc gov standards iso63 100 9 2 la ng home h
158. n most archives it is not possible to save the updated file directly to the server Please save it to the local disk and contact your corpus manager In any case please contact your corpus manager before you make any substantial changes to make sure that your changes do not interfere with the overall corpus management 6 3 4 2 Digitized media files and transcription annotation files A session node also contains links to digitized media files 1 e to audio and or video files and to transcription annotation files These files can be accessed either through the session node or the media file node or the transcription annotation file node The following options are available a Right click on a selected media file node to open the pull down menu then choose ELAN Windows Media Player or Open with Winamp from the menu The media file will be opened in ELAN Windows Media Player or Winamp e g fj TreesSM 1 mpa Windows Media Fisver BEI E3 ser for MPI on M0340298 media file Windows media file LN be NT ELAN N 0 00 15 140 D T 0U02 00 43 Ha WINAMP media file tH kbps 44 kHz y EEEE O o mE E ES T E E A mama ee Basket functions 1 ELAN is an annotation tool developed at the Max Planck Institute for Psycholinguistics Nijmegen The Netherlands Tool and manual can be downloaded from http www mpi nl tools 150 In order to view digital images gif jpg ri
159. n of the Editor need to be converted to the new standard 105 This section first summarizes the implemented changes see section 2 7 1 and then describes the conversion of old IMDI files see section 2 7 2 and old repositories see section 2 7 3 to the 3 0 standard 2 1 Changes to the metadata categories The following changes in metadata categories were implemented in the IMDI 3 0 standard IMDI 2 0 and earlier Standard IMDI 3 0 Standard Collector this screen is now subsumed under Actor with collector being one possible value for the field Role see section 2 4 1 1 Content Genre Interactional the three subtypes of genre are replaced by one Genre and one Subgenre whereby the possible values of Subgenre are dependent on Content Genre Performance the value of Genre see section 2 3 1 Language an additional panel allows you to specify resource references see section 2 3 3 1 Participant this screen is replaced by Actor Actor now includes Contact information an additional panel allows you to specify resource references see section 2 4 1 3 AnotationUnit this screen 1s replaced by Written Resource see section 2 5 3 Resources a Keys schema is added Content Genre Discursive Media File and Written Resource are assigned Resource Ids an additional panel under Source allows you to specify resource references see section 2 5 Lexicon Resource this screen is added to create lexicon resource
160. n on the bookmark to open a pull down menu 3 Select Delete Bookmark from the pull down menu The bookmark is deleted without further warning Note You can only delete bookmarks created by yourself 1 e those marked personal in the Bookmarks panel but not the predefined bookmarks 1 e those marked global 6 3 The Metadata Descriptions Tree panel The Metadata Descriptions Tree panel allows you to navigate through the corpus hierarchy It serves the following three purposes l The hierarchy of nodes allows you to easily navigate through the corpus to the session data see sections 6 3 1 and 6 3 2 Pull down menus and the five buttons at the bottom of the panel allow you to select parts of the corpus for purposes of conducting searches see section 6 3 3 The linking of metadata media and annotation files at the session level allows you to immediately access all relevant data see section 6 3 4 140 The following screenshot illustrates the Metadata Descriptions Tree panel corpus directory C dle URL of the corpus ES IMDI BCBrowser for MPI on M03402985 Fi directory file buttons for selecting parts of the corpus O x le Options Search Help ptadata Descriptions Tree Basket functig MDI imdi erimDrcorpora i index html 9 I8 MPI corpora i index html acquisition gt Language amp Cognition africa e Goemai 9 8 Helwig fi Project pdf i
161. n the pull down menu The buttons and the pull down menu are used for the following purposes Storing metadata information in the repository see section 1 4 1 Deleting metadata information from the repository see section 1 4 2 Managing the structure of the repository see section 1 4 3 Exporting nodes from the repository see section 1 4 4 Importing nodes into the repository see section 1 4 5 Updating the information in the repository see section 1 4 6 Using metadata information from the repository see section 1 4 7 1 4 1 Storing metadata information in the repository To store metadata information in the repository do one of the following Drag and drop a node from the Browse panel to the appropriate node in the Repository panel For example to add the language GOEMAT to the repository do the following f IMDI Metadata Editor OL x File View Options Eje iee vja E Treessm I Project Name GOMA 0 e a A Collector O A lien l Click on the node e c 9 Participants in Browse panel A e T ENGLISH E RFC17662si eng S 2 Keep the mouse Tm GOEMAI button clicked and Resources ibaken by References drag it to the tate Central appropriate node in the Repository 9 amp Local Repository panel Projects Collectors EI E Content 3 Release the mouse m button The node is E Iur Ki adde
162. name has to be Unix compatible do not use file names longer than 14 characters do not use non letter or non number characters except for the underscore and do not use blank spaces 1 2 1 12 Save As Click on Save As to save the currently active IMDI file 1 e the file that is highlighted in the Browse panel see section 1 3 under a different name see also section 1 2 1 11 1 2 1 13 Save All Click on Save All to save all open IMDI files see also section 1 2 1 11 1 2 1 14 Page Setup Click on Page Setup to specify the layout of the printed IMDI file 1 e paper SIZe source orientation and margins 1 2 1 15 Print Click on Print to print the currently active IMDI file i e the file that is highlighted in the Browse panel see section 1 3 26 1 2 1 16 Exit Click on Exit to exit the IMDI Metadata Editor window All open IMDI files will be closed If some files are not yet saved a warning appears that contains the following kind of information A list of all files that are not yet saved Checkmarks indicate the selected files by default all files are selected v NBI Imt To remove a checkmark 1 e to deselect a file click in the box v TreessM imdi C Lt inet Click here to select or deselect all files Select All Deselect All Cancel mu Click OK to save all selected file Th
163. nel a Q IMDI corpora i index html Q MPI corpora i index html acquisition 9 N Language amp Cognition e africa Q Goemai 9 8 Hellwig i Project pdf i language pdf i Publications pdf o Natural conversation Q descriptive TreessM Speaking Root URL httpwww mpi nl world corporallMDl metadata lMDI imdi Basket functions 134 6 7 Menu items The following four menu items are available File see section 6 1 1 Options see section 6 1 2 Search see section 6 1 3 Help see section 6 1 4 To access any menu item do one of the following e Click on that item f Or use the shortcut key ALT plus the underlined letter s e g to access the item File press the key ALT F 6 1 1 File menu The File menu supports the following two options File Options Search Help Open File Exit Open File Click on Open File to open a corpus file i e an imdi corpus file that has been created with the IMDI Editor and that contains information about the corpus The Open dialog box appears allowing you to browse to the folder that contains the 1mdi corpus file In the dialog box click on Open and the IMDI Browser window will display the selected corpus node E g xj E IMDI BCBrowser for MPI on M03402985 Ei fel xi File Options Search Help Look in Data v I r3 Il DD ME Bookmarks Kwa Tle LD ra T
164. nents ml C Nlent Task CGN Age ml Tad Link To Cannonical CV Description Language Content InyoA femen http weww mpi nl IMDI SchemaiContent InvoA list of values for the Ivement xml Involvement aspect of Content http vwww mpi nl CGNi Schema CGN activity Dummy file used for free input xml key CGN activity http www mpi nlMmDI Schema Continents x A list of li information about http veww mpi nl IMDI Schema Content Tas A list of va available controlled k xml aspect of p vocabularies http www mpi nl CGN Schema CGN Age x The age q Cancel click OK or Cancel to return to the IMDI CV Editor 182 of 196 Intera D2 2 1 Note You can open any of the available controlled vocabularies from within the Vocabs overview window Do the following f Vocabs overview x Name Tag Link To Cannonical CY Description anguage m Content Invalvemen http www mpi nl MDI SchemarContent wementxml 1 In the window click on the controlled CGN activity http www mpi nl CGN Schema CGN aq vocabulary to be opened xml BEI http iwww mpi nllMDI SchemaiContine It will be highlighted in blue color ml Content Task http Jvwww mpi nl l DI Schema Content Tas A list of values for the Task f kxml aspect of Contpnt z CGN Age http www mpi nl CGN Schema CGN Age x The age categipries used by the mil CGN corpus C303 2 Click on OK Edi UTIMITETTIDUHCTTOT NEN D
165. new sub nodes This function is only available for those nodes that allow for the three options Add Edit and Remove c Added sub nodes can be removed again Do one of the following In the Browse panel click on the sub node that you want to remove It will be highlighted in blue color Then click with the right mouse button on the highlighted node A pull down menu will appear Select Remove from the pull down menu Alternatively in the Content panel click on the sub node that you want to remove Then click on the Remove button 41 1 4 The Repository panel The Repository panel allows you to work with recurring metadata information Such information is entered once into the Content panel see section 1 5 and is then stored in the Repository panel for future usage For example amp IMDI Metadata Editor File View Options Bel x1 Tes fes en En x e TreesSM Project i Collector SS gt Content Id RFc1 766 x sil ank amp 9 4 Participants ians nguage IRFC 1766 x sil eng z RFC1766 x sil ethg information about the language Text Goemai is a Chadic language vest Chadic A Angas Goemai Southern Branch that is spoken by 150 000 to 200 000 speakers in Plateau State Central Nigeria GOEMAI is entered into the C
166. ng Knowing that many people have problems with downloading and installing special tools such as the IMDI browser we opened the possibility to navigate in the IMDI domain by using normal HTML browsers as indicated in the following figure Of course some restrictions have to be made for example with respect to the information about the corpus structure and the location of the user Here only a path to the actual node is delivered Nevertheless the user can access all metadata information that is available This mode is achieved by executing style sheet based XSLT transformations on the fly i e the HTML are created on the fly from the underlying XML representations see figure 7 It should be mentioned that these pages can be accessed by going to the following web site http www mpi nl corpus Zi Browsable Corpus Microsoft Internet Explorer f Bl x File Edit View Favorites Tools Help Bak gt amp A Qsearch GyFavorites Smeda 04 B Sp OI 8 Address http icorpus1 mpi nl BC IMDI corpora MPI 20corpora Demo PewWi gt 20corpus kleve route media Go Links Browsable Corpus of MPI for Psycholinguistics ry IMDI corpora MPI corpora Demo PeWi corp kleve route This recording was made to generate a freely available test ce including speech and gestures The annotations were created by Peter and Kita who is gesture her at the MPI for Psycholinguistics internet 7 Z Figure 7 shows a
167. nging the profile section 7 2 Initiating and stopping the search section 7 3 Displaying the search results section 7 4 Saving the search results section 7 5 Printing section 7 6 pe s xov m c Closing the IMDI Metadata Search window section 7 7 7 1 Specifying the corpus to be searched The corpus to be searched is displayed in the box to the right of Search Service For metadata search to work you need to enter the name and location of the data file that contains the metadata information of the corresponding corpus see section 7 1 1 Many corpora provide you with predefined data files but you also have the possibility to create your own files see section 7 1 2 7 1 1 Selecting a data file To enter a data file in the box to the right of Search Service do the following a Either Type in its name and location b Or Make use of the Browse button to the right of this box c Or Make use of the IMDI Browser interface In this case do the following 1 In the Metadata Descriptions Tree panel see section 6 3 double click on the corpus node to open it Then click on the node to select it It will be highlighted 1n blue color 2 Doone of the following a In the Search menu click on Metadata Search see section 6 1 3 b Or in the Metadata Descriptions Tree panel see section 6 3 right click on the selected node then select MetaData Search from the pull down menu In the IMDI Metadata S
168. nguistic continents fin German category descriptive information about the category 1 Name Tag and Title Name contains the name of the underlying metadata category Tag contains the form that is displayed in the user interface of the IMDI tools and Title contains the spelled out version of the Tag e The Name is obligatory It is automatically the same as the name of the controlled vocabulary file plus a language abbreviation if applicable see below e The Title is optional It contains the information that is displayed in the IMDI tools whenever the tool tip is activated That is if you point with the mouse to a metadata category a blue box containing the Title will be displayed e The Tag is optional It is only necessary if you have a single underlying controlled vocabulary a so called canonical vocabulary that corresponds to vocabularies in different languages For example in the above illustration the canonical vocabulary is Continents The abbreviation ger in the Name informs us that this particular vocabulary is the German language version of the canonical Continents The Tag is Kontinente 1 e the German word for Continents That is the IMDI tools will display the canonical Continents as Kontinente in the user interface Note Currently the IMDI tools display the English language version of all predefined controlled vocabularies However in a future version it will b
169. node the structure of the corresponding IMDI Session Lexicon or Corpus file 1s displayed in the form of metadata nodes media files nodes written resource file nodes lexicon file nodes and or corpus link nodes 38 The nodes of the Browse panel interact in the following ways with the screens of the Content panel see section 1 5 a Click on any node in the Browse panel to make the corresponding IMDI file the currently active IMDI file It will be highlighted in blue color and its metadata information will be displayed in the Content panel e g IMDI Metadata Editor File View Options JE ese Q9 Twes8M Project ollecto Browse panel Project node of the file TreesSM imdi 9 v x j Project Project Name e ProjegV itle Documentation of Goemai Project ID Contact Name Birgit Hellwig Content panel aids Project screen of the file TreesSM imdi RFC1766 x sil eng ME Text The Goemai project is part of a PhD project begun in August 1998 There are two aims to this project 1 To investigate the semantics and pragmatics of 2 locative verbs and derived elements Most of the collected experimental data is concerned with this 2 topic 2 To document the Goemai language in form of a reference qrammar a dictionary and an
170. nt Langua True Source Language Target Language Resource References Descriptions ource Refereno s Descriptions j v N10155 Media File Descriptions C N10198 Media File cv English Language English vj N101DD Written Resource Text Goemaiis a Chadic language West Chadic A jAngas Goemai Southern Branch that is spoken by 150 000 to 200 000 speakers in Plateau State Central Nigeria fee e Name The name of the language This name 1s standardized Choose it from the pull down menu or type it in Please use capital letters e ID The identifier of the language based on the Ethnologue identifiers see http www ethnoloque com web as The IMDI Editor automatically enters the correct identifier once you have filled in the Name field above e Mother Tongue Please ignore this field It has been disabled for Content Language and is only available for Actor Language see section 2 4 1 5 e Primary Language Please ignore this field It has been disabled for Content Language and is only available for Actor Language see section 2 4 1 5 e Dominant Language Specifies whether or not this language is the language that 1s used most frequently in the session From the pull down menu select either true 1f it 1s the dominant language or false 1f it 1s not 19 Source Language In a second language
171. nt and structure through the info files displayed in the IMDI Browser see section 6 3 4 3 Each corpus contains metadata media annotation and info files see section 6 3 4 These files can be accessed through the following two mechanisms but please note that data files may not always be accessible to the general public see section 5 2 a By navigating through the hierarchical tree structure see section 6 3 1 Each corpus 1S organized hierarchically in the form of tree S DEUS structures see the illustration to the i index tmi o MPI corpora right i index html Tree structures consist of nodes that e n group files together on the basis of i auantityinfa htm e g the geographical region the T A a ma discourse genre the sex or age of the Ewe speaker the dialect of the speaker Bu the target source language etc as l l fareast Browser Double click on a node to mideast northamerica access the next level in the hierarchy Bodas At the lowest level of the hierarchy E Sue bises the actual metadata media esFeorus 0 annotation and info files are displayed Note The tree structure does not display the physical location of files This means that the same file can easily be displayed in different parts of the corpus e g the same file may be displayed under the nodes folktale male speaker and age group 20 to 30 years 131 b By searching the metadata files
172. nt here UM pem erum PREFIE Dr WwyIMNNTIPersanallhiplnfa filesu 116 4 2 Select the sessions and subcorpora that belong to a corpus A corpus consists of subcorpora and sessions For example the following illustration shows a corpus labeled Goemai corpus preliminary version together with its two subcorpora Natural data and Elicited data The subcorpus Elicited data contains further subcorpora labeled Matching games and Picture books Each subcorpus consists of sessions labeled Session to Session6 that contain the actual session data 1 e IMDI Session files with metadata information and links to info media written resource and lexicon files Goemai corpus preliminary version Natural data Session 1 Session2 Session3 Elicited data Matching games Session4 Session5 Picture books Session6 To specify such a hierarchical corpus structure you need to create an IMDI Corpus file for each corpus and subcorpus You are asked to provide the following information 1 Corpus Name A short name or abbreviation that uniquely identifies the corpus or subcorpus Note for researchers working at the MPI for Psycholinguistics Nijmegen The Corpus Name should be the same as the name of the corresponding IMDI Corpus file Furthermore the file name has to be Unix compatible do not use file names longer than 14 characters do not use non letter or non number characters e
173. ntains the following options Ee IMDI Metadata Editor m x File View Options Help Profile Manager Anonyms Vocabularies ES Preferences F 28 1 2 3 4 Profile Manager Every IMDI file is assigned to a type and to a profile These types and profiles are managed through the menu item Profile Manager By default there are three different types of IMDI files Resource Bundles or Sessions that are used to describe audio video and Written resources see section 2 Lexicon Resource Bundles that are used to describe lexicon resources see section 3 Corpora that are used to describe larger corpora see section 4 Types are associated with one or more profiles These profiles specify the available metadata categories together with their constraints For example members of the sign language research community have developed a Sign Language Profile IMDI files that are based on this profile contain some sign language specific metadata categories and constraints e g Ea IMDI Metadata Editor File View Options Help BE af 6 e Sign Language Profile 4 Project 4 Content 4 actors IMDI files based on the Sign Language Profile contain the metadata category Speaker Signer This Speaker Signer category contains keywords that are of relevance to sign language research e g Deafness Status etc For each keyword there are constraints
174. nts Participant x Ag Ap 20 7 oie ess 1 search item Note Some search items have to be entered manually others can be chosen from a pull down menu see section 7 1 3 If you enter the search item manually you always need to press the key ENTER to commit the item Note Alternatively you can type the search item directly into the box Query specification and results Type it between the two quotation marks following the category specification e g 20 in the illustration above 7 1 5 Adding or deleting a search query You can add and delete queries as follows L use this button to add a query X use this button to delete the query When two or more queries are specified the search process will find all sessions that contain query 1 and query 2 and all subsequent queries For example in the following illustration all sessions will be found that contain utterances from 1 a male participant X and 2 a female participant Y S IMDI Search Specification male female Query specification and results Session MDGroup Participants Participant Sex male Session MDGroup Participants Participant Y Sex female Note Variables such as X and Y are only relevant if you specify more than one query In this case they allow you to specify that the participant of query 1 e g X should either be different from the participant of query 2 e g Y 164 as in the illustration above or that s he should be
175. nymous 49 Make use of the Put Data option in the Repository panel For example to insert information about access rights do the following f IMDI Metadata Editor L OL x File View Options 9 Ri Lii Eiei joe not available entered under the bmtHeo appropriate node I A al a t3 AM AFeb99 01 References 2 Make use of the Put Data option a Either click on the Put Data button b Or right click on the highlighted node and then 1 Click on the a click on the Put Data item E TN t EIE A D meqopu Repository CI New Group panel Rename Group Export 9 Local Repository E Collectors Content a E Participants Import Note You can only insert information into the appropriate corresponding node of the Browse panel and the corresponding screen of the Content panel e g information about an actor can only be inserted in the Actors node of the Browse panel and the screen Actors of the Content panel If a non matching node is active in the Browse panel the program will automatically disable the Drag and drop and the Put Data functions Note You can add a group of nodes to the Browse panel e g a group of actors In this case you have to drag and drop or put the higher no
176. o the metadata categories ssssse 106 2 7 2 Conversion of old IMDI files to the 3 0 standard 106 2 7 3 Conversion of old repositories to the 3 0 standard 110 3 The IMDI Lexicon Editof oii uio eet OE ect Oui ees 111 31 LEXICONVIRES OUNCE ina 111 4 The IMDI Corpus EGILOF oiii aho nae E cep oos en aoxusa sos quen s nuus 112 4 1 Specify the directory information of files 112 7 ei MEMINI Cio ee ene 113 ATZ iic 114 4 2 Select the sessions and subcorpora that belong to a corpus 117 4 3 Create a corpus Node senie a xb ceu ERR ae al UE 119 APPENDICES EIU T Et 121 Appendix 1 Recurring schemata 1 Lecce eere eise e eaa nah a aan ua 121 Appendix 1 Recurring Schemata Descriptions 121 Appendix 1 Recurring Schemata Keys eee eeeee esee 123 Appendix 1 Recurring schemata Access eere eeu 125 20 21 1 Navigating through the IMDI Editor This part of the manual describes the structure of the IMDI Editor 1 1 The IMDI Editor window Starting the IMDI Editor opens up the IMDI Metadata Editor or IMDI Editor window Its initial display is as follows Changes in the IMDI BC MetaData Editor 4 This Mew version of the IMDI BC Metadata
177. o view this additional information do the following 1 In the Metadata Descriptions Tree panel click on the node whose information you want to view It will be highlighted in blue color 2 Right click on the highlighted node Select one of the following options from the pull down menu see section 6 3 1 e Bookmark Info Displays information about the bookmark e List Sessions Lists all sessions contained under this node e Session Count Displays the number of sessions contained under this node e Show Displays the file content e Show As HTML Displays the file in an HTML format e Show File Content Displays the file in an XML format e Show format Displays the file format e Show Info Displays the file content e Show LR s Displays the directory information for all files contained under this node e Show Services Displays the name and directory information of the file needed for searching through the metadata The corresponding information is displayed in the Info Content panel 156 The Info Content panel also displays the world map see section 6 3 2 And it displays Backward and Forward buttons that allow you to move to the previous or subsequent view and a Print button that allows you to print the information displayed in the panel 6 5 Description panel The Description panel displays a description of corpus session and file nodes To read the description click on the node in the Metadata Descriptions Tree p
178. on Refresh selected to have the IMDI tools check for a newer version of that vocabulary on the Web and to update the cache accordingly e Refresh all Click on Refresh all to have the IMDI tools check for newer versions of all vocabularies on the Web and to update the cache accordingly e Show entries Click on a controlled vocabulary to select it then click on Show entries to view its content e g E Vocabs overview x Used vocabularies until now Name l click ona CV URL Local Path Local Description vocabulary to Countries Select it Dl 8chema Ca A list of countries from the CIA fact book 2 click on DeafContacts DeafCluDeafContacts DeafCluhttp www let kun nl sig n lanqiMDivocabsiDeafC ontacts Deafx Extent of contacts Show Add File Add URL spe mp Refresh selected T EE E e 99 Refresh all d Show entries entries amp Vocab entries x Sorted oi Entries af httplvwww mpi nl IMDI Schema Countries xml Tag Definition 3 content 1s displayed American Samoa Andorra Angola Anguilla Antarctica Antigua and Barbuda Axctic Ocean Argenti Armerntc fy e Close Click on Close to close the Vocabs overview window 34 1 2 3 4 Preferences Click on Preferences to access the Preferences dialog window In the Preferences dialog window the following four o
179. ontent panel 9 amp 8 Local Repository 9 Projects Goemai Collectors Content e Participants this information is then stored in the Repository panel S J E Languaggg TN z GOEMAI 4 A ETAT Link Remove not available 2 Clear Note The repository is stored under the name local repository xml in the folder user directory IMDI TOOLS SDRCache The files that contain the actual metadata information are automatically assigned a name and are then stored in the same directory e g f Exploring D WINNT Profiles birghe IMDI TOOLS SDRCache File Edit View Tools Help 49 SDRCache gt 1 EX All Folders s s BL birghe M javaws L Application Data nf Cookies HL Favorites History of Imdi EHL IMDI TOOLS e Anonyms RI Cvwcache eof J Cv set vL Editor H SCHEMACACHE o 4 ENSEM RI Seara f t DE Wee duet Fumab fade tiie rde rie Do BAT mu os S FOR eB Dito D Piae Taxi Courar hane RrgurcNWH EIEL 1 4 I a 23 R1045058197562 xml files that store the metadata information e g 2 8H58 1 734 Tm l wversionst1 0 encodings I18 ca fTile D WIHKT Profiles birghe IMD epository names Local Repository smr ool1s sDBCache Eve seE nages cCollectors t ypa Collect Content types content 4 z Participants type
180. ontrolled vocabulary is not displayed click on Add File or Add URL to add it to the Vocabs overview window 194 of 196 Intera D2 2 1 4 Click on OK The Specify Vocabulary Type window appears e g E Specify Yocabulary Type x Please selectthe type of controlled vocabulary to be used far the key value pair Closed Controlled Vocabulary C2 Open Controlled Vocabulary C2 Closed Controlled Vocabulary List O Open Controlled Vocabulary List 5 Select the vocabulary type by clicking in the box to its left Closed Controlled Vocabulary only values defined in the controlled vocabulary will be accepted by the IMDI Metadata Editor Open Controlled Vocabulary values not defined in the controlled vocabulary can be typed in and will be accepted by the IMDI Metadata Editor Closed Controlled Vocabulary List Open Controlled Vocabulary List 6 Click on OK The metadata category and its list of values are added to the Keys schema e g Gy 190 Fhetadata Editor p T i jae amp i amp E Mew Session Xx v E Hew Session E ab Collector E pasion Mamas Hew Session S d Conant di Participants Session Title I e de Resources Ex db Pieteranices Recording Date Unepecified E E Descriptions Location Keys D igi Local Repository n f Collectors t E Content E Participants amp EZ Languages amp Access 7 Projects The controlled
181. pen Click on Open to open an IMDI file Browse to the folder that contains the IMDI file and double click on it to open it Note You can only open files of the IMDI Editor format If you try to open a file of a different format an error message will appear Note If you open a file assigned to a profile that has been deleted the default profile for that type will be used instead E g you may have an IMDI session file that was originally assigned to the Sign Language Profile but later on you deleted this profile If you now open this IMDI Session file again it will be assigned to the default profile for session files e g to the Standard Session Resource Bundle Profile see section 1 2 3 1 for details Note If you open a file created with an old IMDI standard it will be converted to the IMDI 3 0 standard see section 2 7 for details 1 2 1 7 Open URI Click on Open URI to download an IMDI file from the Web A dialog window pops up Enter the corresponding URI into the window and click on OK see also section 1 2 1 6 1 2 1 8 Open Recent Go to Open Recent to view a list of the IMDI files that you have recently worked on Click on any file from the list to open it see also section 1 2 1 6 1 2 1 9 Close Click on Close to close the currently active IMDI file 1 e the file that is highlighted in the Browse panel see section 1 3 All other open IMDI files
182. place we will just stick with a screen shot that shows the new user interface see figure 1 From the users we were informed about errors in the new version of the editor The code was corrected with respect to the errors we heard about We also re factured the code at some places so that we can speak about a stable version again CV Editor This editor allows people to add controlled vocabularies that are also important to guarantee useful search results Also here we will only show a screen shot see figure 2 and refer the interested user to the web site A manual is available see appendix D We also refer to the Web Sites http www mpi nl ISLE and http www mpi nl IMDI for details Im ile View Options Help A aps e e oujda ilham Project ibd d Arabic Moroccan spoken Q Actors Q Abderrahman EI Aissati Tarifit amp Arabic Moroccan spoken English French Dutch 9 4 iihamB 4 Arabic Moroccan spoken d Don 4 Resources m Media File 1 Q oujda_ilham cha Access T oujda ilham cha T oujda ilham cha T oujda ilham cha T oujda ilham cha e cc 4 DAT Access References 9 Local Repository Projects 1 Content EB Actors Languages E Access amp Test Local Repository amp imdi 3 amp esf Old IMDI Version amp new Old IMDI Version amp Local Old IMDI Version X
183. ptions are available 1 Prefixes xj ve m presses mal amp haukd b used and alto when they click here to select an option eanfiguration fle that should be sealed in the same original RAN fia E Uae Pretiors defied in thre BER Tie teed 2 Resources 3 Update Policy 4 Font Use Prefices defined in an external configu ation file Gloteat Soc Fue Do pus Files Pieria Files Witten Resources Click on any of the headings in the left panel to activate the corresponding option C O EE Puoi ia Prefixes This option allows you to specify the path to the different types of files see section 4 for details Resources This option allows you to specify the browse options see section 4 for details Update Policy The IMDI tools make use of so called controlled vocabularies see section 1 2 3 3 These vocabularies are stored on the Web and a copy of them is stored in the cache of your desktop computer in the folder user directory IMDI TOOLS CVCACHFE The Web version is changed sometimes but the IMDI tools will continue to use the locally stored version until the cache is updated again By default the cache 1s updated whenever you run an IMDI tool However you can change this policy and determine if and when the vocabularies should be updated 35 Click on Update Policy to determine this policy The following options a
184. r m nga det h r om skuv gad f rst v f r d rf r tror jag att det r F r det var faktiskt s s llsynt s att jag har inte h rt n gon efter t som har f rst tt det d r med skuvagat Kan det vara hans eget ord d eller Nej Det tror jag inte Jag tror att det var ett allm nt anv nt ord F r att jag hade mycket sv rt att f rst de d r gamla karlama de som var en sextio r ldre s h r n r jag var pojk Det var mycket sv rt att f rst dem Varifr n kom han 8 N han var pellingsman Han talade kta pellingedialekt s ger egentligen 3t gt 151 b Right click on a selected session node to open the pull down menu If the annotation of that session was done in ELAN e if there is an eaf file the pull down menu will display the options ELAN and Media Synchronized Viewers Select one of these options to view the media file together with its annotations E IMDI BCBrowser for MPI on M03402985 File Options Search Help fb TreessM 1 lol x Fie Search View Player Setecin Opis Help Bl x Wn S Anims 4 Metadata Q transcription paf e m eesSM_1 mpq m TleesSM_1 wav T TieesSM_1 eaf Arnon Info e gt Squrce LBHGYD15Jun01 2 DY Rdferences Kwande Metadata Metadata Descriptions Tree Xml MyCorpus imdi ELAN 9 8 mycorpus E e Descriptive AE 00 00 02 095
185. re available E Preferences click here to select a xi policy e updated automatically tg Prefixes o Resources c Update Policy Dd not Update Cache automatically Check for an update once during use of the program Check for an update every time a Resource is requested Check for an update once every Day Week Month Year Check for any update when Starting the program Update Now click here to The schema cache co updated once every se update NOW hema configuration files will be Font This option allows you to display special character sets and fonts To select a set and a font do the following l Under Unicode Block in the left panel select a unicode block by clicking on it The available fonts are then displayed under Font in the right panel 2 Under Font select a font by clicking on it oF Click on OK to implement the new font ES IMDI BCBrowser for MPI on M03402985 loj x ile Options Search Help IMDI corpora global IMDI corpora HTTP global Bookmarks Description ES Worldmap HTTP global No geseripton 0 description Worldmap global 24 MPI corpora global Info Content MPI Corpora HTTP global ff Da pitapak bleu ppg hngun l sjeg m iahh spn Hp bs sera abel ah peeps bm qe id PAG lares hri ra lap lepi s papae gem eerie bre dun loin rm sa dbp ben or
186. re this field It has been disabled for Actor Language and is only available for Content Language see section 2 3 3 1 e Descriptions A description that gives background information about the language in general Note that the description is independent of the actor s familiarity with it reserve such information for the Languages Descriptions sub screen below See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation e Resource References Please ignore this sub screen It has been disabled for Actor Language and is only available for Content Language see section 2 3 3 1 Languages Descriptions A description of the set of languages that the participant is familiar with Note that the description does not contain background information about the language in general reserve this for the Language Descriptions schema above See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 88 2 4 1 6 Keys This sub screen contains keywords that are relevant to the individual actor independent of his her Role in the session Bj IMDI Metadata Editor PM x Fi
187. reating browsable trees based on IMDI metadata descriptions beta version This manual helps you to understand the structure of the IMDI Editor It consists of the following parts Navigating through the IMDI Editor section 1 describing the structure of the IMDI Editor The IMDI Session Editor section 2 illustrating the metadata categories used for describing sessions or resource bundles i e audio video and written resources The IMDI Lexicon Editor section 3 illustrating the metadata categories used for describing lexicon resource bundles The IMDI Corpus Editor section 4 giving step by step instructions on how to manage a corpus with the help of the IMDI Editor Appendices Note The IMDI Editor has undergone a number of changes since it was first created both in terms of its user interface and in terms of the metadata categories supported If you have metadata files created with previous versions of the Editor please consult the following sections section 1 6 for a summary of changes to the user interface introduced in version 2 0 and section 2 7 for a summary of changes to the metadata categories introduced in version 3 0 16 Note This manual contains a number of references to corpora housed at the Max Planck Institute MPI for Psycholinguistics Nijmegen It uses these corpora as a means to illustrate the functionality of the IMDI Editor Please note that any remarks only pertain to these corpora
188. red into the metadata categories categories that are displayed in the form of screens schemata and fields in the Content panel of the IMDI Editor see also section 1 5 e g E IMDI Metadata Editor loj x File View Options Help dtor 9 HTML Links e gt Session Name TreesSM reir u fields Screens displayed Session Title The uses oftrees in the form of Recording Date 2001 06 25 nodes in the Descriptions ec 99 Browse panel Gooma anguage Y English schemata Text Description ofthe uses oftrees We looked attrees in four different sitks all in the vicinity of Kwande beyond the football D orrespond to these four sites The 2 evening Because ofthe twilight sub screens ttoo clear 9 Local Repository Projects 2 Ei Content s E Actors c Languages amp Access Local Old IMDI Version Throughout the manual these two terms are used interchangeably with session being the preferred term for audio video resources and resource bundle for written resources 59 The following sections illustrate the purpose of the various screens schemata and fields It is organized on a screen by screen basis following the structure as it is displayed in the Browse panel Session see section 2 1 Project see section 2 2
189. rsions and canonical vocabularies are visualized in the Vocabs Overview window as follows see section 9 1 name of xi Name Tag Link To Cannonical CY Description Language underlying arm z category rr gt http p mpi nlllIAPGl SchemarCorntwqents A list of linguistic continents xml ec 99 Continents o Coftinents dut Continents dut hipaa mpi nviMDbuSchema Continents X list of linguistic continents in SO639 dut Nutch A Continents frn Cqntinents trn underlying category in different languages Dutch csses French German c Sorted on column Name ontinents ger Zontinents ger name of the file that contains the underlying category Continents xml Specify the URL of the controlled vocabulary file Note It is recommended to store all such files on the Web The controlled vocabulary file is an XML file and the file name is identical to the name entered into the category Name minus the language abbreviation see above amp 187 of 196 Intera D2 2 1 4 Description The Description is optional It contains a prose description of the category 9 3 The values The IMDI CV Editor asks you to specify the following kind of information about the values To add modify or delete a value to or from the list make use of the buttons Add Edit and Remove e To add a value click on Add The Controlled Vocabulary Entry window
190. rtillas e retellings e g the speaker is asked to re tell a story from a picture book or to describe a task in his her own words e promptings e g children answering a teacher s questions the structure and content of the speech is planned in advance by the consultant performer e g e political or ritual speech e poem recitation Note This entry does not necessarily refer to an elicitation session where a consultant is given a framework but does not plan his her answer The involvement of the researcher 1 e value elicited non elicited 66 99 no observer e Social Context comments the investigator asks the speaker s to produce isolated phonemes words utterances or grammatical structures e g e production of sounds in different phonological environments e responses to morphological lexical etc questionnaires the investigator does not interfere verbally with the speech event other than with his presence no outside observer is present only a tape recorder The social context of the event 1 e value family private public controlled environment comments 71 e Event Structure The event structure 1 e value comments monologue dialogue conversation not natural format e Channel The channel of communication 1 e value comments face to face experimental setting broadcasting telephone hum
191. s a Chadic language West Chadic A ngas Goemai Southern Branch that is spoken by 150 000 to 200 000 speakers in Plateau State Central Nigeria Link language pdf 3 Remove 73 2 3 3 1 Language Each language that is used in the session has to be listed on the Language screen To add a language see also section 1 3 do the following Bj IMDI Metadata Editor el xd File View Options Help A ag 8 4 E HTML Links 2 Actor 2 Summary Actor Information RoleUnspecified Named Full Nameg Code Social Family Role Ethnic GroupGoemal oe Age24 Soale Educationsecondary scho Anonymizedfalse Actor Information Contact Information Resource References Descriptions Languages Keys d L ID RFC17583c sil ank RFC1766 x sil hua RFC17663 sil eng Edit Remove languages click on Add to add another language to edit or remove a language click on the language to highlight it then click on Edit or Remove 74 The Add and Edit buttons give you access to the Language screen that contains the following kind of information IMDI Metadata Editor loj x File View Options Help 8 a8 6 gt Ddo ES mle Links eS Name GOEMAI ID RFC1766 x sil ANK Mother Tongue Primary Language Domina
192. s stored on the Web The Vocab add window appears enter the URL of the file click on Apply then click on OK f Vocab add URL http Awra rnpi nlMOlSchema Content Genre Interactional xml ow j cmm v Click on a controlled vocabulary in the Vocabs overview window to select it It will be highlighted in blue color Then click on Remove to remove this vocabulary from the window It will be removed without further warning Click on Read all to view all available controlled vocabularies that are stored in the cache 1 e all links that are stored in the subfolders contained under user directory IMDI TOOLS CVCACHE Click on a controlled vocabulary in the Vocabs overview window to select it It will be highlighted in blue color Then click on Refresh selected to update the selected vocabulary The IMDI tools will check the corresponding URL for newer versions of this vocabulary Click on Refresh all to update all vocabularies The IMDI tools will check the corresponding URLs for newer versions of the vocabularies 191 of 196 Intera D2 2 1 Show entries Click on a controlled vocabulary in the Vocabs overview window to select it It will be highlighted in blue color Then click on Show entries to view its content e g 1 click on a vocabulary to X Ef Vocabs overvies Description Name Content Genre Interact select 1t bma Content Genre Interactionalxml A list of values forthe Genre Interaction
193. screenshot of the way browsing is made possible via normal web browsers XML files are transformed on the fly to HTML pages Unstructured Search The same environment is used to test out an alternative search option We also know that many users find it too difficult to formulate structured search queries but want to have an unstructured search option that has the functionality of the Google search function Although theoretically it does not sound convincing to execute unstructured search on structured data it offers some advantages Even people who don t know the semantics of the elements and those who don t understand how certain resources are characterized could execute searches on typical values and get hits Programs are used to generate an index file covering all metadata elements including descriptions and other unconstrained text fields Figure 8 indicates this option which also can be tested out by going to the web site http www mpi nl corpus 44 Browsable Corpus Microsoft Internet Explorer ET iBl xl File Edit View Favorites Tools Help Bak gt amp At Qsearch Favorites media C4 EY 8 9 Address http corpust mpi nl BC IMDI corpora MPI9e20corpora Demo PeWi s20corpus Keve route media Go Links This recording was made to generate a freely available test resource including speech and gestures The annotations were created by Peter and Kita who is gesture researcher
194. se click Cancel 8 africa N australia D eastasia e southeastasiz gt ahaj MetaData Search 4 Lao Create MD Search DB 4 e southasia Shows northamerica mesoamerica southamerica List Sessions oceania Add to Bookmarks Sign Languag Gesture Proje Multilingualism Add to basket Demn IMDI BCEditor E s Session Count Remove from basket RootURL lejisun03 Corpors Clone Node Look in 7108 e 8 x E E Progress x Show File Content aT i Basket functi MEEF ENEE asket functions Show URL ienr e 3 Creating IMDI DB List Save Add Remove Clea andata A Found IMDI Session file 1 Fite Wane ooutneastasia data Files of Types HOLDO eie Tiina 5 After the data file has been created the corpus node changes its color from blue to black and you can now search this corpus see section 7 1 1 Note These data files are stored permanently in your local directory user directory IMDI TOOLS DB Specifying the search parameters 161 The search parameters are specified in the topmost part of the IMDI Metadata Search window e g add a que the category to be searched session Name M IMD Metadata S IMD Metadata Search e E enter the ifi search item The following search options are available 1 Selecting the category to be searched section 7 1 3 2 Entering the search item
195. se uea PRR can be dedned in the MDI file itself or in an extemal configuration file that should be situated in ive sama folder as e IMEI file iF a Lisa these pretfboes only when none were defined in the original IILI fie re Meng use the prefoces defined in this panel B test configuration file CB TAT Fil EE Une Pratos defined in tha PADI file ited SEES 5 MR RU c M lt xml uersion 1 0 encoding I 0 8859 1 DOCTVPE METATRANSCRIPT X ENTITV annotationunitPrefix Annotation filesX BN lt fENTITY corpusPrefix Metadata filesX 5B lt tENTITY infolinkPrefix Info filesX gt i Mn damage sessions Tile enter name and lt fENTITY globalPrefix D WINNT Personal IMDI gt N directory of the lt ftENTITY anonymousPrefix Anonymous_files gt lt tENTITY mediafilePrefix Media filesX 5B configuration k ENTITV sessionPrefix Metadata filesXV N N file Note All preferences that have been set in the Options Preferences menu are automatically used for each new IMDI file unless you change them manually Whenever you create a new IMDI file please make sure that the preferences are set correctly so that you do not accidentally insert wrong prefixes There is a tool tip available that reminds you of the defined prefixes To access it point with the mouse into the Link field A blue box appears informing you about the prefixes e g poi
196. see section 7 All resources or data files are accompanied by metadata files 1 e files that give information about the data These metadata files are searchable For example as illustrated below you could search for all texts by female speakers above 60 years of age The IMDI Browser displays the search results and allows you direct access to the corresponding data files IMDI Metadata Search LL OL x Session Participant E Age gt Bo amp amp true Session 4 Participant Md X Sex female 5 2 Sessions or resource bundles The lowest level of the corpus hierarchy contains the session or resource bundle These two terms are used interchangeably session being the preferred term for spoken corpora and resource bundle for written corpora A session or resource bundle corresponds to a meaningful unit of analysis usually to a piece of data having the same overall content the same set of participants and the same location and time e g one elicitation session on topic X or one folktale or one matching game or one conversation between several speakers A session or resource bundle contains different files The following four types of files exist 1 Metadata files see section 6 3 4 1 These files contain information about the session e g its date and location its content and its participants They are of the IMDI Editor format 2 Media files see section 6 3 4 2 Th
197. ser E IMDI BCBrowser for MPI on M03402985 BD x File Options Search Help Bookmarks Description Worldmap HTTP global Worldmap global IMDI corpora global IMDI corpora HTTP global MPI rarnnra nlinhal E Preferences eI Update Policy gt Fr wh qnedupu Gh kykna pingi hrg nj mimg pup ales meng fl eo deron slbi heels fh din hovel Latin Extended B Cyrillic Armenian unicode block Hebrew hb thang REE and font here TTL A IPA Extensions gpl Wein ni iby Peel be pepe gine paursphgl manira 2 os a I uu illi lia any lig ch Entrees aa anula Vigo Ala bra Greek I Font select the peti ma dab ugue pans Vbrupanzgua oer pps n gla sapra penpals pu su lien pea fra phan ub riagnjba rmienian eample fram Tha World s Writing Example Na and Bright pp 361 Rem tene verba sequnt the Browser displays the chosen click on OK Lo T unicode block and font 137 Configuration Info Check CV s Show Hide bookmarks 6 1 3 Search menu The Search menu supports the following option Metadata Search 6 1 4 Help menu The Help menu supports the following option About Click on Configuration Info to view information on the configuration of the tool you might need this information when reporting a bug to the developers e g E Info xj Ji Configuration Information Name Value Os WINDOWS CorpusRoot
198. sket functions Info Content Description he corpora ofthe Language Acquisition Group at the MPI for Psycholinquistics Nijmegen NL include first L1 as well as second language L2 acquisition data from children and adults Language Acquisition Acquidata Corpus The corpora of the Language Acquisition Group at the NIPI for Psycholinguistics Nijmegen NL include first L1 as well as second language L2 acquisition data from children and adults Content 2 First L1 amp second language L2 acquisition data Longitudinal amp snapshot data Children amp adults data Control data of native speakers of different languages spontaneous and elicited data Partly transcribed and annotated data Additional info files e g project or language description Data of following languages are available Czech L1 amp L2 Chinese L1 Dutch L2 English L1 amp L2 French L2 German L1 amp L2 Figure 5 shows a screenshot of the new browser version that can process new as well as old IMDI files It allows users to flexibly browse in the linked IMDI domain of XML files Structured Search Component The structured search component is an integrated part of the browser but could also be used in different frameworks It offers to combine patterns with different elements that are part of the structured IMDI set and it also supports profiles Its broad
199. ssion Content v i MP MN gm Ne metadata category MPI Languages Id il Language NHindi 180 of 196 Intera D2 2 1 Controlled vocabularies thereby a facilitate the entering of metadata information as the values need not be typed but are selected from a pull down menu and b ensure consistency as the values are selected from a predefined list 9 Creating controlled vocabularies with the IMDI CV Editor Starting the IMDI CV Editor opens up the Controlled Vocabulary Editor window This window is structured as follows oR Controlled Vocabulary Editor iml x File menu Khe information about the Name metadata category Click on a column header to sort The File menu of the Controlled Vocabulary Editor window allows you to create open save and print controlled vocabularies file see section 9 1 Each file contains information about the metadata category see section 9 2 and its values see section 9 3 9 1 Managing controlled vocabularies the File menu The File menu is used for managing the controlled vocabularies files It contains the following menu items Controlled ene on je m x Ctrl H Ctrl Ci Show Available CV s Save To Local Dir Ctrl save To Cache Save As 181 of 196 Intera D2 2 1 To access any menu item do one of the following g Click on that item
200. ssion screen will be displayed in the IMDI Browser in the following way see the IMDI Browser manual for details E IMDI BCBrowser for MPI on M03402985 File Options Search Help Bookmarks global global global different sites all in the vicinity of global A kwande beyond the football field global The four media files correspond to MPI Corpora HTTP global A these four sites The recording Metadata Descriptions Tree Acquisition bi Language amp Cognition 8 africa 8 Goemai 9 8 Hellwig i Project pdf i language pdf Publications pdf 9 Natural Session information A Name TreesSM Title The uses oftrees J Date 2001 06 25 ion Name TreesSM Session Title The uses oftrees Recording Date 2001 06 25 Descriptions i mE Descriptions i oemai Basket functions 72 Langga ah z CV English Text Description ofthe uses oftrees We looked attrees in information under Location and Keys is displayed under the nodes Location and Keys Link EN Remove 2 1 1 Session The name title and recording date of the session For example IMDI Metadata Editor File View Options Help B e o P5 Editor HTML 3 Links gt Session Session Name Treessm Session Title The uses oftrees Recording D
201. stored under the name user name ano in the folder user directory IMDI TOOLS Anonyms on your desktop computer 1 e it is only available to you However you might want to store a copy of it in the corpus itself e g to protect it against data loss in case of a system error In this case store the file in your corpus or ask your corpus manager to do it for you specify the name and directory of the file and the access rights to it See Appendix 1 Recurring schemata Access for instructions on how to fill in an Access schema 92 2 5 2 Media Files Each digitized media file e g audio video or image file associated with the session has to be listed on the Media Files screen To add a media file see also section 1 3 do the following E IMDI Metadata Editor File View Options Help itor HTML Links E asma unknown unknown unknown unknown the list of media files click on Add to add another file to edit or remove a file click on the file to highlight it then click on Edit or Remove The Add and Edit buttons give you access to the Media File screen that contains the following kind of information BS IMDI Metadata Editor File View Options Help section tor um 2 5 2 Kem i ReemCelD N10198 TreesSM 1 wav
202. t The IMDI Corpus Editor creates an IMDI Corpus file imdi The following steps are necessary to create structure and manage a corpus 1 Specify the directory information of all info media written resource and lexicon files that belong to a session and of all IMDI files that belong to a corpus see section 4 1 2 Select the sessions that belong to a subcorpus Select the subcorpora that belong to a corpus see section 4 2 3 Create a corpus node see section 4 3 These steps are explained in the following three sections 4 1 Specify the directory information of files The IMDI Editor creates links to different types of files These links are entered into fields that are labeled Link see also section 1 5 3 e In the IMDI Session Editor Link fields exist for info files see Appendix 1 Recurring Schemata Descriptions media files see section 2 5 2 and written resource files see section 2 5 3 e In the IMDI Lexicon Editor Link fields additionally exist for lexicon files see section 3 1 e Inthe IMDI Corpus Editor Link fields exist for IMDI Session IMDI Lexicon and IMDI Corpus files 1 e files created with the IMDI Session Lexicon or Corpus Editor see section 4 2 In order to link files you need to provide information about a their names and b their directory location Usually if your corpus is managed by a corpus manager you only need to enter the file name In this case type the file n
203. t Descriptions RFC17B5B5 sil er T Contact Name Birgit Hellwig Language RFC1766 x sil eng lel Address Postbus 310 6500 AH Nijmegen The Netherlands Text he media file is not available to the public E mail Birgit Hellwig mopi nl e Organisation Max Planck Institut f r Psycholinguistik description of the access restrictions information about whom to contact in order to 4 Clear Access gain access 2 Clear Access i Open Policy Save Policy Cancel Cancel Information General information about access rights to the resource Availability Information about the availability of the resource This field is not standardized yet for the moment please enter a prose text e g not available available in 5 years available to person XY Date Date at which the access rights were set Please enter the date in the following format YYY Y MM DD e g 2000 12 30 Owner Name of the person institution that owns the resource Publisher Name of the publisher who is responsible for the distribution of the resource 125 Contact Information about whom to contact in order to gain access to the resource Name Address E mail address and Organisation Descriptions Prose description of the access restrictions 126 Appendix B IMDI Browser version 2 0 Manual This manual was last updated 30 Jun 2004 The manual can be download
204. t are displayed in the form of pull down menus some of them only accept values from the pull down menu while others allow you to enter alternative values see below Furthermore it is sometimes possible to enter more than one value in this case please separate the values with a comma See section 1 5 2 for the different options and how they are symbolized in the IMDI Editor Genre and Subgenre Information about the genre and the subgenre whereby Subgenre is dependent on Genre I e the value that you enter under Genre will determine the values that are available under Subgenre Both Genre and Subgenre are open vocabularies i e Please see the document IMDI Metadata Elements for Session Descriptions under htt WWW mM l nl IM DI for updated lists of values and their definitions 68 you do not have to select a value from the pull down menu but can enter an alternative value instead The following values have been suggested Genre Examples Subgenre Discourse Narrative a folktale a historical narrative a personal experience narrative Oratory a summing up speech by a legal counsel a political speech a church sermon Procedural a recipe an instruction on how to build a house Formulaic proverbs greetings leavetakings Language Play riddles humor Interview Conversation Description a description of the layout of a compound Uninte
205. t is necessary to quickly A FR remind you about this four different sites all in the vicinity of Kwande beyond SCSSIOD However it should the football field The four media files correspond to not contain an elaborate these four sites The recording took place in early Z evening Because ofthe twilight the video recordings description of the content are nottoo clear reserve this information for the Content screen see section 2 3 2 E g 62 See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 2 1 3 Location The location at which the data was collected For example Descriptions Location Keys continent and country Location where the data was Continent Africa collected Country Nigeria address where the data Address Was collected C B at school XY Regions Region Qu an Pan Local Government Area kwande beyond the football field possibility to specify further regions amp Clear Session Continent The continent where the data was collected Choose the continent from the pull down menu or type it in It is a closed vocabulary 1 e only values listed in the pull down menu are accepted by the
206. ta Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 95 2 5 3 Written Resources Each written resource associated with the session has to be listed on the Written Resources screen To add a written resource see also section 1 3 do the following E IMDI Metadata Editor File View Options Help B ss e ES Editor HTML 35 Links E EE i Resources Anonymous Media Files writen Resources Se Type Format Reshurce Link Annotation EAF TreesSM 1 eaf E click on Add to add another Written resource the list of written resources to edit or remove a written resource click on the written resource to highlight it then click on Edit or Remove 96 The Add and Edit buttons give you access to the Written Resource screen that contains the following kind of information Ej IMDI Metadata Editor loj x File View Options Help B am 9 section 4 D Edtor B ur B Li 2 5 3 l A Written Resource Resource ID N101DD Date 2001 06 26 section Resource Link section 2 5 3 2 Pdia Resource 2 5 3 4 section section pudicus ciocrob lacs Jiteral transl 2 5
207. ta Editor Ie x fie View Options Help E an 9 IC cuo f Hmc us 5x Language Name looemal evi ID RFCT TG siLank i Bj IMDI Metadata Editor Mother Fae eta ee File View Options Help i ES fie View Options Help ls am ol Dominant La PEE Eator HTM IE Links eo caror mac 33 unie es EZ Source Lar i Actor Language ummary Actor Information iE Name Hausa fev RoleConsultant NameJan R A Coded Social Family Role Age24 SexMale Anonymizedfafse co 0 Language sa Edo inizi screens om e 9 Wm GOEMAI RFC17663csil dhnk P umen 7 uris c HAUSA RFC17683csil hfTa general i p ENGLISH RFC17662 sil ang information about each language Name ENGLISH Ip RFC T6632 si ENG Tongue laise dE SS Primary Language false Dominant Lannua BUIEL lnquae ETETETIEESS D rsicripitiars neo meme 5 tens Fe IMDI Metadata Editor File View Options Help ummary Actor Information Text FEruglish is tue naliona Erud of Mig Officially it is he larsgurangie usid ir adminishaian arid higher education especially Migarians ling iri mural areas are not millar with English description of the actor s use m Of and familiarity with the a languages am Aged Languages D ngl
208. ta file in the IMDI Editor see section 6 3 4 1 List Sessions Lists all sessions contained under this node in the Info Content panel see section 6 4 Media Synchronized Viewers Opens the session annotation in a separate Viewer synchronized with the media file see section 6 3 4 2 MetaData Search Opens the IMDI Metadata Search window see section 7 Open with Acrobat Reader 5 0 Opens the file with Acrobat Reader 5 0 see section 6 3 4 3 142 Open with Winamp Remove Remove from basket Save File Content Session Count Show Show As HTML Show Content Type Show Description Show File Content Show format Show Info Show LR s Show Services Show URL Windows Media Player Opens the file with Winamp see section 6 3 4 2 Removes the node from the Metadata Descriptions Tree panel Removes the node from the list of nodes to be searched see section 6 3 3 Downloads the file in a non compressed format Displays the number of sessions contained under this node in the Info Content panel see section 6 4 Displays the file content in the Info Content panel see sections 6 3 4 2 and 6 4 Displays the file in an HTML format in the Info Content panel see sections 6 3 4 1 and 6 4 Displays the file format in the Description panel see section 6 5 Displays the file description in the Description panel see section 6 5 Displays the file in an XML format in the Info Content p
209. the bottom of the IMDI Metadata Search window to start the search During the search the number of hits matches to your search criterion is shown as well as the progress towards completion of the search Once the search process has started you can use the Stop button to stop the search 165 Note Because of the large amount of data that 1s searched it may take some time before the search 1s completed 7 4 Displaying the search results The box Query specification and results displays the results i e the name and location of each session You can access any search result by double clicking on it This will open up an IMDI Browser window containing the selected session its metadata descriptions and its media and annotation files 7 5 Saving the search results You can save the search results in form of a corpus node This corpus node is displayed in the Metadata Descriptions Tree panel of the IMDI Browser window Do the following 1 In the IMDI Metadata Search window enter a name into the text box following the button Save results in 2 Click the button Save results in The results are saved under the specified name You can access the results through the IMDI Browser interface Do the following 1 In the IMDI Browser window go to the Bookmarks panel 6 2 2 Double click on Search Results The Metadata Descriptions Tree panel see section 6 3 displays the new corpus node below the node labeled Results i I
210. the code To create such a file do the following 1 Click on Options 2 Click on Anonyms 30 The Anonyms overview dialog window appears It allows you to create modify and remove mappings and it displays all available mappings e g Ef Anonyms overview ES Institute Project User Code Name MPI Goemai birghe Code X Birgit Hellwig MPI Goemai birghe Code_A Li MPI Goemai birghe Code_J S Anonyms overview dialog window Click on a column header to sort The following options are available You can sort the available mappings Click on any of the column headers to sort the corresponding column alphabetically To add a mapping click on Add The Anonyms input dialog window appears Please fill in the institute e g MPI the project e g Goemai the code of the actor and his her full name Ef Anenyms input To save the mapping click first on Apply and then on Close Project The new mapping will be displayed in the Anonyms overview dialog window Code Note The code that you enter into the Anonyms input dialog window should be different from the code that you enter in the field Code on the Actor screen see section 2 4 1 If the codes are identical the IMDI Editor will also display the full name in the field Code on the Actor screen not the intended code Note The column User in the Anonyms overview dialog win
211. the following 1 Click on the node that you want to export It will be highlighted in blue color 2 Make use of the Export options a Either click on the Export button E Collectors C g E Content Ef Participants e Languages E Access Projects J New Folder Export button b Or right click on the highlighted node and then click on the Export item of the pull down menu e g J New Group Export item APH ATE oun C Export The Export Repository dialog window appears 3 Give the name and location of the file that should contain the exported node Then click on OK or press the key ENTER 2 Browse to the location 46 The file 1s saved in the specified folder Furthermore the exported repository is displayed as an additional node in the Repository panel e g 1 Exploring D WINNT Personal test File Edit View Tools Help E alle m E E Local Repository The files are saved in the specified folder T d Al Folders 0000000000 Folders Contents of E WINNT Personalitest E l aM HL Software Q i Participants 2 EL sound 3 pem ns Folder a test MyParticipants File Folder 0 P e 4 MyLanguages n C MyParticipant amp E Languages E Access The exported file Projects T BLE is added to the
212. the following 1 In the Metadata Descriptions Tree panel see section 6 3 click on the node to select it It will be highlighted in blue color 2 Right click on the selected node to open a pull down menu Select Remove from the pull down menu The node will be removed Note After selecting Remove the icon of the removed node remains visible although it cannot be accessed anymore The icon will be removed after exiting the node Results 7 6 Printing You can print the search parameters see section 0 together with the search results see section 7 4 To do so click on the Print icon e g LXITCITO 9 uimiag S Session Participant w T FulNam J Query specification andre nuewe imdi sibus www mpi nl world corpora m s ww npi nl world corpora rm iptive Metadata T search parameters and intiva Ne Cuda ai nimal Miydi results to be printed pee Rue c NENNEN Search Service httpwww rnpt nthwordicolura 2 E Close Search Saveresultsim results 7 7 Closing the IMDI Metadata Search window Click on File menu then click on Close to exit the window 167 Appendix C Access Rights Management for Language Resource Archives at the MPI Manual This manual was last updated March 23 2004 Introduction The Access Rights Management web pages are developed at the Max Planck Institute for Psycholinguistics Nijmegen The Netherlands
213. tion informing you of the successful conversion 4 D Editor HTML Links Content Summary Content Information Genre Discourse Modalitiesspeech pointin Channelface to face Task Subgenre Description p Event Structuremonologue 2 Social Contextpriyate Subjectplants IMDI 1 SlInteractional IMDI 1 8 Discursive IMDI 1 S Interactional goan the old Genr additionally displayed in the form keywords UE e is 2 7 3 Conversion of old repositories to the 3 0 standard When you start the IMDI Editor version 3 0 for the first time 1t will automatically convert all old repositories to the new IMDI 3 0 standard see section 1 4 for details on repositories The converted repository is not added to the node Local Repository but to the node Local Old IMDI Version The old repositories are not deleted if you need to revert back to an older version of the IMDI Editor the unconverted repositories will be used again E IMDI Metadata Editor 1 ini xi 5 a8 0 e Standard Session Resource 1 Project Content Actors Resources References 9 Local Repository 6J Projects Content Gl Actors Languages Content 9 5 Actors 9 E Languages Goemai EJ Access Projects ile View Options Help HTML IS Links Session S
214. tions Help inl xi l B ug e 2 HTML E Links Be Content Summary Content Information Channelface to face Subjectplants Genre Discourse SubgenreDescription Task Modalitiesspeech pointin Social Contextprivate Event Structure monologue hh nt sM p liam ID E IMDI Metadata Editor File View Options Help 4 gt D Editor HTML 3 Links mu IMDI Metadata Editor Mel xl le View Options Help A ag o l 9 HTML 3 Links Bm Language Mother Tory Primary Language La n g ua g e screen Dominant Langua Content Summary Content Information Genre Discourse Subgenre Description Task Modalitiesspeech pointin Soojal Contextprivate Event Structure monologue Channelface to face Subjectplants Source Language Languages Languages Languages Descriptions Target Language Name Goemal v ID IRFC1786 csik ank amp information about the language see section 2 3 3 1 Descriptions Languages Descrip Text Goemai is the only language that is used throughout his session Both during the recording itself and in he preceding non recorded instructions given by me nglish description of the how the language s are used in the session see section 2 3 3 2 wO S0639 en Language ISO639 eng iy cv g Text Goemaii
215. tm and the Ethnologue identifiers under http www ethnologue com web asp 2 0 3 9 Validation Gives information about the validation state of the written resource e Type The type of validation It is a closed vocabulary 1 e you have to choose a value from the pull down menu 1 e value comments formal content e Methodology The methodology of validation It is a closed vocabulary 1 e you have to choose a value from the pull down menu 1 e value comments hand automatic semi automatic e Level Gives an estimation of how much of the resource was validated enter a value between 0 and 100 e Description A description of the validation See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in a Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 2 5 3 4 Access Information about the access rights to the file containing the written resource See Appendix 1 Recurring schemata Access for instructions on how to fill in an Access schema 2 9 9 Keys Keywords that are relevant to the written resource See Appendix 1 Recurring Schemata Keys for instructions on how to fill in a Keys schema 2 5 3 6 Descriptions A description of the written resource See Appendix 1 Recurring Schemata Descriptions for instructions on how to fill in
216. tor A more comprehensive manual of the IMDI Metadata Editor can be downloaded from www mpi nl IMDI tools 10 1 Viewing the available controlled vocabularies The IMDI Metadata Editor can display a list of all available controlled vocabularies To view this list click on the menu item View and then on Vocabularies The Vocabs overview window opens and displays information on the Name the Tag the Link To Canonical CV the Description and the Language of a controlled vocabulary see section 9 2 for details Furthermore it informs you whether the file is stored locally an X symbol is entered into the column Local Only or on the Web the column Local Only remains blank For example Es Vocabs overview x Used vocabularies until now Name Tag Link To Cannonical CV Descriptio Language Local Only ignS httpJwww let kun nl sign Iang lMDl vocabs SignSystemsHow often does the actor use DS839 eng X SignSystemsUse Signs SignSystemser upportedSpeech AedSpeech Use SignSupportedSpeech xml sign supported speech in everyday communication http veww mpi nlilldDl SchemaiContent Genre DiscursivA list of values for the exml genre discursive aspect D b http www mpi nl l dDI Schema Content Genre DiscursivA list of values for 1 e xml genre discursive a information about A list of li isti s ema available controlled Dummw file used fi vocabularies key CGN locName
217. ture is optional if you don t want your users to accept any agreement select none Click the AddGroup button to create the group Click the Reset button to clear all fields 3 2Remove a group From the main index window click the remove a group link In the opened page all groups are listed with the creator If you want to list only the groups that you have created check the creator s only box You can select how you want the list to be ordered by group name creator first name or creator last name from the pull down menu Next click the Del button in front of the group you want to remove You can only remove groups that you have created 175 3 3 Modify groups From the main index window click the modify groups link The opened page allows you to add and remove members from a group You can select more members at a time to add or to remove Only creators are allowed to modify their groups To add or remove users do the following E Browsable Corpus Access Rights Management Microsoft Internet Explorer provided by MPI Nijmegen f o xj File Edit View Favorites Tools Help Back gt gt O A GQ Qusearch Favorites eda Ay GS EO Select the group you want to Jiesheng modify from the pull down menu Listed are all groups and their Access Rights Management creators in brackets Modify a group i move members from a group You can sele
218. unction The IMDI Browser allows you to search through the metadata files By default this search 1s done throughout an entire corpus see section 7 but it 1s possible to limit it to one or several selected corpus and or session nodes 1 e to limit it to nodes that have been put in the basket To put a corpus or session node into the basket do the following 1 In the Metadata Descriptions Tree panel double click on the node to open it 2 Click on the open node to select it It will be highlighted in blue color 3 Doone of the following a Click the Add button at the bottom of the Metadata Descriptions Tree panel b Or click with the right mouse button on the highlighted node then select Add to basket from the pull down menu 145 The icon of any selected node will change its color to gray e g non selected node selected node Once an item is selected the List button at the bottom of the panel will be highlighted in red color Note The red color was chosen to alert you to the fact that a list has been compiled The existence of such a list has consequences for the search process metadata search will only search the listed nodes 1 e the nodes that were put into the basket see section 7 1 If the search process does not yield the expected results please make sure that the list contains all relevant nodes And if you want to search through an entire corpus not just through the listed nodes you h
219. until the IMDI Browser responds to your command and opens a node Click on any closed node and then point with the mouse to it to activate the tool tip that displays brief information about the node Click on any open node to select it It will be highlighted in blue color The information relevant to the selected node is displayed in the Info Content see section 6 4 and Description panels see section 6 5 Right click on any selected 1 e highlighted node to open a pull down menu that displays the available options Click on any item in the pull down menu to select the corresponding option Depending on the kind of node the following options are available Add to basket Adds the node to the list of nodes to be searched see section 6 3 3 Add to Bookmarks Adds the node to the Bookmarks panel see section 6 2 Bookmark Info Displays information about the bookmark in the Info Content panel see section 6 4 Clone Node Opens a second IMDI Browser window that displays only the corresponding node Create Service Files Creates a file that is needed for searching through the metadata see section 7 1 2 Delete Bookmark Deletes the node from the Bookmarks panel see section 6 2 Delete Local Service Files Deletes the locally stored file that is needed for searching through the metadata see section 7 12 ELAN Opens the media file or the session annotation in ELAN see section 6 3 4 2 IMDI BCEditor Opens the metada
220. urce archives act as data providers and create DC like descriptions according to the Open Archives Initiative model IMDI does not only tackle the aspect of resource discovery via searching but also wants to support browsing and resource management Further the IMDI set was designed bottom up to meet the needs of the language resource community in more detailed types of descriptions It should be mentioned here that this work is carried out in relation to WP3 and the newly formed ISO TC37 SC4 committee All categories from IMDI were entered into the Data Category Registry It is the intention to also enter all OLAC categories and to make the relations that are now included in wrappers explicit and available via open repositories Yet there is no proposal for open relation registries that could be used by search engines Access Rights Management Another very important framework that has been developed recently and that is based on the IMDI infrastructure is a complete framework to define users and groups and associate access rights with the groups The framework also allows managers to define new managers to delegate the right to define rights since otherwise the access administration is not tractable Also here for reasons of efficiency it is possible for the manager to select a node in the metadata domain and assign access rights to different types of resources that can be found under this node A user manual was being written that describes the ful
221. user class definer you want to authorize from the User name pull down menu and click the submit button 2 2Add a new access rights manager from the current point in tree Instead of creating a new access rights manager from a top node authorized persons can also create a new access rights manager from the current point in the tree From 173 the main index page click the add new access rights manager from current point in tree link To add a new access rights manager do the following E Rrnwsablr Corps Access Rights Management Minos Internet Esuplorer provided by MPI HEjmegen Eile Edt Wem Pavontes Tools Hap Pk OA E sen te em F O oo ao Peres T 3B hip lust mpi nd cgi bin fem ran beiren poe CLR ias In the opened page the current Access Rights Management node is indicated under Current node Select the correct sub corpus node that will be the starting point from the Node name pull down menu You can click a field in the list and scroll the list Next select the definer from the User name pull down menu and click the Submit button 2 3 Removing access rights managers To remove an access rights manager click the remove a manager link from the main index page In the opened page all managers are listed with login name creator and path to the node that is set as the starting point for the sub corpus node If you want to list only the managers that you have created ch
222. utton Links at the top of the Content panel to view all links e g IMDI Metadata Editor lekh 0 x File View Options Help CLICK here to view u a o all links KE E Editer 9 HTML A Links Source LBHGWQ15Jun01 2 P TreesSM_1 wav N10198 reesSM_1 mpg N1015 00 00 00 0 00 00 43 0 00 00 00 0 00 00 43 sources actors and languages Actor Birgit Hell reesSM_1 eaf N101DD Actor Jan Smits reesSM_1 wav N10198 00 00 and the reesSM_1 mpg N1015 00 00 00 0 00 00 43 0 D0 0 00 00 43 resources Languagecof they are reesSM_1 wav N10198 linked to sesSM 1 mpg N1045 00 00 00 0 00 00 43 0 reesSM_teaf N 101DD 00 00 00 0 00 00 43 resources file name resource identifier and time position if applicable Within this Link window you can click on any source actor language or resource to jump to the corresponding screen Note If you remove a resource the specified links are automatically removed as well 2 5 1 Anonymous If you have made the names of actors anonymous on the screen Actors see section 2 4 1 1 and if you have created a file that maps these codes onto full names by using the menu item Options Anonyms see section 1 2 3 2 you can use the screen Anonymous to specify access rights to this mapping file As illustrated in section 1 2 3 2 the mapping file 1s
223. video data link to transcription and annotation file s link to info file s providing general background information oF HI B link not available either because the file does not yet exist or because access 1s denied 147 Click on any of these icons and the IMDI Browser will display the relevant information 6 3 4 1 Metadata information Metadata information is displayed under each session node It allows you to a read about the circumstances under which the session data was collected and b search for relevant data see section 7 It contains the following kind of information e Information about the date and location Information about the project within which the data was collected e Information about the person who collected the data e Information about the content e Information about the participant s e Information about the source i e the audio video tape the digitized media file s and the annotation file s e Cross references to other relevant sessions and publications To view the metadata information of a session click on the corresponding metadata icon The information is displayed in the Info Content panel see section 6 4 or the Description panel see section 6 5 Ej IMDI BCBrowser for MPI on M03402985 Bl x File Options Search Help Metadata Descriptions Tree Description ES Description of the uses oftrees We EN A b a gt 9 E TreesSm
224. vocabulary 1s added to the Keys schema SILT BERS ARERR REELS UE T EO EO ER ROVER Een ET SPN Set oe eC POE TOTO See ee aoe hor De EO TO ee Ee Oe ee eee Pd Org PSE Pane oF a Pe See ee DE oe Pe ed PY MN RR CY QR QE IE EN QE IE I E LEER Y EEL E UY NI PERE I Y ER EI IE Y I QE EY ATER UE Y ETO I ETE I LOL EPEAT INI Y EIE RE ER Eee Editor HTML 3 195 of 196 Intera D2 2 1 196 of 196
225. w Favorites Tools Help Access Rights Management for Language Resource Archives at the MPI DOBES Corpus Dutch Spoken Corpus MPI Copora ESF Corpus These pages allow authorized persons to define access rights for language resources that are part of the archives stored at the MPI This ights managemer im is the second version This version is based on centralized mechanisms For efficiency reasons read rights can b ated with nodes in the corpus structure The specifications are extended to those video audio image and textual resources that can be found under the selected node It should be mentioned again that metadata descriptions are open It is assumed that a clear responsibility exists for certain sub corpora In DOBES for example every sub corpus resulting from a specific project team has a responsible rese for that sub corpus Central corpus management at the MPI has to take care that these top privileg this version only central corpus management is allowed teary uut wite and delete operations Current point in the tree The current point in the corpus tree RUNE is always indicated All available Eere on the main index aa oe ERE page Authorized persons ca features are ne modify gro define There is a link back to the main index page on every If you want to access one of these links you will be asked to identify your self Users can modify their password by clicking here This manual explains an
226. which the metadata information is entered ston Participants see section 1 5 2 hat E omai E Tenisu Bua db Pii Collector e Some screens contain uum sub screens that are E Ein booted fete The four m dia ties comespoed to Tarte Tour ies Thereccedng look glace in early ererig Decauee of Era ibah tha ides meardinga are netted clear amp GE Paripari displayed like filing Linguapes t E Access amp EZ Projects cards These sub screens give access to additional schemata and fields 52 Some screens have Add buttons and corresponding Remove and Edit buttons that allow you to enter multiple types of information of the same kind e g one sub screen added for each language spoken by an actor These additional sub screens are then displayed as separate sub nodes in the Browse panel see also section 1 3 e g guage Name Id ENGLISH RFC17B5 sil eng HAUSA RFC1766 x sil hua 3 sub screens GOEMAI RFC1766 sil ank displaying 3 languages To add a new sub screen click on Add IMDI Metadata Editor MEE To modify a sub screen click on it to highlight it Esie fe ae as E Treessm and then double click on it or click on Edit E Project E y Ron 4 4 To remove a sub screen click on it to highlight it Q 4 Participants Pisa and then click on Remove ENGLISH HAUSA
227. will be displayed e To modify a value click on it to select it then click on Edit to modify it The Controlled Vocabulary Entry window will be displayed e To delete a value click on it to select it then click on Remove to remove it It will be removed without further warning ini x A Controlled Vocabulary Editor Continents ger xml File Hame Continents ger Language German v Tag Kontinente Title Die Kontinente URL http va mpi nviMDbuSchema continents xml Description i of linguistic continents in German Tag Value Description Unbekannt Unknown to be used ifthe continent is unknow Unspezifiziert Unspecified to be used ifthe continent is unspecified Afrika Africa Asien Asia Europa Europe use these buttons to add Australien Australia Ozeanien Oceania modify or delete values Nordamerika North America Mittelamerika Middle America S damerika South America Sorted on column Description Note If you work with a controlled vocabulary that 1s predefined by the IMDI tools the buttons Add and Remove will be disabled This is also true for different language versions of predefined controlled vocabularies That 1s 1n such cases you are only allowed to modify existing entries 188 of 196 Intera D2 2 1 When the Controlled Vocabulary Entry window opens up you are asked to enter the following information d me Controed Vocabulary
228. xcept for the underscore and do not use blank spaces 2 Corpus Title The complete title of the corpus or subcorpus Usually it is the spelled out version of the abbreviated Name 3 Descriptions Descriptive information about the corpus or subcorpus See Appendix 1 Recurring Schemata Descriptions for instructions on how to fil in a 117 Descriptions schema Remember The field Language refers to the language in which the description is written not to the language under investigation 4 Corpus Links Specify links to a all IMDI Session files or b all IMDI Corpus files that belong to the subcorpus or corpus a Specify the IMDI Session files that belong to the subcorpus E g in the IMDI Corpus file Natural imdi 1 e Natural data specify that it contains the IMDI Session files Session1 imdi Session2 imdi and Session3 imdi 3 Exploring D WINNT Personal IMDI Metadata_files File Edit View Tools Help S Metadata filas zl rl qp 2 E cma BI Ef Natural imdi IMDI Metadata Editor v0 11 Schema vlM Mi E3 E E Xe E File View Window Help pr of DO AINN T Persona AMON Metacdata_files General Location Project Collector Content Participants References Type Elicited imdi 3KB IMD Fi Corpus Corpus Name Natural Ie E inm d Corpus Title Natural data I Has 7EB IMO Fi FEB RADI Fi Corpus Links Descriptions Keys KB IMDI
229. xicon Resource Bundle fi New Corpus ci Open Ctr O Open URI Ctrl U Close All B Save All Page Setup Print Ctrl P Exit Ctrl Q 1 2 1 1 New New allows you to create new 9 J Resource Bundles Sessions IMDI files and to assign them to a IN Standard Session Resource Bundle type and a profile see section pes I Lexicon RESturem Bart 1 2 3 1 for details L corpora profiles Choose first a type e g Resource Bundles Sessions Lexicon Resource Bundles or Corpora and then a profile e g Standard Session Resource Bundle Click on the profile then click on OK The new file will be assigned to this type and profile 1 2 1 2 New Session Click on New Session to create a new IMDI Session file It will be automatically assigned to the default profile for Resource Bundles Sessions see section 1 2 3 1 see section 2 1 2 1 3 New Lexicon Resource Bundle Click on New Lexicon Resource Bundle to create a new IMDI Lexicon file It will be automatically assigned to the default profile for Lexicon Resource Bundles see section 1 2 3 1 see section 3 24 1 2 1 4 New Corpus Click on New Corpus to create a new IMDI Corpus file It will be automatically assigned to the default profile for Corpus bundles see section 1 2 3 1 see section 4 1 2 1 5 New Catalogue Please ignore this option for the moment 1 2 1 6 O

Download Pdf Manuals

image

Related Search

Related Contents

Cobra Alpha HD User Manual  pH 300/310    Philippe Claudel, Bon anniversaire, Monsieur  Instrucciones de funcionamiento  brochure  Atmel ATSAMR21ZLL-EK (USER GUIDE)  Kensington 70W Network Card User Manual  Octobre 2011 - Info Sierre N°17    

Copyright © All rights reserved.
Failed to retrieve file