Home
User Manual - Respondent Driven Sampling
Contents
1. 14250028 114250016 14250040 Sample Size 264 Number of Coupons per Recruit 7 jojo ojojolojojo Value for Missing Data 0 14256002 14250013 14250013 14250019 14250005 14250031 14250004 114250034 14250012 14250103 14250026 14250037 FIGURE 2 3 RDSAT Spreadsheet View 18 T RDS INCORPORATED Setting Options For Analysis Before conducting an analysis check the options that will be used Choose Options from the main window The window of figure 2 4 will appeat Number of Resamples for Bootstrap Psoo Confidence Interval tail alpha hos F9 cut Outliers of Network Sizes Minimum Net Size Maximum Net Size Algorithm type Aus Data Smoothing OEnhanced Data Smoothing FIGURE 2 4 RDSAT Options Window Adjust Average Network Sizes In a chain referral sample those with more connections and larger personal network sizes tend to be over represented in the sample This can potentially bias sample estimates The phenomenon can be corrected however and the RDS analysis tool does so by default To learn more about the methods used refer to Sampling and Estimation in Hidden Populations Using Respondent Driven Sampling by Douglas Heckathorn and Mathew Salganik If you do not wish to adjust the average network sizes for this sample bias uncheck the flag Note For NHBS i
2. occurrences of a period to the missing data value integer This can be done by clicking Edit gt Replace in the Excel menu bar In the window that appears type a period in the Find what textbox and the missing data value in the Replace with textbox see Figure 1 8 Then click Replace All 11 RDS INCORPORATED Preparing data from SAS If the data to be analyzed is in a SAS data file then the following steps will transform the data from a SAS data file to a data file that can be read by RDSAT First export the SAS data file using the following code fragment The portions highlighted in bold are specific to the dataset and must be altered data one set name of your main SAS data file gt file Target Directory RDSATdata txt gt put 1 SurveyID RDS_INJ Coupon submitted Coupon given 0 Coupon given 1 Coupon given 2 age sex race Run Note The lt gt brackets indicate that user fills in this information Age sex and race are examples of variables you might want to analyze There are two features of note in the above code First the output file must be a text file suffix txt or a data file suffix dat RDSAT only reads these file types Second the variables that comprise the main data set SurveyID RDS INJ Coupon submitted Coupon given 0 Coupon given 1 Coupon given 2 must be in the order shown above Then add variables you want to analyze such as age sex race RDSAT requires th
3. proportions This helps in determining how many waves of recruitment are necessary before the population is at equilibrium First click on Estimate Number of Waves Required in RDSAT s Analyze menu This will cause the window of Figure 6 2 to appear Then select a starting group for a hypothetical sample Next choose a convergence radius The smaller this number the higher the confidence intervals will be However the dataset will take longer to analyze The default is 02 which should setve as a good starting point A radius of 02 means that the population proportions will change by less than 02 with further recruitment In other words the sample population proportions are considered converged at 43 RDS INCORPORATED equilibrium when the change in population proportions in between waves is less than the convergence radius times of the population proportions Select analyze and this utility will use the Markov process implicit in the calculated transition probabilities to check how many waves are required for the sample population proportions to reach equilibrium The results of the analysis will be output to a new report page See Figure 6 3 NN Group with Initial Recruit Convergence Radius FIGURE 6 2 RDSAT Waves Estimation Window EEE Data File y mem EA Number Of Waves Required 4 NN bee istory of convergence of sample population proportions Wave number 1 Group Number 1 1 0 Group Num
4. refer to the documentation included with this distribution More help and resources are available on the web at http www respondentdrivensampling org FIGURE 3 1 RDSAT Analyze Partition Button 21 RDS INCORPORATED A partition is a user defined set of groups Everyone in the population belongs to a group in a partition The groups are defined by common traits For instance a simple partition would consist of just one trait such as gender Those with a gender of 1 in this case male would form one group those with gender of 2 female another A multi trait partition of race and gender can also be created A group would then be defined by both a gender and race value For example race gender 1 1 white male would be a separate group from race gender 2 1 black male although both groups have the same gender Analyze Partition Attributes Attributes to be analyzed Q9 Analyze FIGURE 3 2 RDSAT Analyze Partition Window The partition panel is divided into three parts see Figure 3 2 The top left contains a list of all traits that may be used for analysis The top right contains a list of all traits that will be used to make the partition The bottom contains options for parsing the trait data To include a trait in the partition select it and press the right arrow To remove it from the partition select it and press the left arrow For each of the traits included
5. Recruits age 51 or older Likewise a Step of 1 would produce 27 different categories one for recruits 25 or under one for a recruit of every age between 25 and 50 and one for recruits age 51 or older 25 RDS INCORPORATED Chapter Interpreting Analysis Results vatious size and proportion estimates are explained along with their T his chapter explains how to interpret the results of an RDSAT analysis The cotresponding graphs and diagrams Interpreting a Partition Analysis First create a simple partition with one variable and the complete option as shown in Figure 4 1 Click Analyze Attributes to be analyzed Mereakpoint e ZD gt FIGURE 4 1 RDSAT Single Variable Partition Analysis 26 RDS INCORPORATED After a moment the results of the analysis will be output to the pages in the main window To move between pages of the analysis click on its corresponding tab Recruitment Displays general statistics regarding the recruitment Estimation Network Sizes and Homophily Graphics and Histograms Data Smoothed Transition Probabilities Demographically Adjusted Recruitment Matrix Key of Group and Trait Correspondence FIGURE 4 2 RDSAT SINGLE VARIABLE PARTITION ANALYSIS RECRUITMENT Note Seeds are not included in the sample population sizes 27 RDS INCORPORATED Key of Group and Trait Correspondence The Key of Group and Trait Correspondence is used to interpret the d
6. homophily within 3 different groups Each group is shown as a sepatate bar This graph illustrates that Group 2 the middle bar has the highest homophily roughly 3 followed by Group 1 the leftmost bar and Group 3 rightmost Population Proportions Population Proportions 1 0 0 8 This graph displays the population proportions of each group The y axis is the population proportion and should be read as a percentage We see that Group 1 the leftmost bar comprises more than half the total population followed by group 2 and 3 33 RDS INCORPORATED Average Adjusted Network Sizes Avg Net Sizes 120 115 110 105 100 This graph displays the adjusted network sizes of each group Observe that group 3 the rightmost bar has the highest network size Transition Probabilities This is a 2 dimensional histogram of the transition probabilities A brighter color cotresponds to a higher value It is basically a way to better visualize the corresponding transition matrix Transition Probabilities 34 RDS INCORPORATED Degree List List of all network sizes reported in the sample The list is sorted from least to greatest fot easy view of the distribution Sorted Degree Sequence Degree 3900 800 700 600 500 400 300 200 100 0 150 Recruit In the graph above we see that there are a few respondents with networks as large as 900 but most respondents fall within a degree of 100 30
7. in the partition how to patse the data values must be selected Data Parsing Options Complete This option will find every distinct value in the data file associated with that trait and create new groups based on that value For example if the trait gender has two values in the data file 1 2 the complete option will make a new group associated with each of these values If the trait race has three values 1 2 3 then the complete 22 RDS INCORPORATED option will create 3 more groups corresponding to those trait values If both gender and race are included in the partition there will be 2 x 3 6 groups in all race gender 1 1 2 1 8 0 1 2 2 2 B 29 Breakpoint This will take every value below the specified breakpoint and create a new group based on it a 2nd group is created based on every value greater than or equal to the specified breakpoint This is different from a breakpoint analysis discussed in the next section in that only one breakpoint is chosen for the dataset rather than a range of breakpoints The analysis is identical to a complete partition analysis with the exception of creating exactly 2 groups from a partition in the dataset rather than one for every possible trait value For example the trait age has a range of values associated with it It would be impractical to create a group for every distinct age but by choosing breakpoint with a value of 40 the population can be
8. or choose the data type that best describes your data Original data type pe that best describes your data i Characters such as commas or tabs separate each field Fields are aligned in columns with spaces between each field Fixed width import at row fi File origin 437 OEM United States Preview of file C Tempirdsatinyjazz txt a 350 O 14250004 14250005 14250006 14256002 0 0 901 1 1 40 350 0 014250007 14250008 14250009 14256003 0 0 902 1 264 0 1 2 585 0 14250010 14250011 14250012 14256004 0 0 903 2 3 41 585 hd J Y Cancel lt Back u gt Finish FIGURE 1 9 Excel text import window To load data exported from the RDSCM v2 0 click File gt Open in Excel s menu bar and select the exported data The window of Figure 1 9 should appear Select Delimited in the file type section and click Next Note For NHBS variables such as network size gender race age etc will be found in the questionnaire data file and cannot be exported from RDSCM v2 0 14 RDS INCORPORATED Text Import Wizard Step 2 of 3 2 xl This screen lets you set the delimiters your data contains You can see how your text is affected in the preview below v Treat consecutive delimiters as one Text qualifier Semicolon Comma Other FIGURE 1 10 Excel text import window In the next wizard screen be sure to check the box entitled Space You should se
9. 0 Bootstrap Simulation Results Shows the histogram of Bootstrap estimates of Least Squares population proportions The horizontal axis depicts population estimates for the specified group The vertical axis shows the frequency of the Bootstrap estimate Frequency of Population Proportions from Bootstrap Procedure Frequency 0 06 0 05 0 04 0 03 0 02 0 01 0 00 0 50 0 55 Population Prop 35 RDS INCORPORATED Degree Distributions Distribution of network sizes for each group and for the population as a whole The diagram below happens to be of the entire population We see that most members of the population have network sizes close to 100 or 200 and the frequency of higher netwotk sizes decreases with the exception of an anomaly at 500 Complete Degree Distribution Frequency 0 20 300 400 500 600 700 Degree 36 RDS INCORPORATED Interpreting a Breakpoint Analysis A breakpoint analysis breaks a dataset into groups based on a single continuous variable A continuous vatiable of interest might be Age where one wouldn t examine each individual age as a separate group but rather a range of Ages As such there is no recruitment data for breakpoint analyses Rather there are interesting trends to notice in Homophily and population proportion as the breakpoint is shifted and respondents ate moved from the upper group of the lower group The Estimation tab shows a table of Least Squares population estimates
10. 03 Cancun Mexico Street and Network Sampling in Evaluation Studies of HIV Risk Reduction Interventions By Salaam Semaan Jennifer Lauby and Jon Liebman AIDS Review 2002 Oo Comparison and Evaluation of Alternate Methods for Sampling Hidden Populations Review of Sampling Hard to Reach and Hidden Populations for HIV Surveillance By Robert Magnani Keith Sabin Tobi Saidel and Douglas Heckathorn In AIDS 2005 51 RDS INCORPORATED Appendix Appendix 1 The RDS Data File Components of Core Data Files Note that all data outside of the first two lines must be integer valued Header on line 1 Every core data set must begin with the string RDS on the first line Parameters on line 2 From left to right the second line must contain the following integer valued information o o o Sample Size Maximum number of coupons received by a recruit in the sample Value for missing data This value will be used throughout the analysis to refer to missing data It will over ride all other values so it is important to choose an integer value that will not occur elsewhere in the data Main data set Subsequent lines contain the main recruitment information with each line corresponding to a recruit Arrange the columns from right to left as followed o o Survey Rectuit ID an integer code acting as the recruit s name Personal Network Size The serial number of the coupon the recruit rec
11. 09 14 3 585 0 14250010 14250011 14250012 14 4 400 0 14250025 14250026 14250027 14 9 150 0 14250022 14250023 14250023 14 6 100 0 14250028 14250029 14250030 14 T ann ni 1425nn1 amp R 14 amp 5n0417 1A9RNNAR 14 FIGURE 1 2 Sample RDS Data in an Excel Spreadsheet Note In this sample data set each recruiter is given 4 coupons to distribute and the coupon numbets are 8 digits and the coupon numbers are 4 digits For NHBS each recruiter is initially given 3 coupons RDS INCORPORATED G H J K Gender mt Race wBo Age Airplay 14256002 1 1 40 1 14256003 1 2 64 1 14256004 2 3 41 1 14256009 2 2 77 0 14256008 1 1 33 1 14256010 1 3 31 2 14256006 1 2 70 1 FIGURE 1 3 Excel Spreadsheet Custom Field Headers and Data Column headers must be entered for all fields other than the main data set Le respondent or survey ID network size coupon received from recruiter coupons given to respondents such as Gender Race Age etc If a data value corresponds to a specific group for example if a value of 1 corresponds to Male and 2 to Female you can indicate this in the data set Abbreviate the group with a single character for example m for Male and f for Female Add the abbreviations in order of increasing value to the gender header surrounded by parentheses In this example the resulting header would be Gender mf Similarly to indicate for the Race header that Whites correspo
12. 7 cormell edu Douglas Heckathorn douglas heckathorn cormell edu Department of Sociology Cornell University FIGURE 1 1 RDSAT Main Window RDS INCORPORATED Preparing Data from Excel RDSAT Accepts data in the form of a text file To load an existing excel spreadsheet into RDSAT the columns of the dataset must be in the following order Respondent ID Note for NHBS this will be the Survey ID Self Reported Network Size Coupon Received from Recruiter Coupons given to Respondent C1 to C4 Other variables then follow e g gender race age etc The first two rows of the spreadsheet make up the RDSAT header The first line must be RDS The second line is the sample size the maximum number of coupons given to each respondent the symbol for missing values In this sample dataset the number of respondents in 264 the maximum number of coupons distributed to each respondent is 4 and O entries ate treated as missing data napa For NHBS the data will not include the network information RDS_INJ because it comes from the questionnaire data file and not the coupon manager data file In this case the network information must be taken from the questionnaire data file and merged into the coupon manager data file before the data is exported F4 Microsoft Excel Book3 62 File Edit View Insert Format Tools Data Window Help Acrobat 264 a 0 1 350 0 14250004 14250005 14250006 14 2 0 0 14250007 14250008 142500
13. 7 voice messages may be left for any team member Note For urgent requests please call the phone number and identify the message as urgent 50 RDS INCORPORATED References Respondent Driven Sampling A New Approach to the Study of Hidden Populations By Douglas D Heckathorn Social Problems 44 174 199 Oo The original article in which RDS was introduced Respondent Driven Sampling II Deriving Valid Population Estimates from Chain Referral Samples of Hidden Populations By Douglas D Heckathorn Social Problems 2002 O Article extending the RDS method to include calculation of standard errors and post stratification to control for differences in network size and clustering across groups Salganik Matthew J and Douglas D Heckathorn In press December 2004 Sampling and Estimation in Hidden Populations Using Respondent Driven Sampling Sociological Methodology O Article showing through both analytic means and simulations that the RDS population estimator is statistically unbiased O Outstanding Article Publication Award of the Mathematical Sociology Section of the American Sociological Association Extensions of Respondent Driven Sampling A New Approach to the Study of Injection Drug Users Aged 18 25 By Douglas D Heckathorn Salaam Semaan Robert S Broadhead and James J Hughes AIDS and Behavior 2002 Oo Empirical evaluation of some of the assumptions underlying RDS and its use to study yo
14. RDS INCORPORATED RDS Analysis Tool v5 3 User Manual RDS INC RDSAT 5 3 User Manual O RDS Incorporated 45 Beckett Way Ithaca NY 14850 Phone 607 257 0787 Table of Contents FDSAT 5 3 B35l68 ie poo eR RI et tee uno obe ania seus 3 Installing the RDS Analysis Tool v5 3 sss 3 Basic Layout Information esses 4 Preparing Data from Excel esses 5 Preparing Data from SPSS utet exe iid ax utut 8 Preparing data from SAS uoces i Insee eaae eR RARE a REIR RN 12 Preparing Data from the RDS Coupon Manager 14 Loading Viewing and Editing Data in RDSAT 16 Loading Dat s an Rt a uod eS 16 Viewing Datas s M aM terea actes ded dose et bed eh ode 17 Setting Options For Analysis 19 Adjust Average Network SiZ S oooooccccccccccccoconcconcnoccnnnananannncnnnnnnnns 19 Number of He samples usos need 19 Confidente Interval oU D UA PUDE 20 CULOS ED SE Es 20 Analyzing a Dataset cere tete tp erica teen tdeetee 21 Parton An Sica so idad 21 Data Parsing ODIOFS i Luo eden e Ioan rias 22 Complete od RP TRENDS 22 A dah seit o ie A aahtents ad E 23 Gum HO 23 Breakpoint Analysis ttd t 24 Interpreting Analysis Results essesseessssss 26 Interpreting a Partition Analysis ssssusssss 26 BAADT EIEE EEA 27 Key of Group and Trait C
15. Re Analyze with Specified Missing Data This feature allows each trait to be chosen and to specify which value the missing data within that trait to have This option can also be used to give missing data a unique value to allow groups to form on the basis of whether they have missing data To re analyze a dataset simply load it into RDSAT and click Re analyze with specified missing data see Figure 5 1 40 RDS INCORPORATED ADS RDS Analysis Tool Analyze Analyze Partition Analyze Breakpoint Estimate Number Of Waves Required Re Analvze with Specified Missing Data E Impute Missing Data and Re Analyze Burm FIGURE 5 1 RDSAT Re Analyze with Specified Missing Data are Impute Missing Data and Re Analyze Sets missing data to their most probable value given the transition probabilities For instance if someone is recruited by Group 1 and the missing data prevents that person from being classified as Group 3 or Group 4 transition probabilities of Group 1 will be used to find the most probable trait value for the recruit and then assigns him or her to Group 3 or Group 4 In cases where missing data is not distributed randomly over trait values this option can help resolve a potential source of sample bias To re analyze a dataset simply load it into RDSAT and click Impute Missing Data and Re analyze see Figure 5 2 m5 RDS Analysis Tool Analyze Analyze Partition Analyze Break
16. alizations Enhanced Data Smoothing An RDSAT option that allows analysis to take place even in a dataset with no recruitment data for a particular group Homophily A measure of preference for connections to one s own group Vaties between 1 completely heterophilous and 1 completely homophilous Impute Missing Data and Re Analyze Sets missing data to their most probable value given the transition probabilities Initial Recruits Reports the number of seeds i e people recruited by the researcher in each group Least Squares Population Proportions Reports the estimated population proportions of each group using linear least squares to solve the population equations LLS Population Weights Multiplicative factors by which the Least Squares Estimates are different from the naive estimates Partition A user defined set of groups Everyone in the population belongs to a group in a partition The groups are defined by common traits Re Analyze with Specified Missing Data This feature allows each trait to be chosen and to specify which value the missing data within that trait to have It can also be used to give missing data a unique value to allow groups to form on the basis of whether they have missing data 48 RDS INCORPORATED Recruitment Matrix Matrix of recruitments to and from each group The vertical axis rows depicts the recruiters and the horizontal axis columns show rectuits Re samples This is the number o
17. at the data be placed in this order and doing so in the output step will save time For NHBS the data will not include the network information RDS_INJ because it comes from the questionnaire data file and not the coupon manager data file In this case the network information must be taken from the questionnaire data file and merged into the coupon manager data file before the data is exported This will be the same for any additional variables you want to analyze Once the data has been exported open the file using NOTEPAD or WORDPAD and add the two line header as described in the Section of this chapter entitled Preparing Data From Excel An example header is displayed highlighted in bold in the data file fragment below 12 RDS INCORPORATED The data file is ready to be read by RDSAT Note that SAS will export the data as a space delimited data file and not a tab delimited data file RDSAT is capable of reading both file types The completed data file will resemble the example below RDS 530 11 0 sex agecat race 3 3310000000000 25 2 0000000000 50 3 17 608 607 609 18 0 5 6 0 0 N N N 10 4 20 21 414 416 41 40 17 25 23 24 000 AJ OGY UI 45 OOON N OOON N POON N NO Oo Nor 13 RDS INCORPORATED Preparing Data from the RDS Coupon Manager RDSCM v2 0 Text Import Wizard Step 1 of 3 2 xl The Text Wizard has determined that your data is Fixed Width If this is correct choose Next
18. ata related to recruitment in the analysis It lists all of the various groups that were analyzed and relates them to the trait they have in common In this example Group 1 corresponds to Race 1 Looking at the Race variable we see that the races are listed in parentheses by their initials WBO W White B Black O Other So Group 1 corresponds to the first race in the list namely White Group 2 corresponds to Black in the same manner and Group 3 corresponds to Other Recruitments Matrix of recruitments to and from each group The vertical axis rows depicts the recruiters and the horizontal axis columns show recruits For example this matrix tells us that Group 1 recruited 94 other people in Group 1 from the same group Transition probabilities Normalizes recruitments by dividing by the total number of recruitments and gives the probability of one group recruiting another For example Group 1 recruited 94 from the same group and so the normalized transition probability is 94 94 32 18 652 where the denominator is the total number of recruits Group 1 made Demographically adjusted Recruitment Matrix Gives hypothetical recruitments if each group recruited with equal effectiveness Transition probabilities implied by this matrix are identical to those of the original Recruitment Matrix It is well known that some groups of respondents recruit more than others e g HIV positives often recru
19. ates of population proportions The term naive is used because the proportion is a simple ratio of how many of a particular group were recruited to the total number of recruits It is not adjusted for any statistical biases To learn more about the methods used refer to Sampling and Estimation in Hidden Populations Using Respondent Driven Sampling by Douglas Heckathorn and Mathew Salganik Equilibrium Sample Distribution The equilibrium sample population proportions indicate each group s population size after the proportions have converged to their equilibrium value This occurs when further recruitment waves do not change the population proportion by a significant amount Population Weights The population weights can either be calculated using the linear least squares algorithm or the data smoothing algorithm depending on how the options are set for the RDS analysis In the above diagram the data smoothing algorithm was used See the Algorithms section of Chapter 2 for more information on the difference between various estimation algorithms in RDSAT 1 LLS Population Weights Multiplicative factors by which the Least Squares Estimates are different from the naive estimates 2 Data Smoothed Population Weights Multiplicative factors by which the Data Smoothed Estimates are different from the naive estimates Confidence Intervals Are obtained by bootstrapping the original sample The confidence intervals only correspond to t
20. ber 2 0 0 FIGURE 6 3 RDSAT Waves Estimation 44 RDS INCORPORATED Figure 6 3 is ascreenshot of the waves estimation output The actual output is listed below for a partition analysis of the New York Jazz dataset See Chapter 2 for more information on this dataset Number Of Waves Required 4 History of convergence of sample population proportions Wave number 1 Group Number 1 1 0 Group Number 2 0 0 Wave number 2 Group Number 1 0 836 Group Number 2 0 164 Wave number 3 Group Number 1 0 79 Group Number 2 0 21 Wave number 4 Group Number 1 0 778 Group Number 2 0 222 What this information means is that it took a total of 4 recruitment waves before the population estimates changed by less than 02 times the population proportion Assuming a convergence radius of 02 As we can see the change in proportion estimates of Group 1 from wave 3 to 4 is 79 778 012 which is less than 02 79 0158 The same is true of Group 2 Save RDS Analysis in the File menu Allows the report pages from the analysis to be saved to a formatted html file The analysis can then be viewed at any time with any web browser and it can be cut and pasted onto most spreadsheets In the current version of RDSAT only saving to HTML is possible however copying and pasting should allow the data to be imported into many applications including plain text editors Export DL Network File in the File menu Allows a DL netwotk f
21. corresponding to each breakpoint value Similatly the Network Sizes and Homophily tables are arranged by breakpoint value see Figure 4 5 File Analyze Help Rds Data File z 78 open New RDS Analyze Partition CProgram Files r Data Included Add Data Analyze Breakpoint po Uf Edit Data NH Change Options Race vyBO s A mmm Recruitment Estimation Network Sizes and Homophily Graphics and Histograms Population Proportions Linear Least Squares and Data Smoothed 37 RDS INCORPORATED FIGURE 4 5 RDSAT Breakpoint Analysis Estimation Tab Viewing the data in the graphics tab will often make patterns very clear For example in the breakpoint analysis of Chapter 3 New York Jazz musicians were analyzed based on their age Try clicking on Homophily in the graphics tab of the RDSAT main window Homophily Homophily B Lower Group 1 0 2 Upper Group 15 Breakpoint There are several visible patterns Homophily tends to zero as the age variable increases This implies that differences in age become less important for choosing relationships the older the recruits are It is also notable that the older group is always more homophilous than the younger group Finally it is possible to see that homophily is strongest where age is the lowest 25 This implies that young jazz musicians show strong preference for relationships with other young jazz musicians 38 RDS INCORPORATED Po
22. divided into a group less than 40 years old and a group 40 years old and greater Custom This allows partitions to be created based on non overlapping ranges of values For instance selecting a trait such as age and using a custom partition with parameters 10 20 21 30 31 40 41 50 would create 5 groups based on 5 intervals of ages Each range must be enclosed in curly braces and delimited with commas Ranges should not overlap Upper and lower bounds may be the same however e g 30 30 if a group must be based on only one value Note It is very easy to create a partition with a large number of groups e g mote than 10 by selecting complete with a trait with many values e g age In general the amount of data is insufficient to handle partitions with such a large number of groups and the analysis will fail 23 RDS INCORPORATED Breakpoint Analysis Breakpoint analysis allows one trait to be analyzed over a range of possible breakpoints This is very useful for continuous vatiables such as age ADS RDS Analysis Tool EJES File Analyze Help Rds Data File z T E Open New RDS C Program Filesrdsatnyjazz txt i E Data Included Add Data Gender MF Race WBO Edit Data Recruitment Estimation Network Sizes and Homophily Graphics and Histograms Respondent Driven Sampling Analysis Tool v 5 0 1 If you are new to Respondent Driven Sampling refer to the docume
23. e the data line itself up properly at this point see Figure 1 10 Finally click Finish Ea Microsoft Excel FromRDSCM txt E File Edit View Insert Format Tools Data Window Help amp x Oe 4Y amp amp 2 l 7 113 E fe 1 154 2 1 3 1 1 1 1234 1111 4 2 1 1 1002 1 5 3 1 1002 1 1 B 4 1 2128 4563 453 5 1 2546 4563 4565 B B 1 5452 7456 2314 9 7 1 4566 4564 4564 FIGURE 1 11 Imported RDSCM Data Change the network sizes to their appropriate values by double clicking the appropriate cells and save the data as described in the section entitled Preparing Data from Excel Figure 1 11 shows fictitious NHBS data exported from RDSCM v2 0 15 RDS INCORPORATED Chapter Loading Viewing and Editing Data in RDSAT his chapter covers how to load data into RDSAT Topics covered include loading RDSAT format files setting options for analysis and viewing editing the data Loading Data RDS Analysis Tool File Analyze Help Rds Data File f y open New RDS Analyze Partition f C Program Filesrdsatnyjazz txt Data Included Analyze Breakpoint Gender MF Change Options Race WBO ge Ort Recruitment Estimation Network Sizes and Homaphily Graphics and Histograms Respondent Driven Sampling Analysis Tool v 5 0 1 If you are new to Respondent Driven Sampling refer to the documentation included with this distributi
24. eived NOTE if the recruit is a seed then this number must be set to the missing data value Serial numbers of the coupons given to the recruit This data will take up the number of columns specified by the max number of coupons given to a recruit parameter specified on line two If the recruit was given a number of coupons less than that set some of the values to the missing data value For example below are the first 7 lines of the core data set for Doug Heckathorn s New York jazz musicians RDS 264 7 0 1 3500 14250004 14250005 14250006 14256002 901 0 0 200 14250007 14250008 14250009 14256003 902 0 0 3 585 0 14250010 14250011 14250012 14256004 903 0 0 4 4000 14250025 14250026 14250027 14256009 904 0 0 5 150 0 14250022 14250023 14250023 14256008 905 0 0 52 RDS INCORPORATED Appendix Appendix 2 RDSAT Questions amp Answers Are seeds included in the RDSAT analyses calculations Yes because recruitments by seeds are treated like any other recruitments and all recruitments in combination ate used to calculate the transition probabilities In contrast the self reported network sizes of seeds are not used to calculate network size estimates because seeds were not recruited by a peer they were recruited by key informants or in some other manner If a participant reports that the person who gave them a coupon is a stranger are they included in the RDSAT analysis If so what are the implications for the recruitment chain
25. f the data you wish to analyze is in an SPSS spreadsheet see Figure 1 5 you may convert it to the RDS format by copying and pasting the data into an excel spreadsheet First organize the columns so that the main data set appears in the standard RDSAT format namely Respondent ID Survey ID for NHBS IDU Self Reported Netwotk Size Coupon Received from Rectuiter Coupons given to Respondent C1 to C3 in Figure 1 5 and finally other variables you want to analyze like gender race age etc RDS INCORPORATED Note In this sample data set the variable label for Respondent or Survey ID is rid for the network size is net for the coupon received from the recruiter is coupon for the coupons given to respondents is C1 C4 For NHBS these variable labels may look differently when exported from RDSCM v2 0 or from the questionnaire data file Variable Variable label Data Source Survey ID SurveyID RDSCM v2 0 Network Size RDS INJ questionnaire data file Coupon received from recruiter Coupon submitted RDSCM v2 0 Coupons given to respondent Coupon given 0 RDSCM v2 0 cc ee 55 Coupon given 1 Coupon given 2 For NHBS the data will not include the network information RDS_INJ because it comes from the questionnaire data file and not the coupon manager data file In this case the network information must be taken from the questionnaire data file and merged into the coupon ma
26. f times random subsets of the data are sampled to derive the bootstrap confidence intervals More re sampling will result in better confidence intetvals but will be more CPU intensive Respondent A participant in an RDS sampling study Respondent ID A unique integer representing a respondent in a given RDS dataset Sample Population Proportions The naive estimates of population proportions without correction of over sampling and othet biases Sample Population Sizes The total number of recruits in each group Self Reported Network Size The number of individuals a respondent reports he or she has in his her netwotk Transition Probabilities Normalizes recruitments by dividing by the total number of recruitments and gives the probability of one group recruiting another Unadjusted Network Sizes A straight forward arithmetic mean of the sample s network sizes Waves Estimation This feature allows hypothetical recruitment scenarios to be examined The sample population proportions ate considered converged when the change in population proportions in between waves is less than the convergence radius times of the population proportions 49 RDS INCORPORATED Help and Support In addition to this manual you may also contact the RDS Coordinating Center for technical assistance with any RDS Inc product RDS staff will respond to all requests for assistance within 24 hours Email RDS CDC gmail com Phone 607 257 078
27. g frame depends on the aims of the study Can RDSCM allow the user to override an individually expired coupon Are there anticipated implications for RDSAT as it relates to the expiration dates of coupons an or referral cards RDSCM will allow a coupon s void status to be overridden in the following way 1 Increase the validation timeframe so that coupons won t be automatically voided 2 Change the voided coupons status to UNPAID the auto void will not trigger anymore due to step one 3 Enter the records for the individuals that arrive with the coupons 4 Finally return the validation timeframe to normal The coupons in question should be in a PENDING state so they will no longer be auto voided How does RDSAT account for missing data For example one of out sites lost 2 interviews handheld computer malfunction one from a seed and the other from a non seed respondent Currently RDSAT will not process the entire recruitment chain linked to a record with missing data How does RDSAT adjust for differential coupon distribution For an in depth look at the methods used in RDS analysis please consult Sampling and Estimation in Hidden Populations Using Respondent Driven Sampling The citation for this paper can be found in the references section of this manual Please also consult the References section for more RDS related literature 54
28. he Least Squares population estimates and can be set in the options panel click options in the main window 31 RDS INCORPORATED Network Sizes and Homophily This tab displays Homophily Affiliation and Average Network Sizes f Recruitment Estimation Network Sizes and Homophily Graphics and Histograms Adjusted Unadjusted Average Net Average Net Key of Group and Trait Correspondence FIGURE 4 4 RDSAT Single Variable Partition Analysis Network Sizes Tab Adjusted Average Network Sizes Netwotk sizes are adjusted for sampling bias In a chain referral sample those with more connections and larger personal network sizes tend to be over represented in the sample This can potentially bias sample estimates To learn more about the methods used refer to Sampling and Estimation in Hidden Populations Using Respondent Driven Sampling by Douglas Heckathorn and Mathew Salganik Unadjusted Network Sizes Straight forward arithmetic mean of the sample s network sizes Homophily A measure of preference for connections to one s own group Vaties between 1 completely heterophilous and 1 completely homophilous Affiliation Matrix Displays the same preference measures as homophily but for all group paits 32 RDS INCORPORATED Graphics and Histograms This tab displays visual illustrations of data presented in the previous sections of this chapter Homophily Homophily 1 0 This graph displays
29. ile to be exported to the recruitment chain data DL format is recognized by numerous network analysis packages including UCI net and Pajek Pajek in particular can be used to create attractive social network visualizations as seen in Figure 6 4 45 RDS INCORPORATED FIGURE 6 4 Pajek Generated Social Network Visualization UCINET http www analytictech com ucinet 5 description htm PAJEK http vlado fmf uni j si pub networks pajek 46 RDS INCORPORATED RDS Glossary of Terms Adjust Average Network Size Option In a chain referral sample those with more connections and larger personal network sizes tend to be over represented in the sample This RDSAT option corrects this bias Adjusted Average Network Sizes Netwotk sizes that are adjusted for sampling bias Affiliation Matrix Displays preference measures for connections between all group pairs The diagonal of this matrix is Homophily within a group Bootstrap Simulation Results Shows the histogram of Bootstrap estimates of Least Squares population proportions The horizontal axis depicts population estimates for the specified group The vertical axis shows the frequency of the Bootstrap estimate Breakpoint Analysis A Breakpoint analysis allows one trait to be analyzed over a range of possible breakpoints This 1s very useful for continuous variables such as age Complete Variable Analysis This option will find every distinct value in the data file as
30. it substantially more than do negatives This is shown in the recruitment matrix if the number of recruitments by HIV positives 1 e the row sum in the matrix exceeds the number of recruitments of HIV positives 1 e the column sum in the matrix The demographically adjusted recruitment matrix shows what the recruitment matrix would have looked like if all groups had recruited equally 1 e so row and column sums are equal without any change in recruitment patterns 1 e no change in transition probabilities This type of adjusted matrix is useful for testing one of the assumptions of the statistical theory on which RDS is based which holds that if recruitment effectiveness is uniform across groups cross group recruitments will tend to be equal Therefore the cross group fecruitments in the adjusted matrix will differ only by amounts consistent with stochastic variation Thus if positives recruit more than negatives then in the original recruitment matrix all else equal the number of negatives recruited by positives will tend to be greater than the number of positives recruited by negatives However in the demographically adjusted matrix these will be if not equal at least strongly correlated 28 RDS INCORPORATED Sample population sizes Reports the total number of recruits in each group Initial Recruits Reports the number of seeds from each group i e people recruited by the reseatcher in each group Note Much of
31. ment Estimation Network Sizes and Homaphily ns ae eters and Histograms Respondent Driven Sampling Analysis Tool v 5 0 1 ESOpen New RDS Analyze Partition If you are new to Respondent Driven Sampling refer to the documentation included with this distribution More help and resources are available on the web at http www respondentdrivensampling org FIGURE 2 2 RDSAT Edit Data Button 17 RDS INCORPORATED View the data loaded by clicking on the Edit Data Button or select View Edit RDS from the file menu A new window will pop up displaying the contents of the data files you have loaded see Figure 2 3 Sample size 264 the value for missing data 0 and the number of coupons per respondent 7 are displayed on the left The table columns may be rearranged by clicking and dragging them Click on Save RDS Data to save the data loaded into one file with an rds extension The next time this file is loaded all data including the core and trait data will load automatically Trait data is any vatiable that is not core data Core data consists of the respondent id network size and coupons Trait data can be Race Age etc Notice that when a cell in the table is clicked on its contents may be changed The changes will be saved to any data file created with the Save RDS Data button Note Be careful not to delete data unintentionally 14250004 14250007 14250010 14250025 114250022
32. nager data file before the data is exported RDS INCORPORATED NYJazz SPSS Data Editor fx File Edit View Data Transform Analyze Graphs Utilities Window Help ag e B 2 2 53 m a Er Extat s v al 1 airplay fi 0 14 e M ves gt 36 al 23 al Ds 42 00 54 x Data view Variable View Jr SPSS Processor is ready FIGURE 1 6 RDS Data highlighted in SPSS Highlight all relevant columns in the dataset To do this first click on the left most column header this should highlight the entire first column Next hold down the Shift key and press the right arrow key until all the desired fields have been highlighted see Figure 1 6 Finally either press Ctrl C on the keyboard or click Edit gt Copy on the menu screen to copy the data to the clipboard Paste this data into the third line of a blank excel spreadsheet see Figure 1 7 and add the relevant header information described in the previous section entitled Preparing Data from Excel 10 RDS INCORPORATED CE Microsoft Excel Book3 i File Edit View Insert Format Tools Data Window He Search ey Rows Match case T Find entire cells only FIGURE 1 8 Excel replace dialog window Note Tf there are missing data entries in the SPSS dataset they will be denoted by a period However RDSAT only accepts integers in the dataset Before saving to the Tab Delimited Text Format you must replace all
33. nd to group 1 Blacks to group 2 and all other races to group 3 you may use Race WBO RDS INCORPORATED RAData90 E nygender E LMSM_SF3 RADatas0 J NvEth7 2 LMsM sF2 RADataS E nyeth4 J LMsM sF1 RAData30 E NYEth E LMSM_CH9 J nydata J LMsM CH8 Nybreak E LMSM_CHS YAirU J LMSM CH4 E nyair LMsM CH2 nyunion nyage junk nyrace LMSM_SF9 CUYear2 4 nynet E LMSM sF8 E CUYear nyjazz4 S LMSM_SF7 J cmtest2 NYJazz E LMSM_SF6 E cmtest p nygroup fz LMSM SF5 E abu2 LO NEO Fle names fwaz O A Bse w iiu Save as type Text Tab delimited z Sarel FIGURE 1 4 Excel Save As Dialog N N To save this data set to a file choose Save As and choose the Text Tab Delimited format RDS INCORPORATED Preparing Data from SPSS fy NYJazz SPSS Data Editor File Edit View Data Transform Analyze Graphs Utilities cg B aja El 6 e HE Ela v el Vid o m loa met coupon ci c2 e 14250004 14250005 14250006 14250007 14250008 14250009 14250010 14250011 14250012 14250025 14250026 14250027 14250022 14250023 14250023 14250028 14250029 14250030 14250016 14250017 14250018 14250040 14250041 14250042 14256002 14250013 14250014 14250015 14250013 14250019 14250020 14250021 14250005 14250031 14250032 14250033 14250004 14250034 14250035 14250036 FIGURE 1 5 Sample RDS Data in SPSS I
34. ntation included with this distribution More help and resources are available on the web at http www respondentdrivensampling org FIGURE 3 3 RDSAT Analyze Breakpoint Button To analyze a breakpoint click on Analyze Breakpoint in the main window see Figure 3 3 A Breakpoint analysis can be done on any trait but it is more effective to use traits with many values such as age in the data set of New York jazz musicians The bound fields allow the range of values to be chosen over which the breakpoint will be set For example from the NYC Jazz dataset located in the RDSCM distribution folder see Chapter 2 for details age is selected from the drop down list The step size is set to 1 and 25 and 50 are entered for the lower and upper bound see Figure 3 4 This will perform a breakpoint analysis for groups above and below 25 then above and below 26 and so on 24 RDS INCORPORATED Breakpoint Analysis alo Trait to amp nalyze Age Lower Bound Upper Bound Analyze INN Step FIGURE 3 4 RDSAT Breakpoint Analysis Window In the above window we are selecting Age as the variable to be analyzed and choosing where the breakpoints will lie A Step of 5 with lower and upper bounds of 25 and 50 will break the dataset into the following 7 categoties e Recruits age 25 or under e Recruits 26 30 e Recruits 31 35 e Recruits 36 40 e Recruits 41 45 e Recruits 46 50 e
35. on More help and resources are available on the web at http www respondentdrivensampling org FIGURE 2 1 RDSAT Open New RDS Button First open the core data set The core data set contains information about the sample size missing data values and number of coupons per respondent Start the RDS Analysis Tool and choose Open New RDS or select the file menu and click on 16 RDS INCORPORATED New RDS see Figure 2 1 When a file chooser dialog window appears select the RDS data file and choose Open The nyjazz txt file included in this distribution is a good sample file to work with if no real dataset is available If the default installation directory was used this sample file will be located at C Program Files rdsat nyjazz txt For more information on the core data set refer to Appendix 1 Data pertaining to other population features of interest can also be included in this file Analysis cannot be carried out until this data is loaded Note The sample RDS data set of New York jazz musicians was collected by Douglas Heckathorn and analyzed in Finding the Beat Using Respondent Driven Sampling to Study Jazz Musicians Douglas D Heckathorn and Joan Jeffri Poetics 2000 Viewing Data MEROS Anatysis roo OR File Analyze Help Rds Data File C Program Filesrdsatinyjazz txt Data Included Analyze Breakpoint Gender MF Y EdiDia O Data Change Options Race WBO Recruit
36. orrespondence 28 Recruitment cnt 28 Transition probabilities sisas 28 Demographically adjusted Recruitment Matrix 28 Sample population sizes 3 2 a aaa 29 Initial Recruits 2 2 0 ccc ceececececceeececceecceececeueccecceecaecceeceeaeeceeeaeeseeeaees 29 ESUMAHOTN us dns ee Du D E hte eds 30 Estimated Population Proportions usus 30 1 Least Squares Population Proportions 30 2 Data Smoothed Population Proportions 30 Sample Population Proportions eeseseeeeeeess 31 Equilibrium Sample Distribution eeeseeseeeeeessssssss 31 Population We lOnisi sai adi 31 1 LLS Population WOIglils iie D DAVE 31 2 Data Smoothed Population Weights ssss 31 Confidence Intervals sas oet aaa 31 Network Sizes and Homophily ssssssss 32 Adjusted Average Network SiZ8S oocooccnccoccccccnccccccnnnnnnanncnnnnncnns 32 Unadjusted Network Sizes oocoononcccccccnccccccncnonancnnnccnnnnnnnnnnnannnnnos 32 Homopbiily na id 32 Affiliation A A O 32 Graphics and FStOGraM Sicilia 33 Transition ProbabilitieS oocoooccocnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnos 34 Degree acoso 35 Bootstrap Simulation Results oooooocccccccccnnncccccnccanoccccnnnnncnnnnnnnnnnnnos 35 Degree Distrib
37. point C Estimate Number Of Waves Required Re Analyze with Specified Missing Data Impute Missing Data and Re Analyze L IRace BO gt FIGURE 5 2 RDSAT Impute Missing Data and Re Analyze Help 41 RDS INCORPORATED Note These options only allow trait data for traits which have already been used to analyze a partition set imputed Like version 4 of RDSAT a partition analysis must always be completed before using the data set or impute features Also once enabled these features cannot be turned off within RDSAT To analyze a dataset without specified values or imputation of missing values close and re open RDSAT or reload the dataset via the Open New RDS Button 42 RDS INCORPORATED Chapter Extra RDSAT Features T he RDS Analysis Tool has several extra features that will be discussed in this chapter Estimate Number of Waves Required ADS RDS Analysis Tool Duc Help Rd Analvze Partition A Analyze Breakpoint C Estimate Number Of Waves Required Da Re Analyze with Specified Missing Data c Impute Missing Data and Re Analyze we FIGURE 6 1 RDSAT Estimate Number of Waves Required Menu Item abe This feature allows hypothetical recruitment scenarios to be examined A group is selected to be the initial recruiters and they are allowed to recruit based on their transition probabilities until the population proportions converge to the actual sample
38. pulation Proportions Population Proportions A Lower Group Upper Group Breakpoirt Next click on LLS Population Proportions on the Graphics page to find the breakpoint where the population of the upper group equals that of the lower group From this it can be inferred that half of the musicians are less than 43 years old Note that although the graph s x axis ranges from 0 to 25 we ate conducting a breakpoint analysis on groups age 25 to 50 Therefore the above intersection corresponds to an age of 43 18 25 not 18 39 RDS INCORPORATED Chapter Handling Missing Data in the Dataset ost data sets contain missing data RDSAT offers two ways of setting missing data and re analyzing them Both of these options will be covered in this chapter RDSAT employs two data imputation features The first makes it possible to reassign another value to missing data In this way respondents for whom data is missing can be included in the analysis to see if missing data is random or associated with other variables For example in an analysis of HIV prevalence respondents would be divided into three categories positive negative or missing One could then run analyses to see if having missing data was correlated with other terms such as race ethnicity The other data imputation procedure uses a regression like logic to assign values to respondents with missing data based on the estimate regarding what the missing value might be
39. s that follow In RDS studies recruitment rights are both scarce and valuable so respondents tend not to waste them on strangers so recruitment by strangers tends to be rare generally 1 to 3 A reasonable research strategy is to check to see if the respondents recruited by strangers differ significantly from other respondents and if not then to treat these as valid recruitments A maximally conservative research strategy would be to delete from the data set the serial number linking the recruit to the stranger rectuiter The recruit would then be treated as a seed and the stranger recruiter would become the terminus of a recruitment chain Neither respondent would be deleted from the data set but the number of peer recruitments would be reduced Are there any other essential variables we should be analyzing in RDSAT Other than gender race and age The variables to be analyzed depend on the research questions being addressed RDS is a sampling method a method for drawing statistically valid samples so its role is to help ensure that the answers are statistically valid 53 RDS INCORPORATED How does restricting recruitment to specific races affect the legitimacy of the survey and or RDSAT analysis This restriction of the sampling frame narrows the scope of the study e g limiting recruitment to Latino IDU would mean that the study would yield no information about non Latino IDU or Latina IDU How to best choose the samplin
40. sociated with a variable trait and create new groups based on that value Confidence Interval The value of this parameter determines the level of confidence for the confidence intervals reported in the analysis The default 05 measures the normalized length of a tail of the distribution of population proportions In short it determines 90 confidence for the intervals reported in the analysis Cut Outliers An RDSAT option that eliminates extremely small and large outliers in netwotk sizes from the dataset Data Smoothed Population Proportions Reports estimated population proportions for the Data Smoothed population equations Data Smoothed Population Weights Multiplicative factors by which the Data Smoothed Estimates are different from the naive estimates 47 RDS INCORPORATED Degree Distributions Distribution of network sizes for each group and for the population as a whole Degree List List of all network sizes reported in the sample The list is sorted from least to greatest fot easy view of the distribution Demographically adjusted Recruitment Matrix Gives hypothetical recruitments if each group recruited with equal effectiveness Transition probabilities implied by this matrix are identical to those of the original Recruitment Matrix DL Network File DL format is recognized by numerous network analysis packages including UCI net and Pajek Pajek in particular can be used to create attractive social network visu
41. t is recommended to keep box checked Number of Re samples This is the number of times the data is re sampled to derive the bootstrap confidence intervals For accurate confidence intervals keep this option at least the default value of 2500 For optimal accuracy a number over 15 000 is recommended Be aware 19 RDS INCORPORATED howevet that the bootstrap is demanding of CPU time There may be a short wait if this value is set to a high number Note For most NHBS analysis 2500 is recommended Confidence Interval The value of this parameter determines the level of confidence for the confidence intervals reported in the analysis The default 05 measures the normalized length of a tail of the distribution of population proportions In short it determines 90 confidence for the intervals reported in the analysis Cut Outliers With this option you may eliminate extremely small and large outliers in network sizes Check the box and input the desired minimum and maximum network sizes to be used in the analysis If this option is selected when the program encounters an individual whose netwotk size is outside of the specified bounds their network size will be set to the value of the nearest lower or upper bound To view the changes use the View Edit utility The changes enacted by the Cut Outliers option may then be saved to a data file Note Check for outliers by running a univariate frequency in SAS SPSS Excel before impor
42. the data reported above also have corresponding data smoothed estimates Data Smoothing is a method for eliminating deviations in cross group recruitments that occur due to chance For more information about data smoothing refer to Douglas D Heckathorn 2002 Respondent Driven Sampling II Deriving Valid Population Estimates from Chain Referral Samples of Hidden Populations Social Problems v 49 No 1 pages 11 34 29 RDS INCORPORATED Estimation Displays estimates of population proportions Key of Group and Trait Correspondence FIGURE 4 3 RDSAT Single Variable Partition Analysis Estimation Tab Estimated Population Proportions The estimated population proportion can either be calculated using the linear least squares algorithm or the data smoothing algorithm depending on how the options are set for the RDS analysis In the above diagram the data smoothing algorithm was used See the Algorithms section of Chapter 2 for more information on the difference between various estimation algorithms in RDSAT 1 Least Squares Population Proportions Reports the estimated population proportions of each group using linear least squares to solve the population equations 2 Data Smoothed Population Proportions Reports estimated population proportions for the Data Smoothed population equations 30 RDS INCORPORATED Sample Population Proportions Report the sample population proportions also called the naive estim
43. ting data to RDSAT Algorithm Type Three different algorithms are available for analyzing an RDSAT dataset Linear Least Squares LLS Data Smoothing and Enhanced Data Smoothing The recommended algorithm is Data Smoothing which adjusts recruitments across groups providing tighter Confidence Intervals than the naive LLS method Enhanced Data Smoothing assigns tiny non zero number to all cells in recruitment matrix then uses Data Smoothing This allows for an analysis to include non recruiting groups which would normally fail using LLS or Data Smoothing 20 RDS INCORPORATED Chapter Analyzing a Dataset his chapter introduces the analysis features of RDSAT This is the heart of the softwate s functionality Topics include Partition Analysis Breakpoint Analysis and Custom Analysis Partition Analysis When an RDS dataset is successfully loaded click on Analyze Partition in the upper right of the main window see Figure 3 1 By clicking on this button the window of Figure 3 2 will appear RDS Analysis Tool File Analyze Help Rds Data File i openNew RDS Analyze Partition C Program Filesrdsatinyjazz txt Data Included Add Data e D eakpoi Gender MF Race WBO Edit Data Change Options Recruitment Estimation Network Sizes and Homophily Graphics and Histograms Respondent Driven Sampling Analysis Tool v 5 0 1 If you are new to Respondent Driven Sampling
44. unger drug injectors Group Solidarity as the Product of Collective Action Creation of Solidarity in a Population of Injection Drug Users By Douglas D Heckathorn and Judith E Rosenstein Advances in Group Processes 2002 Development of a Theory of Collective Action From the Emergence of Norms to AIDS Prevention and the Analysis of Social Structure By Douglas D Heckathorn In New Directions in Sociological Theory Growth of Contemporary Theories Joseph Berger and Morris Zelditch editors Rowman and Littlefield 2002 O History of RDS and the research project from which it emerged Heckathorn Douglas D and Joan Jeffri 2003 Social Networks of Jazz Musicians pp 48 61 in Changing the Beat A Study of the Worklife of Jazz Musicians Volume III Respondent Driven Sampling Survey Results by the Research Center for Arts and Culture National Endowment for the Arts Research Division Report 43 Washington DC 2003 o Use of RDS fo study a non stigmatized hidden population jazz musicians Finding the Beat Using Respondent Driven Sampling to Study Jazz Musicians By Douglas D Heckathorn and Joan Jeffri Poetics 2001 O Useof RDS to study a non stigmatized hidden population jazz musicians Making Unbiased Estimates from Hidden Populations Using Respondent Driven Sampling By Matthew J Salganik and Douglas D Heckathorn Paper presented at the International Social Network Conference February 20
45. utiONS ocooononnnnnccccnnnnnnnncnnnnnncccnnccnnnnnnnnnnnnncnnnos 36 Interpreting a Breakpoint Analysis sss 37 Handling Missing Data in the Dataset 40 Re Analyze with Specified Missing Data 40 Impute Missing Data and Re Analyze 41 Extra RDSAT Features ess otc etie bna a tuor Sato caera 43 Estimate Number of Waves Required ssusse 43 Save RDS Analysis in the File menu 45 Export DL Network File in the File menu 45 RDS Glossary of Terms consisti nie 47 Ese O 50 RETErENCOS reno les 51 Appendix 1 The RDS Data File sssss 52 Appendix 2 RDSAT Questions 8 Answers 53 RDS INCORPORATED Chapter RDSAT 5 3 Basics his chapter will introduce the basics of the RDS Analysis Tool version 5 3 Topics covered include installing the Analysis Tool and preparing data from SPSS Excel SAS and the RDS Coupon Manager Installing the RDS Analysis Tool v5 3 The RDS Analysis Tool RDSAT is installed using a standard windows installer application First download the installer to a temporary folder from the following web address URL http www respondentdrivensampling org Click on Downloads and select the download that matches
46. your particular operating system and java configuration If you are unsute about your java configuration and are running windows choose Option 2 which includes the Java Virtual Machine JVM Once the file is downloaded double click the newly downloaded application RDSAT windows 5 3 exe The installer program will guide you through the installation process Default installation options are recommended and assumed throughout this manual To open the program double click the RDSAT icon or select it from the Programs listing in the Start Menu RDS INCORPORATED Basic Layout Information All RDSAT features are located in the right hand side of the main screen as buttons or in the menu bar See Figure 1 1 The current dataset being analyzed is displayed in the selection menu entitled Rds Data File When a dataset has been analyzed all graphs and figures can be found in the set of tabbed windows at the bottom of the main scteen RDS Analysis Tool O x File Analyze Help Rds Data File v open New RDS Data Included Change Options precrulmenta Estimation Network Sizes and Homophily Graphics and Histograms Respondent Driven Sampling Analysis Tool v 5 0 1 If you are new to Respondent Driven Sampling refer to the documentation included with this distribution More help and resources are available on the web at htto Avww respondentdrivensampling org Erik Volz amp mv
Download Pdf Manuals
Related Search
Related Contents
LG 49LF6300 48.5" Full HD Smart TV Wi-Fi Black LED TV Philips TCM350 MittelstandsMonitor 2008 Kopie - copie - Operator`s Manual Einbau- und Betriebsanleitung Lite-On 16x Half-Height DVD-ROM Untitled - Wissensnavigator User`s Manual of Laser Engraving Cutting Software Projecta Permscreen Deluxe 244 x 320 cm Copyright © All rights reserved.
Failed to retrieve file