Gender Discrimination in Job Ads: Evidence from China Peter Kuhna Kailing Shenb June 6, 2012 We study explicit gender discrimination in a population of ads on a Chinese internet job board. Gender-targeted job ads are commonplace, favor women as often as men, and are much less common in jobs requiring higher levels of skill. Employers’ relative preferences for female versus male workers, on the other hand, are more strongly related to the preferred age, height and beauty of the worker than to job skill levels. Relative gender preferences are also highly job-specific. Overall, the patterns suggest a model in which firms have heterogeneous preferences for particular job-gender matches, which are overridden in skilled positions by factors such as thinner labor markets or a greater incentive to search broadly for the most qualified candidate. ___________________________ a (corresponding author) Department of Economics, University of California, Santa Barbara, Santa Barbara, CA USA 93106, NBER and IZA, pjkuhn@econ.ucsb.edu; b Wang Yanan Institute for Studies in Economics, MOE Key Laboratory of Econometrics, and Fujian Key Laboratory of Statistical Sciences, Xiamen University, China 361005 and IZA. E-mail: klshen@xmu.edu.cn. The authors thank UC Santa Barbara’s Institute for Social, Behavioral and Economic Research (ISBER) for financial support. Kuhn thanks UC Berkeley’s Center for Labor Economics and the IZA for their generous hospitality during the research on this project. We thank Loren Brandt for providing us a copy of the 2005 Chinese Census microdata. Catherine Weinberger and participants at numerous seminars and conferences provided many helpful comments. 1. Introduction A fundamental problem facing empirical research on discrimination is the fact that discrimination is illegal in almost all the markets that are studied. Given market participants’ obvious incentives to keep discriminatory actions hidden, researchers have been forced to rely on a variety of indirect methods to infer whether discrimination has occurred. Yet explicitly discriminatory actions, such as posting a job ad which states that a particular race or gender is preferred, were commonplace in the United States as recently as 1960 (Darity and Mason 1998). Indeed, they continue to be both legal and accepted in many if not most of the world’s labor markets. In this paper, we study one of these labor markets –China’s—in order to establish a new set of facts about gender discrimination. Using data from a large sample of Internet job ads, we find that explicit advertised gender discrimination is commonplace. For example, during our twentyweek sample period, over one third of the firms that advertised on the job board --which caters to highly educated urban workers seeking private sector jobs-- placed at least one ad stipulating a preferred gender. We also find, perhaps surprisingly, that the share of ads favoring men versus women is roughly equal. Put another way, when it is legal to express gender preferences in job ads, a significant share of employers chooses to do so, and uses gendered ads to solicit women as often as men. One of the strongest and most robust relationships in our data is a negative ‘skilltargeting’ relationship: as a job’s skill requirements rise, the share of ads stipulating a preferred gender declines. This relationship holds whether we measure skill by the job’s education requirements, experience requirements, or its advertised wage, and is present in simple comparisons of means as well as regressions that control for detailed job attributes. While we do not find a robust relationship between skill levels and the direction of firms’ gender preferences, we do find that these preferences are highly correlated with firms’ requests for other ascriptive worker characteristics, specifically age, height and beauty. For example, firms’ requests for young, tall and attractive workers are highly predictive of their explicit requests for women, while requests for older workers predict requests for men. Finally, we find that firms’ revealed gender preferences are highly job specific, with many firms requesting men for some jobs and women for others. Indeed, while occupation is predictive of firms’ gender preferences, the 2 gendering of occupations is not very consistent across firms. In fact, over one third of the variation in gender preferences takes place within firm*occupation (“job”) cells. Putting these patterns together, we argue that they are not consistent with a model where advertised gender discrimination is driven primarily by firm-level preferences for one gender over another, nor with ‘glass ceiling’ models where firms’ primary motivation is to restrict women’s access to highly-skilled or well-paid jobs. Instead, Chinese firms mostly use targeted job ads to divide their pool of less skilled positions into ‘male’ versus ‘female’ jobs, while largely abandoning these distinctions at higher skill levels. To account for these facts, we offer a simple model of hiring from two pools of workers. In the model, firms have job-specific preferences for a worker’s type, in which factors such as beauty-related customer discrimination and social perceptions of gender-appropriate work can play important roles. These preferences must, however, be balanced against a desire to fill jobs with the best candidate. When this desire becomes more important (for example, when the pool of potential applicants is small or the job’s skill level is high), firms broaden their search to include the less-preferred pool, thus abstaining from type-targeted job ads. 2. Related Literature Legally, and according to most dictionary definitions, “discrimination” refers to taking an action, such as paying a different wage or choosing to hire a person, based not on that person’s individual merit but on his/her membership in a particular group.1 As already noted, when such acts are illegal, economists generally resort to indirect methods, such as searching for unexplained wage gaps, to infer whether they have occurred. Indeed, a large literature has used such indirect methods to study both whether (and where) employer discrimination occurs, and what motivates it (e.g. statistical versus taste motives). Recent reviews are available in Bertrand (2010), Fryer (2010), and Charles and Guryan (2011). In contrast, to our knowledge, previous studies of explicit discrimination in labor markets are rare. Darity and Mason (1998) are the only economists we are aware of who have examined discrimination in U.S. job ads; they reproduce examples of ads from 1960 newspapers, but do not conduct any statistical analysis. Goldin (1990, 2006) examines data from a Department of 1 For example, the dictionary.com definition is “to make a distinction in favor of or against a person or thing on the basis of the group, class, or category to which the person or thing belongs rather than according to actual merit”. 3 Labor Women’s Bureau survey of employers concerning their 1939 employment policies for office workers. In a sample of several hundred firms, she finds that a majority reserved some positions for men, and others for women. Lawler and Bae (1998) is the only article we know that studies discrimination in a recent sample of job ads. Their focus is on the impact of a multinational firm’s home country culture on its stated gender preferences in a sample of 902 ads placed in an English-language newspaper in Thailand.2 Aside from the above, the discrimination studies that are probably most closely related to ours are correspondence studies, which study employers’ reactions to identical resumes with randomly assigned race or gender. For example, Bertrand and Mullainathan (2004) submitted resumes to U.S. employers in response to newspaper job ads, with the respondent’s apparent race randomly assigned via race-specific first names. They found large differentials in callback rates for identical black and white resumes. Full audit studies, such as Neumark (1996), carry this procedure one step further and send matched, trained actors to interview for jobs in response to the callbacks. Given the lack of previous research on explicit discrimination, it is perhaps useful to contrast our ad-based approach to the more familiar audit or correspondence-based approach. Importantly, the two approaches measure distinct aspects of employer discrimination, with ambiguous implications about how much discrimination we should expect to observe. On the one hand, because ads are formulated before resumes arrive, ad-based measures do not condition on the information that appears in a worker’s resume; in this sense they measure an ex ante rather than an ex post decision to discriminate. Thus, for example, if an employer expects to receive lower quality resumes from the ‘disfavored’ group, firms might choose to engage in discriminatory job advertising even when audit-type studies (which compare identical resumes or candidates) would show no discrimination. On the other hand, since targeting an ad excludes an entire group of applicants from consideration, we expect (and show theoretically) that this decision will be reserved for cases when an employer’s ex ante preferences against (or prior beliefs about) a particular group are especially intense. Thus, when firms’ gender preferences 2 Banerjee et al. (2009) study caste and other preferences in a small sample of marriage ads in India. Barron, Bishop and Dunkelberg (1985) and Van Ours and Ridder (1991) study employers’ advertised hiring restrictions (in their case, minimum education requirements) using microdata on job ads. Both of these papers treat advertised requirements as exogenous vacancy characteristics, rather than as a choice variable for the employer. Altonji and Pierret (2001) study whether observerd wage patterns are consistent with employers’ use of race and education as hiring screens but do not study firms’ explicit screening policies. 4 are mild, we might not see gender-targeted ads, but could still see discrimination in an audit study. A third difference between ad-based and ‘traditional’ measures is that advertised discrimination necessarily involves a conscious decision by the employer to invite only one group to apply. In contrast, audit studies are designed to detect both the conscious choices and unconscious biases of employers.3 In sum, we should expect firms to post discriminatory ads when they consciously expect large differentials in productivity (or desirability) between two or more groups. These differentials include expected differences in the quality of resumes that would arrive, in addition to expected differences in worker quality given identical resumes. Taken together, these conditions differ from the conditions in which we would see discrimination in audit-type studies.4 In addition to the above conceptual differences, ad-based studies of discrimination have some methodological advantages relative to audit and correspondence studies. Among these, no fictitious resumes have to be designed (and no actors need to be trained). Thus, the comparability, realism and representativeness of the resumes and actors is not a concern. A second difference concerns cost and sample size: Since our data is collected costlessly by a web crawler, our sample consists of over one million observations, compared to about 5,000 in Bertrand-Mullainathan and much smaller numbers in most audit studies. This large sample size allows us to paint a statistical portrait of gender discrimination across an entire labor market, spanning a wide array of occupations, industries, provinces, and firm types. In contrast, audit and correspondence studies typically restrict their attention to a small number of narrowlydefined occupations and/or cities. 3. Data Our data is the universe of unique job advertisements posted on Zhaopin.com, the third largest online job board in China, during four observation periods: May 19 2008 - June 22 2008, 3 See Crosby et al. (1980) for a review of studies of unconscious bias. Price and Wolfers (2010) provide recent evidence of unconscious bias in a sports context; Greenwald et al (1998) propose a more direct measure (the Implicit Association Test). 4 It is perhaps worth noting that the mere presence of either ad-based or audit-based discrimination sheds no direct light on whether the employer’s motives for discrimination are taste-based or purely statistical. Distinguishing these sources of discrimination requires additional evidence on the patterns of discrimination that are detected, for example across employers facing different degrees of product market competition or between customer-contact and other occupations. We study these patterns later in the paper. 5 January 19 2009 - February 22 2009, May 18 2009 - June 21 2009, and January 18 2010 February 21 2010.5 Procedures for downloading the data and defining variables are discussed in Appendix 1. Descriptive statistics are provided in Table 1. All told, we study a total of 1,051,706 job ads, 41 percent of which require a bachelor’s degree or more, 46 percent of which require some post-secondary education, and 13 percent of which require a high school degree or less. Overall, just over one in ten ads on the site expressed a gender preference; this was evenly split between preferences for men and women.6 About 80 percent of ads required at least a year of experience. Explicit age requirements (which, like gender requirements, also violate some U.S. antidiscrimination laws), appear in 24 percent of all ads. While the mean requested age is quite young (30.6), ads specify a minimum age almost as often as they specify a maximum age. Of potential relevance to the design of equilibrium job search models (e.g. Rogerson et al. 2005), only about 16 percent of Chinese job ads contain information that could reasonably be construed as a posted wage.7 Among those ads, the mean wage was 4,279 RMB per month. Another piece of information that is often provided in Zhaopin job ads is the number of positions advertised: this is unspecified about half the time; when specified the modal number is one and the mean is 1.7. Just under 8 percent of the ads explicitly requested that the applicant be physically attractive “xingxiang”, and 2.6 percent stipulated a minimum height.8 Because job ads on Zhaopin are linked to the firm’s page on the site, and this page contains some standardized information, we can follow these links to gather measures of firm size and ownership. The average ad was from a firm with 1,565 employees, and an overwhelming share (92.6 percent) of the ads were placed by private-sector firms (both foreignand Chinese-owned). The remaining employers in our sample are State-Owned Enterprises 5 Note that firms frequently re-post the same ad; our sample includes each unique ad only once. Our choice of Zhaopin is largely for the technical reason that its site structure allowed us to easily and accurately identify such renewals. 6 Our measure includes all intensities of preference, though the most typical employer statements were either “female [male] preferred” and “female [male] only”. 7 Zhaopin.com prompts employers to enter a minimum and maximum wage. In most cases these fields are blank; in other cases, employers select both the lowest possible wage (1,000 RMB/month) and the highest (50,000 RMB/month) from the drop-down menu for a particular job. After eliminating all ads with a gap of more than 20,000 RMB between the minimum and maximum wage, along with other uninformative answers, we are left with 16 percent of all ads. The number reported is the midpoint between the advertised minimum and maximum. 8 Loosely, xianxiang translates as having a pleasing image, form or figure. 6 (SOEs).9 Fully 36 percent of all ads were placed by firms with some foreign ownership. (Our “foreign-owned” category includes Foreign Direct Investment (FDI), joint ventures, and a small number of representative offices). Finally, we note that Zhaopin.com categorizes ads into 39 occupation and 46 industry categories.10 Especially for the jobs on this site (which are disproportionately skilled, service jobs) these categories provide more detail than is available in social surveys or in available Census microdata. Together with the site’s indicators for the province where the job is located, these industry and occupation categories allow us to implement detailed controls for the type of job on offer. To assess how our sample of jobs ads compares to the overall population of jobs in the Chinese labor market, Appendix Table A1 compares the characteristics of Zhaopin ads to a representative sample of employed urban workers in China’s eight highest-income provinces, taken from the 2005 Census.11 These eight provinces --Beijing, Shanghai, Guangdong, Jiangsu, Shandong, Tianjing, Zhejiang and Liaoning—coincidentally have the eight highest GDPs per capita and are also the eight provinces with the most Zhaopin ads. The main result is that, compared to employed persons in the same set of provinces, Zhaopin ads are aimed at workers who are much younger, better educated, better paid, and who work in the private sector. For example, while about 28 percent of the workforce in these areas is over 40 years old, only three percent of Zhaopin ads stipulate a desired age over 40.12 While 77 percent of workers in the high-income provinces had high school education or less in 2005, only 11.4 percent of Zhaopin ads request this level of education, reflecting the job site’s focus on skilled occupations and on a highly-educated younger generation of workers. State-owned enterprises employed about 27 percent of workers, but only account for 7 percent of Zhaopin ads. More than three quarters of 9 We dropped a very small number of ads from non-profit organizations and for government jobs. In China, recruiting for government jobs takes place through a separate testing and recruitment system. 10 The most common occupation is sales, at 18 percent of the ads, with IT second at 11 percent. The top five industries were construction, consulting, IT, marketing, and trade, in order. 11 We expect the mix of Zhaopin ads to differ from a representative sample of jobs for several reasons. First, relative to a sample of occupied jobs, any sample of vacancies will overrepresent entry-level jobs, as well as jobs in expanding and high-turnover occupations and industries. In addition, the vacancies on an Internet job board likely require a significantly higher skill level than the median vacancy. Finally, relative to a flow sample of new vacancies, long vacancy spells will be overrepresented in a stock sample of vacancies such as ours; this would affect our estimates if there is parameter heterogeneity in the determinants of ad content that is correlated with vacancy durations. See Bergeron et al. (2008) for a recent discussion of the effects of length-biased sampling. 12 The working population in these cities is very young by U.S. standards for at least two reasons. First, the Census data include migrants from rural areas and from other cities, who account for 32.5% of the employed urban population and have an average age of 29.8. Second, China has a number of policies that strongly encourage early retirement. 7 workers in these provinces earned under 1500 yuan per month, while only 14.5 percent of the wages posted on Zhaopin ads fall below this level.13 In sum, even within the high-income provinces that are disproportionately served by Zhaopin, it is clear that Zhaopin.com –while hosting a wide variety of jobs and imposing no restrictions on who can use the site-- disproportionately caters to a very special slice of the Chinese labor market: young, highlyeducated workers looking for well-paid jobs in the private sector.14 To shed additional light on the relation between Zhaopin ads and the overall Chinese workforce, Appendix Table A1 also compares the gender mix of the workforce across various categories with the gender mix of Zhaopin ads. Overall, the key message of these comparisons is that conditional on stating a gender preference, Zhaopin employers are more likely to request women in the occupations, industries, age categories, education levels, and wage levels where women are also more likely to work. Combined with our anecdotal impressions of Chinese hiring practices, this pattern suggests that voluntary affirmative action --i.e. a practice of deliberately seeking to increase hiring of the minority gender in a given occupation or job—is not a widespread motivation for the gendered job ads in our data. 4. When do Employers Gender-Discriminate in Job Ads? The Stylized Facts A first, overall look at the types of jobs for which firms specify they want male or female employees is provided in Table 2. This table reports three fractions --the share of job ads requesting women, those requesting men, and those not indicating a gender preference—for samples of job ads differentiated by indicators of skill demands, by other requested ascriptive characteristics, and by firm characteristics. Panel A of the Table focuses on the correlation between job skill requirements and advertised gender preferences. It shows that the propensity to gender-target a job ad is strongly related to the job’s skill level: only about 6 percent of jobs requiring a university education 13 In terms of industry and occupation, the IT/ communication and R&D/consulting industries are highly overrepresented on Zhaopin, while manufacturing but also health/education/welfare are underrepresented. Professional and technical workers are highly overrepresented on Zhaopin, while production and construction workers are underrepresented. See Appendix 1 for additional details. 14 iResearch Inc, (2010) estimates that 11.2 percent of Chinese firms covered by the State Administration for Industry and Commerce posted ads on at least one Internet Job Board in 2010. Anecdotal evidence from firms posting on these sites suggests that their main alternative to using an Internet job board is on-campus recruiting. At lower skill levels, both newspaper ads and local government employment offices can also play important roles in recruiting. Note, however, that none of these recruiting methods are mutually exclusive; many ads appearing on sites like Zhaopin may also be posted in these other venues. 8 specify a preferred gender, while 23 percent of jobs requiring high school or less are explicitly gendered in this way. Strikingly, the share of ads requesting men, and the share requesting women, both decline precipitously with education requirements. For the subsample of ads that post a wage, exactly the same pattern is visible: the share of ads requesting men and the share requesting women both decline precipitously with the advertised wage. For example, 27 percent of jobs paying under 1500 yuan/month specify a preferred gender, compared with 7 percent of jobs offering 8000 yuan or more. Together, these patterns are inconsistent with simple glassceiling models where the primary purpose of advertised restrictions is to keep women out of highly-skilled or well-paid jobs.15 Finally, Panel A shows that the amount of gender targeting also falls with a job’s experience requirements, though here the decline occurs only among ads requesting women. The lack of a decline for men is related to a tendency for firms to prefer men when they are seeking to hire older workers, documented in Panel B below. Together, we refer to the decline in gender-targeting of job ads with all three measures of job skill requirements in our data as the skill-targeting relationship. Panel B of Table 2 shows the relationship between advertised gender discrimination and firms’ advertised preferences for three other ascriptive worker characteristics: age, beauty and height. One immediate conclusion is that advertised preferences for all these other attributes are complementary with those for gender, in the sense that these ascriptive screens tend to be used together. Thus, only 5.6 percent of ads with no age restrictions are explicitly gendered, compared with 28 percent of ads stipulating a specific age range. 9.1 percent of ads that do not request beauty are gendered, compared with 27.7 percent of ads that do request beauty. And 9.3 percent of ads that have no height requirement are gendered, compared with 56.2 percent of ads that do list a minimum height. In sum, gender preferences are much more likely to appear in job ads when the firm also requests workers of a specific age, height, or workers who are good looking. A second clear pattern in panel B occurs in the subsample of ads that specifies both a maximum and minimum age, so an “ideal” age (the midpoint of this range) can be calculated. Here, the patterns are starkly opposite for men and women: as the desired age rises, firms 15 That said, note that the share of targeted ads directed at women does fall with education, experience and the wage. Unlike our overall result on the incidence of targeting per se, this general ‘tilt’ towards men as skill requirements rise does not survive controls for other observable ad characteristics. See Table 7. Panel A also shows that gender targeting is about 3 percentage points higher among ads that post a wage than among ads that do not. In part, this reflects the lower average skill level of the posted-wage jobs. 9 become much more likely to look for men --the share of ads requesting men triples as the desired age rises from under 25 to 35+--, and become much less likely to be seeking women: The share of ads requesting women increases eight-fold, from 4 to 33 percent, as the desired age falls from 35+ to under 25. This suggests that, in contrast to skill, which has relatively weak and inconsistent effects on the direction of firms’ gender preferences, age plays an important role in firms’ preferences for men relative to women for a job. Finally, panel C of Table 2 shows that there is no strong relationship between a firm’s size and its propensity to gender target its job ads. Thus, to the extent that having many employees is associated with having formal personnel and HRM policies, our data show no obvious incompatibilities between having such policies and engaging in explicit hiring discrimination. That said, foreign-owned firms behave differently from Chinese-owned firms, engaging in less advertised gender discrimination than either private Chinese-owned firms or State-Owned Enterprises. While the differences between SOEs and private Chinese firms are not dramatic, SOEs are somewhat less likely than privately-owned Chinese firms to request women in the hiring process. Table 3 explores the striking association between firms’ requests for gender, beauty, height and age in more detail by looking at some intersections between these characteristics. Specifically, recall that 5.0 percent of ads explicitly request women (Table 1), and that this rises to 23.9 percent if we look only at ads requesting physically attractive applicants (row 1 of Table 2). According to Table 3, this number rises to 55.9 percent among ads requesting applicants who are tall and good looking, and to 87.1 percent if the applicant is also required to be under 25 years of age. These dramatic changes in conditional probabilities illustrate the high predictive power of demands for other ascriptive characteristics in ‘explaining’ firms’ requests for women. At the same time, it is worth noting that --since only 7.7 percent of ads request beauty and 2.6 percent of ads request height-- this particular constellation of employer desires can only account for only a minority of firms’ advertised gender requests.16 Thus, a more general theory is still needed to understand the wider patterns of gender discrimination in our data. 16 For example, of the 5.01 percent of ads that request women, 1.85 percent, or 37 percent, also request beauty. Also, the robust skill-targeting relationship and other key patterns remain strong even when we exclude all ads requesting an age range, beauty or height from our data. 10 Yet another perspective on the frequency of discriminatory job ads in China emerges when we organize our data by firms rather than ads. Overall, 73,642 distinct firms placed ads on Zhaopin during our sampling period; thus the typical firm placed 1,051,706 / 73,642 = 14.3 ads. Characteristics of these firms’ hiring policies are summarized in Table 4. According to Table 4, 20 percent of the firms in our data placed at least one ad that specifically invited men to apply; for women this number was 25.8 percent. For obvious reasons, these shares rise with the number of ads the firm placed on Zhaopin during our sample period. Thus, for example, among firms that placed more than 50 ads, over 70 percent expressed a gender preference at least once, and 39 percent placed both male-only and female-only ads during our sample period. This suggests, perhaps surprisingly, that a substantial share of the variation in firms’ advertised gender preferences in our data might occur within, rather than between firms. Additional detail on the role of firms, occupations, and their intersection (“job cells”) in accounting for patterns of explicit hiring discrimination is provided in Table 5, which presents a decomposition of the variance in our three main indicators of discrimination. Our approach follows Groshen’s (1991) decomposition of wages into occupation- and firm-specific components. Accordingly, row 4 of Table 5 reports the R2 from a regression of advertised discrimination on a full set of firm and occupation fixed effects. Row 1 calculates the minimum contribution of occupation effects to that regression as the reduction in R2 when the occupation effects are removed from it. Row 2 performs the analogous exercise for firm effects, while row 3 equals row 4 – (row 1 + row 2). In sum, row 4 reports the total variance that can be explained by occupation and firm effects, and rows 1-3 in order partition this variance into components that can be unequivocally assigned to occupation effects, to firm effects, and a portion that cannot be unequivocally assigned to either of these two factors. Next, row 6 of Table 5 presents the R2 from a regression of advertised discrimination on a full set of occupation*firm interactions. Thus, row 6 shows the total variance that can be explained by these “job cells”, and row 7 (which is just one minus row 6) is the within-job-cell variance. Finally, row 5 is the difference in R2 between the regression with occupation*firm interactions (row 6) and the regression with occupation and firm effects (row 4). It is a measure of the extent to which firms ‘gender’ their occupations differently; for example it would equal zero in column 1 if the tendency for sales jobs to be ‘male’ was the same in all firms, and similarly for all other occupations. 11 The decomposition results in Table 5 are highly consistent across our three indicators of hiring discrimination. For all three measures, the 39 occupation categories in our data, while highly statistically significant as a group, explain only a small share (3 percent or less) of the variance in explicit discrimination. Thus, a simple occupational segregation model, in which differences in occupation mix explain why some firms explicitly search for women and others search for men, is not very powerful.17 Firm effects consistently explain between 28 and 32 percent of the variation in advertised discrimination, but the total explained by firms and occupations together is never more than 38 percent. Thus, while firm effects are important, a majority of the variation in advertised discrimination occurs within firms. For that reason, as already noted, simple firm-level animus model also cannot account well for the patterns in our data. Recalling that Row 5 of the Table measures the extent to which different firms gender occupations differently, we see that this tendency explains a significant share –about one third-- of the total variance in advertised discrimination, or about one half of the total explained (between-cell) variation. Thus, it appears that occupations are not very consistently gendered across firms. To the extent that our occupation categories measure the type of work performed in a job, this suggests that the nature of the work performed may not be a powerful predictor of whether firms seek men or women for a job.18 Finally, according to row 6, a full set of occupation*firm interactions explains about two thirds of the variation in advertised discrimination.19 While these factors are quite powerful, it follows that one third of the variance in discrimination occurs within job cells, i.e. because the same firm, at different times during our 20-week observation window, sometimes (say) requests men for sales jobs and sometimes does not. In other words, even after we let every firm gender the occupations in a different way, individual firms do not appear to assign genders to occupations very consistently over time, even within our short observation window. One 17 Somewhat more formally, note that according to rows 1 and 4, removing the occupation effects from the column 1 regressions reduces the R2 very little, from .334 to .334-.017 = .311. Thus, almost none of the between-firm differences in gender preferences can be explained by the fact that firms hire different mixes of occupations. 18 This inconsistent gendering of occupations, combined with firms’ frequent reliance on gendered job ads, is however consistent with a scenario in which firms strive to maintain gender homogeneity in detailed job categories, perhaps for reasons related to worker identities (Goldin 2006, Akerlof and Kranton 2008). 19 We replicated Table 5’s decomposition with alternative restrictions on minimum job cell sizes. If we restrict attention to job cells containing at least five ads, the share of variance that is within cells rises to about 50 percent. It remains stable at that level for much larger minimum cell sizes (at least up to 25). 12 interpretation of this pattern is, of course, that individual firms’ occupation-specific gender preferences are highly fluid over time. Alternatively, firms might consistently gender jobs at a finer level than our occupation indicators capture, while the mix of jobs firms are trying to fill changes from month to month. The latter interpretation is more consistent with Bielby and Baron’s (1984) striking evidence of highly detailed gender segregation in U.S. firms. In sum, Tables 1 through 6 have illustrated five key features of advertised gender discrimination in China. First, employers on this website (which caters to highly-qualified young Chinese workers) use gender as an ex ante hiring screen quite frequently. Second, explicitly gendered job ads favor women as often as men. Third, job ads are much less likely to be gendered when the job requires more education and/or experience, and when the job offers a high wage. Fourth, gender, as an ex ante hiring screen, tends to be used in conjunction with requirements for other ascriptive characteristics. In particular, firms that are looking for older workers are much more likely to be requesting male applicants. But if a firm is looking for someone who is tall, good-looking and under 25, it will almost certainly also request a woman in the ad. Fifth, a substantial majority of the variation in advertised gender discrimination in our data takes place within firms. About a third of the variation occurs because different firms ‘gender’ the occupations in our data differently, while another third occurs within firm*occupation cells over time. Since it is unlikely that the type of work associated with a firm*occupation cell changes rapidly over time, it is difficult to make a case that the nature of the work (or training) associated with a job plays a major role in explaining why firms request men for some jobs and women for others. 5. A Model of Discrimination in Job Ads As we have noted, a negative skill-targeting relationship is a prominent and robust feature of our data. In this section we present a simple model of job advertising that is consistent with this pattern, and which helps us think more formally about the conditions under which advertised discrimination is likely to occur. While there has been an active theoretical literature on labor market search (see Rogerson et al. 2005 for a review), to our knowledge the only aspects of job ad content that have been endogenized in this literature are the level of the posted wage (e.g. Mortensen and Pissaridies 1994), and whether to post a wage (Michelacci and Suarez 2006, 13 Menzio 2007). As far as we know, no one has yet studied firms’ choice of how broadly to advertise (i.e. which types of workers, whether differentiated by education, experience, age, or sex, to invite to apply). Since at least some restrictions of this type are a universal feature of job ads, our model may be useful in contexts beyond the study of explicit gender discrimination. To maximize transparency and intuition, we take a simple, partial equilibrium approach. a) The model for a single job ad Consider a firm soliciting applications for a single vacant position; applications can come from two groups, labeled M and F; M and F also denote the number of applications that would arrive from each group, if invited.20 Let the net value to the firm of an individual applicant, j, in a job with ‘standard’ skill requirements be (1) j G j vU  , ),( FMG , where the εj represent independent draws from a distribution with cdf F(εj). A worker’s net value in a job with skill requirement θ is assumed to be jU . Importantly, we think of the gender difference in baseline net value, vM - vF , as including not only between-group differences in revenue productivity in the job, but also differences in employer tastes, and in expected wage costs between the groups.21 Bearing this in mind we shall often use employers’ “preferences towards men” or “tastes for men” as shorthand for vM - vF . Simple, closed-form solutions are available when F(εj) has a type-I extreme value distribution, with F(εj) = exp(-exp(-εj/β)). It follows that Var(εj) = β2 π2 /6, and E(εj) = βγ, where γ is Euler’s constant (≈.577). The firm is assumed to choose the individual worker with the highest total value, Uj, from its pool of applicants. Its only nontrivial choice is which groups (M, F, or both) to invite to apply; this choice takes into account that it costs the firm a constant amount, c, to process each application that arrives, thereby learning its εj. Formally, the firm chooses DM and DF to maximize: (2) Π ≡ Emax(Uj; DM , DF ) - DM cM - DF cF, 20 In this section, we assume for simplicity that the number of applicants of a given type that arrive is the same, regardless of whether the ad is targeted to that type or not. We extend the analysis to allow workers to direct their search in response to targeted ads in Section 7. 21 We also acknowledge that what matters for the model is employers’ perceptions of gender differences in expected productivity, which may or may not coincide with actual differences. 14 where DM (DF ) is a (0,1) indicator for inviting men (women) to apply, and Emax(Uj; DM , DF ) gives the expected value of the maximum Uj drawn from the sample of applicants defined by DM and DF . Lemma 1. (a) The expected value of the highest Uj in a sample of size G drawn from a single group, M or F, in a job with standard skill requirements, is: (3) UG * = µG + βlog(G), ),( FMG , where µG ≡ vG + βγ is the expected net value of a single applicant from group G. (b) The expected value of the highest Uj drawn from the combined sample of all applicants in a job with standard skill requirements, is: (4) CU FM C logexp)1(explog*                           where C=M+F and δ = M/(M+F). Proof: See Appendix 2. To keep the notation simple, we focus on the case of an equal number of applications from each group, i.e. δ=.5.22 In that case, (4) can be written: (5) NU MF MC log(exp1log*                   , where N=.5C is the (common) number of applications expected from each group. We next define z ≡ (μM - μF )/β as the standardized gap in expected net value between the groups.23 Overall, the firm’s optimal recruiting policy is then described by: Proposition 1. The firm’s optimal recruiting policy is to: Solicit men only if z > z*, Solicit women only if z < -z* Post no advertised restrictions if -z* ≤ z ≤ z* 22 Results for unequal numbers are presented in an earlier version of this paper (Kuhn and Shen, 2009); nothing of importance changes. 23 Note that   /6 , where  is the standard deviation of net value. 15 where:  1)/exp(ln*  cNz > 0 if )]2ln(,0[/ cN (“high frictions” case), and z* = 0 if cN/θβ > ln(2) (“low frictions” case). Proof: See Appendix 2. Proposition 1 shows that the factors influencing a firm’s optimal recruiting strategy fall naturally into two categories: Unsurprisingly, factors that raise the index z, which is the expected productivity, cost and taste advantage of men for the job, raise the ‘likelihood’ that firms will invite only men to apply, and reduce the likelihood that firms will invite only women.24 As shorthand, we will refer to these factors as relative preferences, or preferences towards men. The remaining parameters in the model, c, N, θ, and β, operate only on the thresholds, z* and –z*. We refer to these factors as search-related. According to the Proposition, the effects of these two types of factors on the decision to gender-target ads vary between two regions of the parameter space. When cN/θβ > ln(2), the firm’s optimal policy is to invite only men when z is positive, and to only invite women when z is negative. Search-related factors have no effect (other than to determine whether one is in this regime). In other words, all jobs are sex-segregated in the sense that no firm invites applications from both groups. We refer to this as the “low frictions” case because it occurs when applications are plentiful; in this case our model essentially specializes to Becker’s (1957) model of discrimination in competitive labor markets.25 In contrast, targeted ads are sometimes the optimal policy in the high-frictions case, i.e. when (cN/θβ ≤ ln(2)). Here, the firm’s optimal policy is to invite women only when z is low, men only when z is high, and to accept applications from both groups for intermediate values of z. Of greater interest, the model has clear predictions for how these thresholds for searching broadly versus narrowly depend on parameter values. Specifically, the four search-related 24 We formalize this likelihood later in this Section, where we posit a distribution of expected net values across jobs. 25 To see this, consider a labor market with many employers, indexed by i, who can hire either men or women. All employers in this market face the same wages, wM and wF , but firms’ relative tastes for hiring men (tM – tF ) and possibly the expected gender productivity gap, qM - qF , can vary across firms. Thus, a firm’s baseline gender gap in total net value, vM - vF , can be decomposed into the following components: vi M – vi F = (qi M – qi F ) + (ti M – ti F ) - (wM – w F ). If we assume (as Becker does) that men and women are equally productive in these jobs, then in Becker’s frictionless world and in our ‘low frictions’ case, firms where (ti M – ti F ) > (wM – w F ) will hire only men, and firms where (ti M – ti F ) > (wM – w F ) will hire only women. The market wage differential ensures that enough firms of either type will exist to employ all the men and women. 16 factors (c, N, θ, and β) either move the thresholds -z* and z* closer together, making it more likely that firms will engage in gender restrictions of either type, or farther apart, with the opposite effect. 26 In more detail, the model predicts that: 1. Increases in β (i.e. greater idiosyncratic variance of applicant productivity) raise the likelihood that firms will search broadly, inviting both groups to apply. 2. Increases in θ (the job’s skill level) also raise the likelihood that firms will search broadly. 3. Increases in c (per-application processing costs) raise the likelihood that firms will search narrowly, inviting only their preferred group to apply. 4. Increases in N (the expected number of applicants) also raise the likelihood that firms will search narrowly. The intuition for these results is that higher variance in applicant quality raises the option value of searching from a larger pool (i.e. it raises the chance the best candidate will come from the group with the lower expected value), and higher skill levels raise the marginal value to the firm of identifying the best candidate. Thus, both these factors are predicted to reduce the incidence of gender-targeting. In contrast, increases in c and N directly raise the total cost of inviting the ‘disfavored’ group to apply, thereby reducing the incidence of gender-targeting. b) The model for a population of job ads In order to generate predictions for how the share of ads that invite (say) men to apply varies with ads’ observable characteristics, we need to specify how those observable characteristics map into the model’s parameters, and to introduce a source of unobserved heterogeneity across identical job ads. To that end, we now restrict attention to the high-frictions case, where both targeted and untargeted ads can be optimal.27 We index ads by i (recall that applicants are indexed by j) and suppose that the employer’s net relative valuation of men in the position described in ad i be given by (6) zi = xi b + νi 26 Note that the likelihood of gender targeting increases as the lower threshold, -z*, rises towards zero. Accordingly, we use –z* as shorthand for the propensity to gender-target the job ad at various points in the paper. 27 The conditions under which ads are not targeted within the high-frictions regime are qualitatively the same as the conditions for the high-frictions regime to apply, i.e. low values of cN/θβ. 17 where xi includes all the observable determinants of firms’ preferences towards men (and away from women) for that job, plus a constant term. According to Proposition 1, an ad will then be targeted at men if νi > zi* - xi b, targeted at women if νi < - zi* - xi b, and will not contain any gender restrictions otherwise. Suppose further that νi is independently and normally distributed across job ads with mean zero and variance σv 2 . The likelihood of observing each of the three ad types can then be written: Prob(restrict ad to women) ≡ PF =         v iz  bxi* (7) Prob(no gender restrictions) ≡ PC =                v i v i zz  bxbx ii ** Prob(restrict ad to men) ≡ PM =         v iz  bxi* 1 , where Φ is the standard normal cdf. If zi* and σv are constant across observations, (7) describes an ordered probit model. An important feature of our model, however, is that the ad’s observed characteristics – including indicators of its skill requirements as well as observable correlates of application processing costs, expected numbers of applications, and idiosyncratic worker quality—are expected to act on the two thresholds. Specifically, high skill requirements are predicted to move the two thresholds apart by equal amounts; this is reflected in (7) by the fact that the thresholds equal -z* and z* respectively. To incorporate this effect, let: (8) zi* = exp(-xid). which implicitly assumes that any variable that might affect firms’ relative valuation of men versus women (zi) can at least potentially affect zi* as well. Taken together, (7) and (8) comprise a simple model that can be estimated via maximum likelihood. This model allows all observable characteristics of a job or firm, including the job’s skill requirements, to affect the firm’s relative preference for men versus women in that job (z), and at the same time to affect the gap between the firm’s two thresholds, i.e. level of z*. Importantly, the effects of any given observable on z* are identified even if we believe that 18 characteristic also affects a firm’s ‘tastes towards men’ (z).28 This allows us to test the model’s prediction concerning skill requirements without needing to assume, for example, that firms’ tastes towards men are independent of the job’s skill level.29 Unfortunately, while we have estimated the model in (7) and (8) by maximum likelihood with a rich set of covariates, the model is computationally intractable in the presence of large numbers of fixed effects.30 This is a serious limitation since our main goal in the following section is to see whether the effects of jobs’ skill requirements on the thresholds for engaging in advertised discrimination survive detailed controls for other job characteristics –including occupation, industry and firm fixed effects-- that might affect both z and z*. Fortunately, Appendix 3 shows that under conditions that are approximately satisfied in our data, both b and d can be estimated (up to a factor of proportionality) by separate OLS regressions. Specifically, d can be estimated by an OLS regression of PM + PF on x, where PG is a dichotomous indicator for whether the ad states a preference for gender G. In other words, we just regress a dummy for the presence of gender-targeting on the covariates. Under the same conditions, b is identified by an OLS regression of PM - PF (an outcome which takes the values -1, 0 and 1) on the same covariates. 6. Regression Estimates of Gender-Targeting and Firms’ Preferences towards Men This section presents regression estimates of the determinants of gender-targeting in job ads (-z*), and of firms’ preferences in the direction of men (z), using the above framework. The goals are (a) to see whether the strong, negative skill-targeting relationship in the raw data survives detailed controls for other ad characteristics, and (b) to estimate the effects of covariates on firms’ underlying preferences in the direction of men in a framework that allows all the observables to affect both z and –z*. The hope is that the revealed patterns in these underlying 28 Intuitively, this is because we have data on two distinct outcomes (PM and PF ). Essentially, d is identified by the effects of x on their sum (PM + PF ) while b is identified by x’s effects on their difference (PM - PF ). 29 It is of course also possible that σν varies across ads; for example, we could have σν = exp(xig). When PM and PF are both less than .5 --as is the case at the mean of our data—it is easy to show that an increase in σν has the same qualitative effects as an increase in -z*: PM and PF both increase. This makes it difficult to separately identify b and g. Accordingly, we treat σν as fixed in this Section, while acknowledging that the estimated effects of parameters on -z* could also represent effects of the same parameters on σν. The consequences of allowing σν to depend on the covariates are explored in Section 7. 30 Results and Stata do-files are available from the authors on request. 19 preferences will be informative about firms’ motivations for preferring different genders for different jobs. Accordingly, Table 6 presents linear probability model estimates where the dependent variable, PM + PF equals one if the ad is gender-targeted (regardless of direction) and zero otherwise. Moving across columns (1) - (3) from left to right, we add increasingly detailed fixed effects, first for occupation, industry and province separately (116 categories), next for these three interacted (22,581 categories), and finally for a full set of occupation*firm interactions (258,751 fixed effects in total).31 Of special interest are the estimated effects of the indicators of job skill requirements –education and experience—on the propensity to gender-target ads. As predicted by the theory, all the effects are negative. The estimated effects are highly stable across specifications, highly statistically significant, and large in magnitude: for example, jobs requiring university education are 8 to 10 percentage points less likely to be gender targeted than jobs requiring high school or less.32 Jobs requiring 3-5 years of experience are about 3 percentage points less likely to be gender-targeted than jobs with no explicit experience requirement. It is perhaps worth noting how the estimates of skill requirements in column 3 are identified: essentially, we are comparing two or more ads for the same occupation (say, sales), posted on Zhaopin by the same firm, at two different times during our sampling period, requesting different levels of education or experience. The data show that when a given firm is trying to fill, say, two sales jobs requiring different levels of education, it is significantly less choosy about applicant’s gender when filling the position requiring more education. Clearly, the striking correlations between skill and gender-targeting observed in Table 2 are not an artifact of differences in the mix of occupations and firms that require high versus low educational qualifications. Column 4 of Table 6 probes the association between a job’s skill requirements and firms’ propensity to gender-discriminate one step further, by asking if jobs with higher posted wages discriminate more or less. Since, as mentioned, only a minority of the ads on the site posted a meaningful wage, the sample size for this exercise is much smaller (172,887 versus 1,051,706 in 31 All regressions also control for log firm size, the number of positions advertised, period effects, and whether the job is part-time. 32 To put this in context, recall from Table 2 that the mean share of gender-targeted ads for jobs requiring high school or less is 11.3 + 12.0 = 23.3 percent. 20 the full sample), and the results should be viewed more cautiously.33 Still, continuing to use column 3’s most tightly-controlled specification, we find a robust, negative relationship between this third indicator of the job’s skill level and firms’ tendency to discriminate in either direction in their job ads.34 In the specifications without firm fixed effects, Table 4 also shows the effects of firm ownership on the propensity to gender-target in the presence of detailed controls for occupation, industry, province and their interactions. While State-Owned Enterprises engage in about the same amount of gender targeting as private-sector Chinese companies, foreign-owned firms are much less likely to gender-target their job ads than both these groups. This may reflect cultural differences, or the extraterritorial effects of antidiscrimination laws in these firms’ home countries. Shifting our attention from -z* (the propensity to gender-target) to z (firms’ relative preferences for men), Table 7 presents results of OLS regressions where the dependent variable is PM - PF . The specifications are identical to Table 6. In sharp contrast to Table 6, a job’s education requirements have mostly weak and insignificant effects on whether employers prefer men over women for it. The only exception is when controls for offered wages are introduced in column 4: here, men seem to be dispreferred for high-education jobs when the offered wage is held constant. At the same time, however, Chinese employers’ preference for men relative to women increases strongly with a job’s experience requirements; this tendency is robust to controls for offered wages. Finally, the offered wage has a positive but statistically insignificant effect on firms’ preferences towards men. In sum, Table 7 shows that –with the exception of experience-- the effects of a job’s skill demands on the direction of firms’ gender preferences are typically weaker, and inconsistent across regression specifications and skill measures than the effects of skill demands on gender targeting per se. Further, as we show later in this section, the strong effect of experience is probably not a skill effect, but an artifact of how firms’ gender 33 Consistent with other studies (e.g. Brencic 2012), jobs requiring higher skills are less likely to post a wage: in our sample, 10.7 percent of jobs requiring university education post a wage, compared to 18.8 percent of postsecondary jobs and 26.4 percent of jobs requiring high school or less. This suggests that higher-wage jobs are less likely to appear in the sample with posted wages. Strictly speaking, sample selection on wage levels is not a source of bias in a linear model with wages on the right-hand side, but it is of course possible that the size of the wage effect differs across the skill distribution. Accordingly, throughout the paper we present regression estimates with and without wage controls; in almost all cases the results are very similar. 34 Column 4 also shows that differences in offered wages across education groups account for a substantial share of the negative effect of education on gender targeting. This is not surprising since offered wages are probably a more precise measure of skill demands, especially from the employer’s point of view. 21 preferences vary with the worker’s age. Finally we note that --relative to private-sector Chineseowned firms-- foreign-owned firms’ preferences lean towards women, while SOEs lean towards men. Given that SOEs typically face less product-market competition than privately owned firms, this finding is consistent with an employer taste-based motivation for at least this aspect of advertised discrimination against women in our data.35 For additional clues regarding when firms gender-target their ads, and when they prefer men to women, Figure 1 shows occupation fixed effects from regressions identical to column 1 in Tables 6 and 7 with one exception: education was removed from the list of controls, in order to illustrate its effects in the Figure. Occupations in the Figure are divided into two groups, based on our a priori impression of whether they involve a significant amount of customer contact. The six customer-contact occupations, indicated by triangles, are sales, customer service, hospitality/tourism/entertainment (“tourism”), editing/media/film/news (“media”), retail, and “healthcare/beauty/fitness” (“health”). Symbol sizes are proportional to the precision of the estimated fixed effect, and a regression line (estimated with these weights) and 95% confidence band are shown. Part (a) of Figure 1 shows the estimated fixed effects on the propensity to gender-target (PM + PF ) for the 39 occupations in our data. As predicted by our model, ads for the least-skilled occupational group (labor and domestic service) are almost 30 percentage points more likely to stipulate a preferred gender than in the reference occupation (accounting). The two most positive outliers are administration and tourism. Part (b) of Figure 1 shows the estimated fixed effects on firms’ relative preferences toward men (PM - PF ). Here, part (a)’s strong negative association is replaced by essentially a zero overall relation with education. Large positive outliers are manual labor, technical occupations, and communications/logistics; large negative outliers (indicating a preference towards women that cannot be accounted for by observable features of the firm, industry, or ad) are tourism, retail, health occupations, and administration. The first three of these are customer-contact occupations; the fourth refers mostly to secretarial jobs. We suspect that customer discrimination plays a role in the former cases, and that 35 See Black and Strachan (2001) and Black and Brainerd (2004) for other evidence of the effects of product market competition on gender discrimination. Our findings regarding gender and SOEs are consistent with Zhang, Han, Liu and Zhao (2008), who find that the share of the unadjusted gender wage gap that is not accounted for by observable productivity-related characteristics in China is smaller in market-oriented activities than state-owned ones. For other recent studies of gender differentials in China, see Gustafsson and Li (2000) and Liu, Meng and Zhang (2000). 22 managers’ tastes might account for the latter. Additional evidence supporting this interpretation is provided in the next section. 7. Robustness Checks, Alternative Explanations, and Extensions a) Gender and the demand for other ascriptive characteristics. Table 3 has already suggested a strong interaction between employers’ preferences for gender and for other ascriptive worker characteristics, namely workers’ age, height and physical attractiveness. In this subsection we explore these interactions further in two main ways. First, we estimate regressions for determinants of firms’ advertised requests for other ascriptive worker characteristics –age, beauty, and height— that are specified identically to Tables 6 and 7. These regressions show us whether advertised preferences for these other characteristics –which we think of as jointly determined with advertised gender preferences—respond to observables in similar ways to advertised gender preferences.36 Second, we add controls for these ascriptive characteristics to the regressions for gender discrimination, in order to glean additional descriptive information about how these screens interact. Table 8 reports OLS regression results with the same specifications as columns (3) and (4) of Tables 6 and 7. Because age restrictions, like gender restrictions, are ‘bilateral’ in our data, columns (1)-(4) of Table 8 adopt a similar strategy of distinguishing firms’ propensity to restrict their recruitment to specific worker ages (-z*) from their preferences in the direction of older workers (z). The former effects are estimated by regressions in which the dependent variable equals one if the ad specifies an age range. The latter uses the midpoint of that range, when it is specified, as the dependent variable. Since ads for beauty and height only go in one direction --we encountered no ads requesting unattactive or short people-- we simply present linear probability models for the presence of these restrictions; z and -z* cannot be separately identified here and it is important to take note of this in interpreting the estimates. 36 Theoretically, it is straightforward to extend our model to the case of multiple screens: rather than choosing whether or not to target its preferred group (say women), in the two dimensional case the firm would choose the best of four options: to request (for example) well-educated women, to specify only one of these attributes, or to post an unrestricted ad. Since the main effect of search-related factors in the model is to change the relative returns to searching narrowly (i.e. take a smaller sample with a higher mean) versus broadly, we expect that most of the model’s results will continue to hold in an appropriately redefined form in a model with multidimensional screening. 23 Table 8 clearly shows that, like gender-targeting, the incidence of age-targeting declines strongly with jobs’ education requirements. For example, even in the most saturated specification (column 2), jobs requiring a university degree are 4.9 percentage points less likely to specify an age range than jobs requiring high school or less. On the other hand, neither experience requirements nor the offered wage have a robust effect on the incidence of agetargeting. Firms’ relative valuation of older workers, however, rises with experience requirements and the offered wage; this is perhaps not surprising since experience and wages tend to grow with age. Firms’ tendencies to request beauty and height decline with education requirements, experience requirements, and the offered wage, but as noted these estimates cannot distinguish whether this is due to a reduction in the value placed on those attributes (z) or a reduction in the tendency to use them as screens (-z*).37 In which occupations do firms look for old, young, beautiful, or tall workers? Figure 2 presents occupation fixed effects for the outcomes in Table 8, in the same format as Figure 1. Perhaps of greatest interest, some of the occupations with the highest revealed preferences for female workers in Figure 1 –specifically tourism, retail, and administration—also have high unexplained propensities to request youth, beauty and height. Again, this suggests that customer and supervisor tastes for interacting with attractive women play a significant role in firm’s explicit requests for female workers, at least in several key occupations. Figure 2 also shows that the negative skill-targeting relationship we saw for gender discrimination also applies to explicit age discrimination. Perhaps unsurprisingly, it also shows that jobs requiring manual labor are 30 percentage points more likely to have a height requirement than the reference occupation (accountants). In Table 9, we add controls for age, height and beauty requirements to Table 6 and 7’s regressions for gender discrimination, to see more directly how gender interacts with ex ante screening on these other dimensions. Columns 1 and 2 add these controls to the gender targeting regressions in columns 3 and 4 of Table 6; columns 3 and 4 do the same for the “preferences towards men” (PM - PF ) regressions in Table 7. A first thing to note from Table 9 is that the negative skill-targeting relationship for gender discrimination survives the inclusion of controls 37 In specifications without firm fixed effects (not shown; identical to column 2 in Tables 6 and 7), we find that foreign-owned firms are much less likely to age-target their ads than Chinese-owned firms and SOEs. Incidentally, both SOEs and foreign-owned firms are less likely to request beauty, height and a specific age range than privatesector Chinese-owned firms. 24 for screens on other ascriptive characteristics.38 Thus, the skill-targeting effect is not just a consequence of the tendency for firms to seek tall, good-looking or young women in less-skilled positions. Second, as was noted in Table 2’s descriptive statistics, screens for all of these ascriptive characteristics are complements with gender screening: gender screens are more likely to be used when the ad specifies an age range, and when it contains beauty and height requirements. Third, the positive effect of experience requirements on firms’ preferences towards men noted in Table 7 falls in magnitude and loses most of its statistical significance when controls for the preferred worker age are added to the regression: firms’ tendencies to prefer men when they are looking for experienced workers seem to stem more from a correlation between firms’ age and gender preferences than from an experience effect per se. Finally, even within firm*occupation cells and controlling for all three measures of job skill requirements (education, experience and the wage), firms tend to strongly favor women over men when they are looking for young, tall and good looking workers. In sum, firms’ preferences for a specific package of ascriptive characteristics including youth, beauty and height play an important role in explaining when firms seek to hire women, even within firm*occupation cells. At the same time, the gender-targeting relationship that permeates our data is clearly more than an artifact of firms’ ‘search for female beauty’ in less-skilled jobs. b) Alternative explanations of the skill-targeting relationship So far, we have interpreted the negative skill-targeting relationship as supportive of our model’s prediction that a job’s skill demands (θ) should reduce advertised discrimination. But despite surviving detailed controls for the type of work that is performed and the type of firm in which it is done, this pattern could still reflect other factors that covary with jobs’ skill demands which we cannot control directly. We consider the possible impact of such confounding factors in this section, beginning with four factors -- c, N, σε, and σν—that our model predicts should also affect firms’ decisions to gender-target their job ads. 38 In the subsample with posted wages, the education coefficient is still negative but no longer statistically significant in the presence of a wage control. 25 First, it seems plausible that the cost of assessing the suitability of an individual applicant (c) rises with a job’s skill level.39 But this cannot help explain why gender-targeting falls with skill, since according to Proposition 1 increases in c should make advertised discrimination more likely. Intuitively, targeting reduces the number of applications that arrive, which is more useful to the firm when applications are costly to process. On the other hand, it also seems likely that ads for skilled jobs, on average, attract fewer applicants (N) than ads for less skilled jobs: skilled labor markets may be thinner because skilled workers are more specialized.40 A related possibility is that there is, on average, more idiosyncratic variation in the qualifications of applicants to skilled positions than among applicants to unskilled positions (i.e. σε, or β is higher). If so, then both of these correlates of skill (smaller applicant pools and greater applicant heterogeneity) might help account for the strong negative relationship between skill and discrimination in our data. Internal data from job boards like Zhaopin would be useful in assessing the importance of the former effect; the latter may be more difficult to test directly.41 A final possibility suggested by our model is that the idiosyncratic variance across jobs in the relative ability of men and women to perform them, σν, is lower in a sample of skilled jobs than a sample of unskilled jobs. To see this, refer to equation 7, and suppose now that (like z and z*) σν can also depend on the covariates, for example according to σν = exp(xig). Suppose also that a minority of ads are targeted at either gender, i.e. PM < .5 and PF < .5 (therefore –zi*- xib < 0 and –zi* + xib > 0) at some initial set of parameter and data values, which is clearly the case at the mean of our data. In this case, a small decrease in σv has the same qualitative effect as an increase in zi*: reducing the share of ads targeted at men and the share targeted at women. Thus it is possible that what appears to be a direct ‘price’ effect of higher skill requirements is instead a consequence of the fact that the dispersion, across jobs, of jobs’ ‘gender-suitability’ falls with skill levels. 39 For evidence supporting this notion, see Table 1 in Barron and Bishop (1985). In their employer survey, the total person-hours spent by company personnel recruiting, screening, and interviewing applicants to hire one individual ranged from 7.08 for blue collar workers to 16.99 for managerial personnel. 40 Barron and Bishop’s evidence on this point is not clear, however. The mean number of applicants per job in their survey was essentially same for blue collar versus managerial jobs (7.98 versus 8.08). The highest number of applicants per job was 10.82, for clerical jobs. While we do not have information on the number of applicants per job in our data, we do note that, because Zhaopin.com tends to serve a skilled workforce, its online markets for highly skilled workers –measured by the number of ads—are actually thicker than for less-skilled workers. 41 A direct test would require a scalar measure of the variance of all worker qualifications that are visible to the employer in the hiring process, which could be challenging to construct. 26 Empirically, why might σν might fall with skill? One possibility relates to the fact that skilled jobs do not typically require manual labor. Thus, if there are larger gender gaps in humans’ abilities to perform physical tasks than mental tasks, there will be greater cross-job variance in the gender-suitability of unskilled than skilled jobs. While this seems plausible, we note that the share of jobs in our sample that involve any physical labor is quite small. Furthermore, excluding all of these occupations from our estimation sample has essentially no effect on our main results, including the negative skill-targeting relationship.42 A second reason why σν might fall with job skill requirements relates to the tastes of hiring agents, applicants, or the applicants’ prospective co-workers: it is possible that all these groups care less about the applicant’s gender at higher skill levels.43 While this seems plausible, a number of patterns in our data suggest that it may not be the main cause of the skill-targeting effect. For example, if higher skill requirements operate by changing the marginal financial return to identifying the best candidate, higher skill should affect not only firms’ decisions to discriminate on the basis of gender, but their tendency to use other ascriptive screens as well. We have already shown this in Table 8. Explaining the same pattern with tastes requires skilled hiring agents and co-workers to also care less about age, beauty and height than other hiring agents and co-workers, which is certainly possible but requires additional assumptions. Also, note that all three of our measures of skill (education, experience and the wage) have the same qualitative effect on gender-targeting, and the strongest of these when all are included in the same regression is the one that is most directly related to a pure ‘price’ effect: the offered wage. Relatedly, the effect of experience on gender-targeting seems especially hard to explain using tastes: if jobs requiring more experience tend to have older hiring agents or coworkers, a parallel tastes story would require these older persons to have less strict notions of what is a ‘proper’ gender role for a job than young people. This seems unlikely, given the decline in gender-stereotyping across cohorts in most societies. A final piece of evidence on the distinction between σv and θ as mechanisms via which skill reduces gender-targeting is based on the fact that with the right variation in x, the effects of 42 The only occupations in our data that seem likely to involve any physical labor are construction, manufacturing and “manual labor”; together they constitute less than 11 percent of our sample. In contrast, sales, IT, marketing , accounting, and administration together account for almost half of the ads. 43 Note that this hypothesis requires a specific pattern of tastes; it is not sufficient, for example, for more-educated hiring agents (or co-workers) to be less biased against women. Instead, what is needed is that as we move down the skill ladder, some agents’ tastes need to become more intense in favor of women, while other agents’ tastes become more intense in favor of men. 27 observables on z, z* and on σv are all separately identified by a simple sign test, even in the absence of exclusion restrictions. To see this, return to the discussion of equation (7) and suppose that, for some subsample of our data, more than half the ads are targeted at one of the genders; for example PM < .5 and PF > .5. Now, according to (7) a small decrease in σv should still reduce PM , but should raise PF . More to the point, in a model with a constant σv, the effects of any given covariate on PM and PF should have the same sign regardless of whether PM or PF exceeds .5. If, instead, the effects of a covariate on, say, PF , switch sign depending on whether PF is above or below .5, that covariate must have an effect on σv.44 To explore this distinction, Table 10 presents some simple tabulations for two subsets of ads: one in which men are highly favored, the other in which women are highly favored. “Highly male” jobs are defined as in ‘technical’ occupations (the occupation with the strongest preference towards men in Figure 1b), where the maximum requested age is 25 years or higher. Highly female jobs are defined as jobs that request beauty and stipulate a maximum age under 25. When the job’s education requirement was high school or less, more than half of the ads for these types of jobs explicitly requested men and women respectively, which allows us to ask what happens to firms’ advertised preferences as we increase the desired skill level, starting from a situation where more than half the ads are targeted to men (women). Given the above discussion, if the effects of higher skill requirements in these highly gendered jobs on PM and PF qualitatively mirror their effects in the sample as a whole (which was to reduce both PM and PF ), this is consistent with our baseline model, where skill requirements operate only through z and z*. If, instead, the sign patterns are different in these highly-gendered jobs, it appears that skill requirements must also affect σv. The results of Table 10 clearly favor the “price effect” hypothesis over the preferenceheterogeneity explanation of the skill-targeting relationship: as skill requirements in these jobs rise from high school to university, the share of highly male jobs that request men falls from 64 to 51 percent, and the share of highly female jobs that request women declines from 82 to 52 percent. Both these differences are highly statistically significant. While these highly gendered jobs are clearly special, they provide an additional piece of evidence suggesting that a lower cross-job variance of gender-suitability in skilled versus unskilled jobs is not the main source of the negative skill-targeting relationship in our data. 44 Note that this property holds not just for a normal distribution of v, but for any distribution with a median of zero. 28 Stepping outside our model, we can think of one final alternative explanation of the negative skill-targeting relationship: What if highly skilled jobs are not gender-targeted simply because it is already common knowledge that skilled jobs are reserved for men? Given Table 2’s Census statistics on the male and female wage distributions, this strikes us as unlikely: While women are certainly under-represented among high-wage workers, 25 percent of workers earning over 8000 yuan/month [the top 0.3 percent of the wage distribution] are female. And 31.5 percent of the gendered Zhaopin job ads offering more than 8000 yuan/month request women. In short, women are sufficiently scattered across China’s job distribution that gendered ads are likely to be informative about what firms want in most jobs, even at high skill levels. In sum, our discussion of the possible effects of other model parameters suggests that at least two unmeasured factors associated with jobs’ skill requirements --specifically, the thickness of labor markets and differences in the variance of applicant quality-- might (along with the direct ‘price’ effect of higher skill demands) help explain the robust decline in gender-targeting with skill in our data. In contrast, unobserved variation in application processing costs cannot account for this relationship. A final unobserved factor, the cross-job variance of jobs’ gendersuitability, might also be at work but some available evidence points against it. Additional research to distinguish these mechanisms would be of considerable interest. c) Extensions This section explores two extensions to our analysis—one theoretical and the other empirical. The theoretical extension adds a type of directed search by workers to our model.45 Specifically, note that our baseline model assumes that the number of (say) male applications received by a firm that specifically invites men to apply is the same as the number of male applications generated by an ungendered ad (both equal N). But what if φN men respond to an ad directed specifically at men, where φ ]2,1[ measures the supply response of a group when an ad is targeted at them? Workers might respond favorably to targeted ads for a number of reasons; for example they might believe they have a better chance of being hired if they are the type of worker the firm is looking for. 45 Models of directed worker search are quite common (e.g. Moen 1997), and sometimes incorporate discrimination by firms (Lang, Manove and Dickens 2005). To our knowledge, however, our paper is one of the first attempts to formalize the idea of directed recruiting by firms. 29 In Appendix 4, we solve this expanded model for the optimal z*, which now depends on c, N, θ, β and φ. As expected, a greater expected supply response from targeting (i.e. a higher φ) raises the likelihood of gender targeting. In addition, the expanded model’s qualitative predictions for the effects of c, N, θ, β are all the same as our baseline model; this includes the key prediction that higher skill requirements should reduce the incidence of targeting. Aside from greater generality, the expanded model has two attractive features which enhance its realism. One of these is the fact that, in contrast to the baseline model, positive application processing costs are no longer essential to the explanation of targeted ads: such ads can be optimal even when processing costs are zero because they attract more applicants of the type the firm likes best. While available evidence suggests that such costs are both real and important, it is an open question whether they are important enough to explain all the ad targeting that occurs.46 The second attractive feature of the extended model is that its predictions are less sensitive to small changes in either (a) the level of stigma attached to posting a targeted ad, or (b) what might be called the firm’s “pre-screening” technology. To see this, imagine there is some stigma attached to posting a gendered job application, and that it is relatively easy for firms to ‘pre-screen’ resumes on the basis of easily-observed demographic indicators and simply discard applications belonging to the group with the lower expected value. (Pre-screening only reveals basic demographics and is a distinct process from actually assessing the applicant’s ε, which still costs c per applicant). Then even for fairly small levels of social stigma, we should observe no targeted ads in the baseline model. This is not true in the expanded model: if targeted ads induce a large-enough supply response of the worker types firms prefer, firms may continue to target even in the face of substantial stigma associated with posting a targeted ad. While all the available evidence suggests there is no stigma attached to posting a gendered job at in China, this reduced sensitivity to small changes in stigma is still an attractive feature of the expanded model. Once again, we note that all of the expanded model’s qualitative predictions for the effects of parameter changes are the same as the baseline model’s. 46 See for example the experimental evidence on search costs cited in Rubinstein and Salant (2006). In a previous version of this paper we conducted some calibration exercises with the baseline model to ascertain whether ads are likely to be gender-targeted for ‘reasonable’ values of the parameters: c, N, θ, and β. While targeting is possible for reasonable values of c and N, it requires either a high expected productivity gap between the two groups, or a low variance of idiosyncratic applicant ability. 30 The other extension described in this section is an empirical exercise. Specifically, we ask what our Zhaopin results –which are based on a highly specialized segment of the Chinese labor market—imply for the likely incidence of explicit gender discrimination in the Chinese labor market as a whole. To this end, Table 11 returns to the high-income provinces sample introduced in Table 2, and uses regressions similar to those in Tables 6 and 7 to predict the share of jobs targeted at men, and the share targeted at women. We generate predictions for both the Zhaopin sample, and a sample with the average characteristics of the employed population in those provinces. Since there are essentially no public sector ads nor any ads for workers over 50 on Zhaopin, the estimates apply to private sector workers under 50 only. Row 1 of Table 11 uses only the three education categories (high school or less, some college, and university degree), which are available for all observations in both data sets, to predict gender-targeting. Overall, 8.8 percent of Zhaopin ads in the high-income provinces were gender-targeted (a little less than the 10.5 percent in the full Zhaopin sample), with 3.8 percent preferring women and 5.0 percent preferring men. According to Row 1, these shares would rise to 8.8 and 10.3 percent respectively for a sample of job ads that had the same education distribution as the entire employed population in those provinces. Thus, the rate of gender targeting would more than double, from 8.8 to 19.1 percent of all ads. Row 2, which adds controls for industry, occupation, and firm type (private versus SOE) yields broadly similar results, with gender-targeting now rising to 20.8 percent. Rows 3 and 4 of Table 11 alternatively add controls for the 5 wage quintiles or the 3 age groups under 50 in Table 2. Recall that offered wages and desired ages are each only present in a minority of Zhaopin ads; thus these two rows are estimated on considerably smaller samples than Rows 1 and 2 (which use the full sample). Still, the results are suggestive: a population of ads that also reflects the wage and age distributions of the employed population in China’s highincome provinces would be 21.7 percent and 40.5 percent gender-targeted. To assess the plausibility of these predictions, we compare them to an independent source of information on gender-targeting of job ads in China. Specifically, China’s Ministry of Human Resources and Social Security (MOHRSS) collects quarterly data on the number of 31 vacancies listed on the public employment services of a sample of large cities.47 In the last quarter of 2011, the survey reported 5.78 million vacancies, of which 2.11 million (36.5%) were described as ‘male’, 1.85 million (32.0%) were female, and the remaining 1.82 million (31.5%) were not gendered. These government-reported vacancy statistics are thus even more gendertargeted than our estimates based on Zhaopin microdata, most likely reflecting the fact that we can only control for a small share of the observable ad characteristics at the same time; this limits our ability to extrapolate to a different mix of ads from what we see on Zhaopin.48 In sum, both our analysis of job board data and Chinese official vacancy statistics suggest that explicitly gendered job ads are commonplace in the broader Chinese labor market, but only our job board microdata allows us to examine the detailed patterns of gender-targeting and assess them in light of a simple but useful model. The MOHRSS data also dramatically illustrate the lack of social stigma associated with posting an explicitly gendered job ad in China. 8. Conclusions In a legal environment where firms are allowed to engage in explicit gender discrimination when advertising their jobs, when will they choose to do so? Our data show that firms in such an environment will use the option to discriminate much more often when hiring for positions requiring lower levels of skill, whether skill is measured by education requirements, experience requirements, or the offered wage. In this paper we propose a simple model of employer search from two populations that might help explain this pattern. The intuition is straightforward: as skill requirements rise, it becomes increasingly important for firms to identify the best individual candidate for the job. Our model also suggests that other factors which co-vary with skill --such as thinner labor markets or greater idiosyncratic variance in applicant productivity at high skill levels-- may also help account for this skill-targeting effect. Sorting out these channels seems a useful avenue for further research. Regardless of the exact mechanisms behind the skill-targeting effect, our results suggest that skill-upgrading may have a powerful, negative effect on the amount of explicit 47 Aggregate data from the vacancy survey are available online from CEIC Data (2012). The survey covers approximately 100 cities, with the number fluctuating slightly from year to year. For additional details on the MOHRSS data, see Ministry of Human Resources and Social Security of the Peoples’ Republic of China (2012). 48 Recall also that the MOHRSS data refer to cities from all over China, in contrast to the Zhaopin data which overrepresent the high-income provinces. 32 discrimination that is observed in labor markets. Indeed, the declining value to firms of the option to issue discriminatory ads as skills rise might even help explain why these ads have been successfully banned in most developed nations, but not elsewhere. In a sense, our results for the effects of skill complement Becker’s (1957) notion that enhanced product market competition reduces discrimination. At the same time, our results for the effects of labor market competition are, perhaps, unexpected: In our model, it is in thin labor markets (such as those for specialized, skilled workers) that firms will choose to abandon coarse demographic screens, thereby extending their recruiting to groups they expect to be less suitable for the job on average. Abstracting from firms’ decisions whether to gender-target their job ads, we have also provided a number of results concerning firms’ underlying preferences for men, relative to women in a job. In contrast to its effects on the frequency of gender-targeting, we find that a job’s skill level does not have a robust effect on firms’ preferences for men relative to women. That said, the direction of firms’ gender preferences is strongly related to firms’ visions for the appropriate age, height and beauty of the applicant. Firms tend to prefer women when they are looking for young, tall and attractive workers, especially in a small number of customer contact occupations and in the market for secretarial services. On the other hand, firms tend to prefer men when they are searching for older workers; it would be interesting to know more about why this occurs. Finally, the occupations in our data are not gendered very consistently across firms; more than half of the variation in estimated gender preferences takes place within firms; and about a third occurs within occupation*firm cells over time. This pronounced, within-firm heterogeneity in the gendering of unskilled jobs is consistent with a number of models, including models where firms (or co-workers) value gender homogeneity in detailed job categories per se, and models where beliefs about gender-appropriate roles are more broadly based, but apply to much finer occupational categories than we can measure in our data. Additional research to distinguish among these scenarios would be of great interest as well. 33 References Akerlof, George A; and Rachel E. Kranton. “Identity, Supervision, and Work Groups” American Economic Review, vol. 98, no. 2, May 2008, pp. 212-17 Altonji, J. and C. Pierret. “Employer Learning and Statistical Discrimination” Quarterly Journal of Economics, Vol. 116, No. 1. (Feb., 2001), pp. 313-350 Angrist, Joshua D., and Jörn-Steffen Pischke. Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton: Princeton University Press, 2009. Arcidiacono, Peter and Robert A. Miller. “CCP Estimation of Dynamic Discrete Choice Models with Unobserved Heterogeneity” Unpublished paper, Carnegie Mellon University, February 20, 2008 Banerjee, Abhijit, Esther Duflo, Maitreesh Ghatak, and Jeanne Lafortune. “Marry for What: Caste and Mate Selection in Modern India” NBER working paper no. 14958, May 2009. Barron, John M., and John Bishop. “Extensive Search, Intensive Search, and Hiring Costs: New Evidence on Employer Hiring Activity”. Economic Inquiry Vol. XXIII, July 1985 363-382. Barron, John M., John Bishop, and William C. Dunkelberg. “Employer Search: The Interviewing and Hiring of New Employees”. Review of Economics and Statistics 67 (1) (Feb. 1985): 43-52. Becker, Gary S. The Economics of Discrimination. 2nd ed. Chicago: U. of Chicago Press. [1957] 1971. Bergeron, Pierre-Jerome, Masoud Asgharian, and David B. Wolfson. “Covariate Bias Induced by LengthBiased Sampling of Failure Times” Journal of the American Statistical Association Vol. 103 (482). p 737-42. (June 2008). Bertrand, Marianne and Sendhil Mullainathan. "Are Emily And Greg More Employable Than Lakisha And Jamal? A Field Experiment On Labor Market Discrimination," American Economic Review 94(4) (September 2004): 991-1013. Bertrand, Marianne. “New Perspectives on Gender” in Orley Ashenfelter and David Card, eds. Handbook of Labor Economics volume 4B. Amsterdam: North-Holland (2010) pp. 1543-1590. Bielby, W. T., and J. N. Baron. (1984) “A woman’s place is with other women: Sex segregation within organizations”, in B. F. Reskin, ed. Sex segregation in the workplace: Trends, explanations, remedies. Washington, D. C.: National Academy Press. Black, Sandra E. and Elizabeth Brainerd. “Importing Equality? The Impact of Globalization on Gender Discrimination” Industrial and Labor Relations Review. July 2004; 57(4): 540-59 Black, S. and P. Strachan. “The Division of Spoils: Rent-Sharing and Discrimination in a Regulated Industry”. American Economic Review 91(4) (September 2001): 814-831. Brencic, Vera. “Wage posting: Evidence from job ads”. Canadian Journal of Economics, forthcoming 2012. CEIC Data (2012) China Premium Database, available at http://www.ceicdata.com/China.html. 34 Charles, Kerwin Kofi, and Jonathan Guryan. “Studying Discrimination: Fundamental Challenges and Recent Progress”. Annual Review of Economics, vol. 3, no. 1, 2011, pp. 479-511 (also available as NBER working paper no. 17156, June 2011). Crosby, F., Bromley, S., & Saxe, L. (1980). “Recent unobtrusive studies of Black and White discrimination and prejudice: A literature review” Psychological Bulletin, 87, 546-563. Darity, William A. Jr., and Patrick L. Mason. “Evidence on Discrimination in Employment: Codes of Color, Codes of Gender”. Journal of Economic Perspectives 12(2) (Spring 1998):63-90. Fryer, Roland G. Jr.. “Racial Inequality in the 21st Century: The Declining Significance of Discrimination”.in Orley Ashenfelter and David Card, eds. Handbook of Labor Economics volume 4B. Amsterdam: North-Holland (2010), pp. 855-971 Goldin, Claudia. Understanding the Gender Gap: An Economic History of American Women. New York: Oxford University Press, 1990. Goldin, Claudia. “The Rising (and then Declining) Significance of Gender, in The declining significance of gender? Eds. Francine D. Blau, Mary C. Brinton, and David B. Grusky. New York: Russell Sage Foundation, 2006. Gustafsson, Bjorn and Shi Li “Economic transformation and the gender earnings gap in urban China” Journal of Population Economics (2000) 13: 305-329 Greenwald, A. G., McGhee, D. E., & Schwartz, J. K. L. (1998). “Measuring individual differences in implicit cognition: The Implicit Association Test” Journal of Personality and Social Psychology, 74, 1464-1480. Groshen, Erica. “Sources of Intra-industry Wage Dispersion: How Much Do Employers Matter?” Quarterly Journal of Economics, vol. 106, no. 3, August 1991, pp. 869-84 iResearch Inc. (2011). iResearch China Online Recruitment Research Report 2010-2011 (simplified version), available at http://www.iresearch.com.cn/Report/1583.html Kuhn, Peter and Kailing Shen. “Employers’ Preferences for Gender, Age, Height and Beauty: Direct Evidence”. NBER working paper no. 15564, December 2009 Lang¸ Kevin, Michael Manove and William T. Dickens. “Racial Discrimination in Labor Markets with Posted Wage Offers” American Economic Review. Vol. 95 (4). p 1327-40. September 2005. Lawler, John J. and Johngseok Bae. 1998. “ Overt Employment Discrimination by Multinational Firms: Cultural and Economic Influences in a Developing Country Industrial Relations 37(2): 126-152. Liu, Pak-Wai, Xin Meng and Junsen Zhang. “Sectoral gender wage differentials and discrimination in the transitional Chinese economy”. Journal of Population Economics (2000) 13: 331-352 Menzio, Guido. “A Theory of Partially Directed Search” Journal of Political Economy. Vol. 115 (5). p 748-69. October 2007. Michelacci, Claudio and Javier Suarez. “Incomplete Wage Posting” The Journal of Political Economy, vol. 114, No. 6 (December 2006), pp. 1098-1123. 35 Moen, Espen R. “Competitive Search Equilibrium” Journal of Political Economy. Vol. 105 (2). p 385- 411. April 1997. Ministry of Human Resources and Social Security of the Peoples’ Republic of China (2012). Urban Labor Market Survey (in Chinese). Available from: http://www.mohrss.gov.cn/page.do?pa=402880202405002801240882b84702d7&guid=1394a37c f8934ba0b8d31a557008f1c4&og=8a81f0842d0d556d012d11155d8c0043 Mortensen, Dale, and Christopher Pissaridies. “Job Creation and Job Destruction in the Theory of Unemployment.” Review of Economic Studies. July 1994; 61(3): 397-415 Neumark, David. “Sex Discrimination in Restaurant Hiring: An Audit Study” Quarterly Journal of Economics 111(3) (August 1996): 915-42. Price, Joseph and Justin Wolfers. “Racial Discrimination Among NBA Referees” The Quarterly Journal of Economics (2010) 125(4): 1859-1887 Rogerson, Richard, Robert Shimer and Randall Wright. “Search-Theoretic Models of the Labor Market: A Survey”. Journal of Economic Literature 43 (December 2005): 959-988. Rubinstein, Ariel and Yuval Salant. "A model of choice from lists", Theoretical Economics 1 (2006): 3-17 Van Ours, Jan, and Geert Ridder. “Job requirements and the recruitment of new employees” Economics Letters 36 (1991): 213-218. Zhang, Junsen, Jun Han, Pak-Wai Liu and Yaohui Zhao. “Trends in the Gender Earnings Gap in Urban China, 1988-2004” Industrial and Labor Relations Review, Vol. 61, No. 2 (January 2008). 36 Table 1: Sample Means, Zhaopin Job Ads A. AD CHARACTERISTICS Gender requirements No gender preference .895 Prefer male? .055 Prefer female? .050 Education requirements High school or less .129 Some postsecondary .457 University degree .414 Experience requirements None or less than one year .205 1-3 years .399 3-5 years .237 More than 5 years .158 Age requirements No age restrictions .757 Ad specifies a minimum age .169 Ad specifies a maximum age .202 Mean age requesteda 30.59 Wages Wage not specified .836 Mean Wage, when advertisedb 4,279 Number of positions advertised Unspecified .481 Mean number, when specified 1.692 Other job characteristics: Job is Part Time .009 Job requires beauty (xingxiang) .077 Job has a height requirement .026 B. FIRM CHARACTERISTICS Firm size (mean number of workers) 1,565 Firm ownership type:c Private, Domestic .564 Foreign .362 State-Owned Enterprise .074 Number of Ads 1,051,706 a Midpoint of the maximum and minimum age when both are specified b Wages are measured in RMB per month. Zhaopin prompts firms to specify a minimum and maximum wage. Ads are categorized as not specifying a wage when the (a) either the maximum or minimum is blank, or (b) the maximum and minimum are more than 20,000 RMB/month apart . c “Private, Domestic” includes privately held companies, publicly-traded companies and reformed State-Owned Enterprises. “Foreign” includes Foreign Direct Investment, joint ventures, plus a small number of representative offices. 37 Table 2: Share of Job Ads Expressing a Gender Preference, by Ad Characteristics Share of Job Ads Requesting Women With no Gender Preference Requesting Men? A. JOB SKILL INDICATORS: Education Requirements High school or less .113 .766 .120 Some college .059 .892 .049 University .021 .938 .042 Experience requirements None or less than one year .087 .860 .053 1-3 years .060 .889 .051 3-5 years .025 .920 .055 More than 5 years .015 .917 .068 Wages Wage not specified .046 .900 .054 Wage is specified .072 .870 .058 Wage, if specified: under 1500 .167 .734 .099 1500-2999 .114 .808 .078 3000-3999 .053 .899 .048 4000-7999 .034 .918 .048 8000+ .038 .929 .033 B. OTHER ASCRIPTIVE JOB REQUIREMENTS: Age requirements No age restrictions .029 .944 .027 Ad specifies a minimum age .116 .746 .138 Ad specifies a maximum age .124 .724 .151 Maximum and minimum specified .131 .721 .149 Mean age, if max and min specified: Under 25 .332 .607 .060 25-29 .176 .686 .138 30-34 .083 .756 .161 35+ .041 .771 .188 Job requires beauty (xingxiang)? No .034 .909 .056 Yes .239 .723 .038 Job has a height requirement? No .040 .907 .053 Yes .419 .438 .142 C. FIRM CHARACTERISTICS Firm size (number of workers) Under 25 .056 .902 .043 25-99 .059 .888 .053 100-999 .047 .895 .058 1000+ .040 .905 .055 Firm ownership type: c Private, Domestic .061 .874 .065 Foreign .035 .928 .037 State-Owned Enterprise .042 .893 .065 38 Table 3: Job Ads Expressing a Gender Preference, for selected subsamples of ads. Share of Job Ads: Ad characteristic: Requesting women With no Gender Preference Requesting men Ad requests beauty .239 .723 .038 Ad requests both beauty and height .559 .383 .059 Ad requests beauty, height and maximum age under 25 .871 .103 .026 Table 4: Advertised Gender Preferences, by Firm Table 5: Variance Decomposition Dependent Variable (1) (2) (3) Share of variance explained by: Ad Requests Men (PM ) Ad Requests Women (PF ) Ad is Gender- Targeted (PM + PF ) 1. Occupation .017 .031 .017 2. Firm .291 .279 .322 3. Joint Occupation and Firm .026 .031 .033 4. Total, Occupation plus Firm .334 .341 .373 5. Occupation*Firm Interactions .304 .327 .294 6. Total between Job Cells .639 .668 .667 7. Within Job Cells .361 .332 .333 8. TOTAL 1.000 1.000 1.000 Share of Firms Specifying a Preference For: Total Number of Ads Placed by the Firm: Any Gender Men Women Both Genders N 1 .165 .056 .109 .000 12,688 2-10 .326 .155 .224 .053 40,932 11-50 .548 .345 .392 .189 16,244 51 and over .707 .535 .559 .387 3,778 All Firms .367 .200 .258 .091 73,642 39 Table 6: Effects of Jobs’ Skill Demands on the Probability an Ad is Gender-Targetted (PM + PF ), OLS Estimates: ` (1) (2) (3) (4) Education Requirement: Some Postsecondary -.0744** -.0681** -.0599** -.0209* (.0050) (.0048) (.0049) (.0088) University -.1006** -.0946** -.0806** -.0202 (.0057) (.0056) (.0057) (.0106) Experience Requirement: 1-3 years -.0156** -.0177** -.0219** -.0255** (.0023) (.0024) (.0025) (.0058) 3-5 years -.0323** -.0323** -.0324** -.0348** (.0027) (.0027) (.0032) (.0069) More than 5 years -.0285** -.0288** -.0343** -.0250* (.0035) (.0033) (.0038) (.0100) Log (offered wage) -.0403** (.0058) Firm Ownership Type: -.0314** -.0304** Foreign Ownership (.0028) (.0027) -.0065 -.0032 State-owned Enterprise (.0036) (.0031) Fixed Effects (number of groups) Occ, Ind, Province (116) Occ*Ind* Province (22,581) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) N 1,051,706 1,051,706 1,051,706 172,887 R2 .078 .155 .669 .761 40 Table 7: Effects of Jobs’ Skill Demands on Employers’ Preferences Towards Men, (PM - PF ), OLS Estimates: (1) (2) (3) (4) Education Requirement: Some Postsecondary -.0057 -.0032 -.0127 -.0403** (.0067) (.0065) (.0083) (.0136) University .0037 .0091 .0000 -.0350* (.0078) (.0076) (.0090) (.0162) Experience Requirement: 1-3 years .0172** .0128** .0132** .0058 (.0027) (.0027) (.0029) (.0075) 3-5 years .0432** .0398** .0387** .0386** (.0045) (.0047) (.0057) (.0113) More than 5 years .0621** .0575** .0489** .0536** (.0056) (.0057) (.0068) (.0206) Log (offered wage) .0154 (.0089) Firm Ownership Type: -.0101** -.0080** Foreign Ownership (.0022) (.0021) .0127** .0144** State-owned Enterprise (.0030) (.0029) Fixed Effects (number of groups) Occ, Ind, Province (116) Occ*Ind* Province (22,581) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) N 1,051,706 1,051,706 1,051,706 172,887 R2 .064 .143 .641 .716 41 Table 8: Effects of Selected Covariates on Preferences for Other Ascriptive Characteristics: Ad is Age-Targeted?a Mean Age Requested, given age-targeted Request Beauty? Height Requirement? (1) (2) (3) (4) (5) (6) (7) (8) Education Requirement: Some Postsecondary -.0273** -.0250 .9761** .0228 -.0297** -.0112 -.0342** -.0203** (.0045) (.0143) (.1768) (.2714) (.0046) (.0103) (.0037) (.0059) University -.0389** -.0494** 1.8288** -.2707 -.0443** -.0157 -.0390** -.0253** (.0049) (.0136) (.2332) (.3893) (.0053) (.0117) (.0044) (.0081) Experience Requirement: 1-3 years -.0071** -.0045 .3579** .2705 -.0160** -.0187** -.0130** -.0216** (.0021) (.0068) (.1305) (.2022) (.0027) (.0059) (.0022) (.0056) 3-5 years .0022 -.0068 2.6247** 2.3585** -.0400** -.0416** -.0222** -.0346** (.0027) (.0097) (.2032) (.3547) (.0055) (.0080) (.0037) (.0091) More than 5 years .0153** .0094 5.0780** 4.3621** -.0471** -.0503** -.0250** -.0348* (.0034) (.0157) (.2326) (.4078) (.0058) (.0101) (.0043) (.0143) Log (Wage) .0083 2.2116** -.0185** -.0099* (.0069) (.1944) (.0056) (.0041) Fixed Effects (number of groups) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) Occ*Firm, Province (50,201) Occ*Firm, Province (11,797) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) N 1,051,706 172,887 134,768 27,909 1,051,706 172,887 1,051,706 172,887 R2 .670 .758 .885 .929 .659 .754 .668 .762 ** p<.01, * p<.05. OLS estimates. Regressions also control for a the number of vacancies advertised, a dummy for part-time jobs and period fixed effects. Standard errors are clustered at the occupation*province level. Specifications are identical to columns (3) and (4) of Tables 6 and 7. 42 Table 9: Effects of Jobs’ Skill Demands on the Probability an Ad is Gender-Targetted (PM + PF ) and on Preferences towards Men (PM - PF ) , OLS Estimates with controls for Other Ascriptive Characteristics Ad is Gender-Targeted (PM + PF ) Preferences Towards Men, (PM - PF ) (1) (2) (3) (4) JOB SKILL REQUIREMENTS: Education Requirement: Some Postsecondary -.0451** -.0125 -.0274** -.0473** (.0043) (.0089) (.0085) (.0135) University -.0619** -.0072 -.0184* -.0427** (.0051) (.0106) (.0093) (.0164) Experience Requirement: 1-3 years -.0165** -.0188** .0076** -.0031 (.0021) (.0054) (.0025) (.0062) 3-5 years -.0235** -.0236** .0246** .0203* (.0023) (.0066) (.0038) (.0084) More than 5 years -.0250** -.0156 .0287** .0297 (.0029) (.0093) (.0045) (.0159) Log (offered wage) -.0384** .0053 (.0056) (.0089) OTHER ASCRIPTIVE CHARACTERISTICS: Ad specifies age range? .1954** .0863 -.3792** -.3013** (.0288) (.0586) (.0365) (.0713) Mean age, when specified (years) -.0024** .0013 .0134** .0107** (.0009) (.0019) (.0011) (.0025) Ad requests beauty? .0661** .0705** -.1141** -.1390** (.0052) (.0096) (.0098) (.0176) Ad specifies minimum height? .2545** .2292** -.2114** -.2448** (.0161) (.0245) (.0183) (.0469) Fixed Effects (number of groups) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) Occ*Firm, Province (258,751) Occ*Firm, Province (63,333) N 1,051,706 172,887 1,051,706 172,887 R2 .684 .772 .652 .727 43 Table 10: Effects of Education Requirements on Advertised Preferences in Highly Male and Highly Female Jobs Education Requirement Share of Ads requesting men in “Highly Male” Jobs: Share of Ads requesting women in “Highly Female” Jobs: High school or less .640 .815 Some college .534** .637** University .509* .517** Highly Male Jobs are Technical Workers, maximum age 25 or over Highly Female Jobs are Jobs that request beauty, maximum age under 25 **, * refer to differences from row one, significant at 1 and 5 percent respectively. Table 11: Predicted Share of Gendered Job Ads for All Employed Workers, High-Income Province Sample Share of Job Ads: Predictions based on: Sample Requesting Women With no Gender Preference Requesting Men 1. Education Zhaopin .038 .912 .050 Census .088 .809 .103 2. Education, Industry, Occupation, Firm Type Zhaopin .038 .912 .050 Census .076 .792 .132 3. Education, Industry, Occupation, Firm Type, Wage Zhaopin .056 .889 .056 Census .090 .783 .127 4. Education, Industry, Occupation, Firm Type, Age Zhaopin .104 .752 .144 Census .125 .595 .280 44 Figure 1: Occupation Fixed Effects for Gender, by Mean Education Requirements a) Dependent Variable: Tendency to Gender-Target Ads (PM + PF ) b) Dependent Variable: Employer’s Relative Preference towards Men (PM - PF ) Figure 2: Occupation Fixed Effects for Other Ascriptive Requirements IT mkting accting admin constr mfg HR fin trade comm_log mgmt electronic design ed_train QC const_mach consult tech biomed other lawtemp comm_techtransl studentenergy chem labor envir research primary textile vehicle sales cust_serv tourism media retail health -.1 0 .1.2.3 10 12 14 16 18 mean years of education required NoContact NoContact CustContact CustContact 95% CI Fitted values Propensity to target ads by gender IT mkting accting admin constr mfg HRfin trade comm_log mgmt electronic design ed_train QC const_mach consult tech biomedother law temp comm_tech transl student energy chem labor envir research primary textile vehicle sales cust_serv tourism media retail health -.1 0 .1.2.3.4 10 12 14 16 18 mean years of education required NoContact NoContact CustContact CustContact 95% CI Fitted values Preferences towards men 45 a) Dependent Variable: Tendency to Target Ads by Age (Specific Age Range is requested) b) Dependent Variable: Midpoint of Age Range, when requested. e) Dependent Variable: Propensity to Request Beauty IT mkting accting admin constrmfg HRfin trade comm_log mgmt electronicdesign ed_trainQC const_machconsult tech biomed other law temp comm_tech transl student energy chem labor envir research primary textile vehicle salescust_serv tourism media retail health -.1 0 .1.2.3 10 12 14 16 18 mean years of education required NoContact NoContact CustContact CustContact 95% CI Fitted values Propensity to target ads by age IT mkting accting admin constr mfg HR fin trade comm_log mgmt electronic design ed_train QC const_mach consult tech biomed other law temp comm_tech transl energy chem labor envir research primary textile vehicle sales cust_serv tourism media retail health -4-2 02 10 12 14 16 18 mean years of education required NoContact NoContact CustContact CustContact 95% CI Fitted values Mean Age Requested 46 f) Dependent Variable: Propensity to Specify a Minimum Height IT mkting accting admin constr mfg HR fin trade comm_log mgmt electronic design ed_train QC const_mach consult tech biomed other law temp comm_tech translstudent energychem labor envir researchprimary textile vehicle salescust_serv tourism media retail health -.1 0 .1.2.3 10 12 14 16 18 mean years of education required NoContact NoContact CustContact CustContact 95% CI Fitted values Propensity to have a beauty requirement ITmktingaccting admin constrmfg HRfintradecomm_log mgmt electronicdesign ed_trainQC const_machconsult tech biomed other law temp comm_techtransl student energy chem labor envir researchprimarytextile vehicle salescust_serv tourism media retail health 0 .1.2.3 10 12 14 16 18 mean years of education required NoContact NoContact CustContact CustContact 95% CI Fitted values Propensity to have a height requirement 47 Appendix 1: Data As noted, our overall sample consists of all job ads which appeared on Zhaopin.com between during four observation periods: May 19 2008 - June 22 2008, January 19 2009 February 22 2009, May 18 2009 - June 21 2009, and January 18 2010 - February 21 2010. At the end of each day, our program automatically searches for job ads that were posted on Zhaopin that day. The program starts at 11:30pm sharp each day for consistency. On the first day of data collection, all ads that were posted that day were kept. On subsequent days, all ads posted that day are compared with the master list of previously-posted jobs; since many such jobs are just renewals that are re-posted (employers can re-post and existing ad; this entails a small marginal financial cost but does require action on the employer’s part), we do not download these refreshed jobs but maintain a count of the number of renewals that occur during this time period. As a result, our data have information on every job that was posted or renewed during this time period, linked to information about the firm posting the job. All of our regression analysis is restricted to the sample of jobs for which we have matching firm information. The matching rate varies somewhat across specifications but was about 80%. Age, gender and other job requirements were extracted from each job’s html file. For example, in the case of gender, we look for “nue”(female) and “nan”(male) characters in the job description section of the file. We then constructed a match table summarizing about 1468 ways for a job ad to mention “nue”(female) and “nan”(male). After that, we use a program and this match table to derive the gender discrimination variable automatically. We consider our table quite exhaustive. In addition, we visually check all the ads that mentioned gender in a way that did not match these tables. Only about 100 ads fell into this category. For age variables, we search for “sui” (year of age); our approach could therefore miss jobs that ask for age only using numbers “25-35”. Therefore, our indicators should be interpreted as very explicit requirements for gender, age and other characteristics. Occupation and industry categories are those supplied by Zhaopin.com (firms choose from a list on the website). Note that Zhaopin’s occupational categories are not mutually exclusive: firms are allowed to check up to three categories. Since the share of vacancies corresponding to the each occupations is generally unspecified, we restricted our sample to ads for a single occupation; this reduces our sample by about 20 percent. Our results were, however, very similar when we included all ads and classified them according to the first occupation listed, or when we allocated ads fractionally, and equally, across all the occupations they listed. Firms can also list multiple industries; we resolved this by simply choosing the industry that was listed first, and similar robustness checks showed that this has little effect as well. Finally, the analysis sample for the current paper also excludes the approximately 20 percent of ads that did not specify what education level was required (inspection of these ads showed that these were not necessarily unskilled jobs). Another sampling issue is the fact that Zhaopin ads can be for either one or more vacancies; we handle this by treating the ad as the unit of analysis and including statistical controls for the number of positions on offer in all the regression analyses, with very little effect on the results. To assess the effect of length-biased vacancy sampling on our results, we replicated our analysis for a subsample of ads from near the end of each data collection period that consists, almost certainly, of newly posted ads. Again, the results were very similar. To assess the representativeness of our Zhaopin ads relative to a population of occupied jobs, columns 1 and 2 of Table A1 compare the observable characteristics of Zhaopin ads to the 48 employed urban Census population in China’s eight highest-income provinces.49 Together, these eight provinces account for 78 percent of all the Zhaopin ads and for 41 percent of national employment in 2009. They have an employment-weighted annual GDP per capita of RMB 46,930, compared with 31,919 for the nation as a whole.50 A key distinction from Table 1 is that, for characteristics such as gender, age, and wage which are often unspecified in job ads, the Table A1 Zhaopin means refer to the subset of ads in which firms actually specified a preference. For example, according to column 2, of the job ads in high-income provinces that expressed a gender preference, 56.5 percent requested men. The goal is simply to give a rough picture of the types of workers that firms are seeking in the Zhaopin data, and how these compare to urban working people in these eight provinces. As noted in the paper, compared to employed persons in the same set of provinces, Zhaopin ads serve young, well-educated, well-paid private-sector workers. And while the Zhaopin industry and occupation categories do not correspond neatly to the available Census categories, a few conclusions about industry and occupation mix can be drawn.51 First, relative to the general working population, the IT/ communication and R&D/consulting industries are highly overrepresented on Zhaopin, together accounting for over 33 percent of Zhaopin ads compared with under 4 percent of total employment. Three industry categories – construction/transporation, trade/hospitality/entertainment, and finance/insurance/real estate— are about equally represented in Zhaopin compared to total employment, while the remaining industries are underrepresented in Zhaopin ads. Notably, even though manufacturing is underrepresented on Zhaopin, almost one quarter of Zhaopin ads are from manufacturing firms. Less detailed matches are possible on occupation, but professional and technical workers are clearly overrerpresented on Zhaopin, with 60 percent of the ads, compared with 17 percent of employment. Columns 3 and 4 of Table A1 compare the gender mix of Zhaopin ads to the gender mix of employment; in the same spirit as columns 1 and 2, gender mix in the Zhaopin data is represented by the share of gendered ads that are female. According to columns 3 and 4, for workers under 30, the share of gendered ads requesting women closely mirrors women’s share of the workforce, at a little over 50 percent. Further, both the share of women in the workforce and the share of Zhaopin ads decline with the worker’s age.52 Turning to education, columns 3 and 4 of Table A1 show that the female workforce in these cities is a little less likely to have a university degree than the male workforce, and this difference is mirrored, somewhat more strongly, in the Zhaopin job ads. There is also a broad correspondence between industries that tend to employ women (trade/hospitality/entertainment and health/education/ welfare) and industries where job ads are targeted at women. The same is true for occupation, with sales and service occupations being highly female in both Zhaopin and the Census and production/ construction being highly male. In both Zhaopin and the Census, the private sector is more ‘female’ than state-owned enterprises; finally women are underrepresented in high-wage jobs in both Census employment and in Zhaopin ads. Overall, Zhaopin ads tend to request female workers in the types of jobs where, according to the Census, women are already employed. 49 2005 Census data are from the 1% National Population Sample Survey, conducted by China’s National Bureau of Statistics. The microdata were kindly supplied by Loren Brandt of the University of Toronto. Some details on methodology are available at: http://www.stats.gov.cn/eNgliSH/newsandcomingevents/t20060322_402312182.htm 50 GDP and employment data are from the National Bureau of Statistics. 51 The crosswalk used to match the two sets of occupation and industry categories is available on request from the authors. 52 The considerably more precipitous decline in the Zhaopin ads seems to reflect an interaction between firms’ underlying age and gender preferences, which we document in the paper. 49 Table A1: Descriptive Statistics, Zhaopin.com Ads versus 2005 Census Employed Population, High-Income Provinces Note: Zhaopin.com distributions refer to ads that stated a preference for the attribute (e.g. age, gender, wage) only. Sample is restricted to urban workers in the eight provinces with the most Zhaopin ads, which are also top 8 provinces in GDP per capita: Beijing, Shanghai, Guangdong, Jiangsu, Shandong, Tianjing, Zhejiang and Liaoning. The share female in Zhaopin ads for workers over 50 is not reported due to the extremely low sample size. Share in Category Share Female within Category (1) (2) (3) (4) Census Zhaopin Census Zhaopin Gender Male .548 .565 Age 30 or below .404 .521 .511 .555 31-40 .311 .450 .466 .226 41-50 .214 .029 .407 .113 51-60 .071 .001 .197 Education High school or below .766 .114 .455 .464 College .135 .431 .465 .508 University .100 .455 .414 .283 Industry: Primary, Manufacturing and Utility .453 .267 .465 .381 Construction and Transportation .118 .135 .202 .300 IT and Communication .016 .185 .405 .494 Trade, Hospitality and Entertainment .165 .175 .558 .590 Finance, Insurance and Real Estate .063 .052 .454 .506 R&D and Consulting .023 .153 .376 .389 Health, Education and Welfare .102 .033 .617 .687 Public Sector .060 .000 .318 Occupation: Senior Management .025 .021 .265 .425 Professional and Technical .170 .600 .562 .435 Sales and Service .230 .261 .556 .583 Production and Construction .443 .119 .399 .121 Public Servants .132 .000 .342 Firm ownership: Private Sector Sd or collective .589 .930 .482 .444 SOEs and collectives .271 .070 .384 .325 Public Administration .140 .000 .456 Wage distribution: 1500 or below .778 .145 .479 .503 1501-3000 .176 .164 .369 .559 3001-4000 .021 .214 .356 .509 4001-8000 .022 .244 .296 .407 8001 or above .003 .126 .251 .315 50 Appendix 2: Proofs Proof of Lemma 1: We begin by normalizing the net value of an applicant, defining uj ≡ Uj/β = vG /β + ej, where ej = εj/β follows a “standard” extreme value distribution with Var(ej) = π2 /6 and E(ej) = γ, j indexes applicants, and ),( FMG indexes groups. This normalization does not affect the firm’s optimal selection of a worker –the draw of ej that maximizes u corresponds to the draw of εj that maximizes U-- and the maximized value of U can be calculated as βu*, where u* is the maximized value of u. Further, this normalization expresses the problem in a standard multinomial logit format, which allows us to draw on some results from that literature. Among these, it is well known that the expected value of the maximum of vG /β + ej when ej is independently drawn N times from a “standard” extreme value distribution is vG /β + γ + log(N).53 Multiplying through by β the expected maximum of U when the firm samples from either the M or F pool separately is (A1) UG * = vG + βγ + βlog(G), which proves part (a) of the Lemma. Turning back to standardized net values, the expected value of the highest uj in the “combined” sample, uC *, equals uM *qM + uF *(1- qM ), where uG * is the expected value of the best overall worker given the best overall worker is from group G, and qM is the probability that the best overall worker turns out to be an M. Again using results from the MNL literature, we know that uM * = vM /β + γ – log(pM ), where: )exp()exp( )exp(   FM M M vFvM v p   is the probability that an individual type-M applicant turns out to be the best in the entire, combined pool. Similarly, we have uF * = vF /β + γ – log(pF ), where: )exp()exp( )exp(   FF F F vFvM v p   . Finally, the probability that the firm’s preferred applicant from this combined pool is drawn from the M’s is just: )exp()exp( )exp(   FM M M vFvM vM q   . Note that, as the variance of individual productivity (β) falls towards zero, the probability that the best overall worker will be from the group with the higher net value (v) approaches one; conversely as β approaches infinity, qM approaches the share of M’s in the population, i.e. M/(M+F). 53 See Arcidiacono and Miller (2008, p. 8) for a general proof. 51 Combining all the necessary expressions and simplifying, the expected standardized value of the best worker from the combined pool can be written as: uC * = γ + log [ Mexp(vM / β) + Fexp(vF / β) ] . Letting δ = M/(M+F) ≡ M/C be the fraction of M’s in the combined pool, this becomes: uC * = γ + log [ δ exp(vM / β ) + (1-δ )exp(vF / β ) ] + log C . Multiplying through by β, the corresponding maximized unstandardized value is therefore: UC * = γ β + β log [ δ exp(vM / β ) + (1-δ )exp(vF / β ) ] + β log C . Expressing this in terms of the means of the unstandardized distributions, µ G ≡ vG + βγ, yields after some algebra: (A2) CU FM C logexp)1(explog*                           , which proves part (b). Proof of Proposition 1: Consider first the difference in the firm’s objective function (“profits”) between recruiting strategy M (invite men only) and C (invite all). Expected profits from inviting only men to apply are (from (2) and (3)): ΠM = UM * - cN = μM + βlog(N) – cN. Expected profits from a combined strategy are (from (2) and (5): ΠC = UC * - 2cN = N MF M log(exp1log                   - 2cN. Subtracting and simplifying yields: (A3) ΠM - ΠC =    cNz  exp1log ≡ RM (z, β) + cN where N = .5C is the (equal) number of applications expected from each group, and   FM z   is the standardized expected net value advantage of group M. By symmetry, (A4) ΠF - ΠC =    cNz  exp1log = RM (-z, β) + cN ≡ RF (z, β) + cN 52 Inspection of (A3) and (A4) shows that RM (z, β) is always negative and increasing in z. RM (-z, β) is thus also always negative and decreasing in z. Further, these two functions intersect when z = 0, at the level 2log , as shown in Figure A1 below. When –cN > 2log , as is the case when cN=.4 in Figure A1, it follows from (A4) and (A5) that firms prefer to advertise only to men when z > z*, to advertise only to women when z<-z*, and not to restrict their ads when -z*0, and to women when z<0. Figure A1: Gains and Costs to Restricted Searches, β=1 The critical values, z* and –z*, can be found by setting A3 or A4 equal to zero, yielding: (A5)  1)/exp(log*  cNz This completes the proof for a job with standard skill requirements. To generalize the analysis to any job, recall that by assumption skill requirements are represented by a constant, θ, that multiplies all realizations of Uj . Thus, changes in θ multiply both the means (µ M and µ F ) and standard deviation )6/(   of the applicants’ net values (Uj) by the same constant, leaving the standardized expected gender gap in net values for a particular job,   FM z   -2 -1.5 -1 -.5 0 -2 -1 0 1 2 z RM RF z*-z* cN=.4 cN=1 -ln(2) 53 unchanged. Therefore, the only change in firms’ decision rules involves the threshold, z*, which becomes, for a job with skill requirement θ :  1)/exp(log*  cNz , as claimed.54 Appendix 3: A Linear Approximation to the Model Consider first the effects of an infinitesimal increase in covariate xk on the probability that an ad is targetted at men (PM ) or women (PF ) respectively in our model. Differentiating the corresponding terms in equation (8), these marginal effects are given by:   *)()exp(/ 2fbdxP kkk M  xd (A6)   *)()exp(/ 1fbdxP kkk F  xd , where ν1* (ν2*) is the value of ν below (above) which firms advertise a preference for men (women). Combining these to yield the marginal effect of xk on the probability that a firm engages in some type of gender discrimination: (A7) ∂(PM +PF )/ ∂xk =    *)(*)(*)(*)()exp( 1221  ffbffd kk xd . Now suppose for a moment that f(νi) is uniform with density m/2. In this case, (A7) reduces to (A8) ∂(PM +PF )/ ∂xk = )exp( xdmdk , while a parallel argument for (PM - PF ) yields: (A9) ∂(PM -PF )/ ∂xk = bkm. In other words, if f(νi) is uniform, then up to a factor of proportionality (equal to )exp( xdm for an individual observation or ))(exp( xdmE for the average marginal effect in the estimation sample), the parameter dk is equal to xk’s marginal effect on the probability that a firm discriminates in some direction in its ad. This is true regardless of xk’s effects on the firm’s preference towards men, z, because any such effects subtract out of (A7). Similarly, bk is identified by the marginal effect of xk on the difference between the probabilities, PM -PF . More generally, when f(νi) is not uniform, the average marginal effects become: (A10) E[∂(PM +PF )/ ∂xk] =    )()()]()()[exp( * 1 * 2 * 2 * 1 vfvfEbvfvfEd kk xd , (A11) E[∂(PM - PF )/ ∂xk] =    )()()]()()[exp( * 1 * 2 * 1 * 2 vfvfEbvfvfEd kk xd , 54 Since for given μM - μF , z depends by definition on β, Proposition 1’s predictions for the effects of the idiosyncratic variance parameter, β, must be interpreted as conditional on a given z, i.e. conditional on a given standardized gap in expected values. In consequence the effects of β are, in fact, identical to the effects of rescaling workers’ value by θ. However, it is easy to show that the effects of an increase in β conditional on a given absolute gap in expected values, i.e. a given μM - μF , is qualitatively the same but stronger in magnitude. To see this, note that for given μM - μF , an increase in β ‘shrinks’ all the z’s towards the origin. This effect reinforces the effects of β on the thresholds (z*’s), thus reinforcing the decline in the ‘probability’ that firms will adopt gender restrictions of either kind. 54 Now, if the share of ads targetted at men versus women is similar (PM ≈ PF ), then for any symmetric f(νi), the expected difference in densities  )()( * 1 * 2 vfvfE  will be approximately zero. This, combined with the fact that the covariate vector xd only moves ν1* and ν2* further apart or closer together, means that both the second term in (A10) and the first in (A11) will be close to zero. Thus, (A8) and (A9) remain approximately true for the average marginal effects in the estimation sample, with the appropriate expectations replacing the constants )exp( xdm and m respectively. Finally, we appeal to the fact that linear probability models typically estimate the average marginal effects well, even in limited dependent variable contexts. (According to Angrist and Pischke (2009), the correspondence is exact in the case of a single, dummy regressor (pp. 96-98), and approximate in more general applications (pp. 104-107). In sum, according to our argument, the OLS coefficient in a regression of PM + PF on xk will be approximately equal to gdk, and the OLS coefficient in a regression of PM - PF on xk will be approximately equal to hbk, where g =  )]()()[exp( * 2 * 1 vfvfE xd and )]()([ * 2 * 1 vfvfEh  . Both Maximimum Likelihood and linear probability model estimates of our baseline model in equations (6)-(8) are available from the authors for specifications in which both are feasible. In these specifications, the marginal effects and patterns of statistical significance estimated via the two approaches are very similar. Appendix 4: Introducing Directed Worker Search Continue to denote N=.5C as the (common) number of applications expected from each group when the firm posts an unrestricted ad. But now, to incorporate the possibility that requesting (say) men attracts a higher number of male applicants than an unrestricted ad, assume that a gender-specific ad attracts φN ads of that gender, where 1 < φ < 2.55 Now, the profit gap between a male-only ad and an undirected ad in a job with standard skill requirements (A3 in Appendix 2) becomes: (1) ΠM - ΠC =    )2(logexp1log   cNcNz . Setting this equal to zero and solving for the threshold, z*, and using Appendix 2’s arguments to generalize to a job of skill level θ yields: (2)              1)2(logexplog*    cN z . Since 2- φ >0, all four parameters in our original model (c, N, θ and β) have the same qualitative effects on gender-targeting as in the original model: Higher levels of c and N reduce z*, and thus increase the likelihood an ad will be gender-targeted. Higher levels of θ and β have the opposite effect; thus the model continues to predict that the incidence of gender targeting should decline with jobs’ skill levels. Note also that, when there is no workers’ supply response to targeted ads (φ=1) (2) simplifies to our original model. 55 If φ > 2 the firm will always restrict its ads to its preferred gender. 55 The effects φ on the rate of gender targeting are more subtle. To see this, consider first the polar case where application processing costs, c, are zero. Now, (2) simplifies to: (3)   1logexplog*  z . Thus, increases in the magnitude of workers’ supply response to targeted ads reduce z*, thus raising the share of ads that are targeted.56 Clearly, this positive response of gender-targeting to φ extends to parameterizations in c is positive but small in magnitude. For larger values of c, however, another force is at play that can change the direction of this effect. Specifically, if applications are costly to process (high c), already plentiful (high N), and have low option value (low β and θ), then firms might actually avoid posting targeted ads when workers’ supply response (φ ) is high. 56 Note also that ads may be gender-targeted in this new model even if c = 0; thus application processing costs are no longer required to explain the existence of gender-targeted ads.