//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (2024)

\floatsetup

[table]captionskip=0pt

Kara KedrickCarnegie Mellon University, kkedrick@andrew.cmu.eduEkaterina LevitskayaColeridge Initiative, ekaterina.levitskaya@coleridgeinitiative.orgRussell J. FunkUniversity of Minnesota, rfunk@umn.edu

Abstract

A growing stream of research finds that scientific contributions are evaluated differently depending on the gender of the author. In this article, we consider whether gender differences in writing styles—how men and women communicate their work—may contribute to these observed gender gaps. We ground our investigation in a framework for characterizing the linguistic style of written text, with two sets of features—informational (i.e., features that emphasize facts) and involved (i.e., features that emphasize relationships). Using a large sample of academic papers and patents, we find significant differences in writing style by gender, with women using more involved features in their writing. Papers and patents with more involved features also tend to be cited more by women. Our findings suggest that scientific text is not devoid of personal character, which could contribute to bias in evaluation, thereby compromising the norm of universalism as a foundational principle of science.

1 Introduction

In his classic essay, “The Normative Structure of Science,” sociologist Robert K. Merton identified universalism as a foundational principle of the scientific enterprise, one that distinguishes science from other competing systems of knowing. According to Merton and Storer’s formulation (Merton and Storer,, 1973, p. 270), universalism holds that the evaluation of scientific contributions “is not to depend on the personal or social attributes of their protagonist; his race, nationality, religion, class, and personal qualities are as such irrelevant.” The value of universalism is manifested perhaps most concretely in the practice of double-blind peer review, wherein the identities of both those making scientific claims and those evaluating them are obscured from one another (Bornmann,, 2011).

While scholars have long observed that adherence to the principle of universalism is far from universal (Mulkay,, 1976; Cole,, 1992; Long and Fox,, 1995), the growing availability of large-scale databases is creating opportunities for unprecedented insight into processes of scientific evaluation (Teplitskiy etal.,, 2018; Dondio etal.,, 2019; Lane etal.,, 2021), including the barriers that inhibit objective assessments. Recent literature in particular has raised considerable concern about the role of gender in scientific evaluation (Moss-Racusin etal.,, 2012; Reuben etal.,, 2014; Oliveira etal.,, 2019; Card etal., 2020a, ). There is growing evidence to suggest that the contributions of women are devalued relative to men (Joshi,, 2014; Sarsons,, 2017; Vásárhelyi etal.,, 2021; King etal.,, 2017; Ni etal.,, 2021), thereby undermining the imperative of universalism, and ultimately weakening the scientific enterprise by suppressing diverse perspectives. Prior work has shown, for example, that relative to men, women’s contributions are awarded less grant funding from prestigious sponsors (Oliveira etal.,, 2019; Kolev etal.,, 2020), less likely to be granted patent protection (Jensen etal.,, 2018), and also receive fewer citations from other academic papers (Larivière etal.,, 2013). These gaps appear to be driven at least in part by hom*ophily, as men tend to cite other men over women, even in fields with higher female representation, such as the social sciences (Potthoff and Zimmermann,, 2017; Dion etal.,, 2018; Ghiasi etal.,, 2018). Men also tend to self-cite at a much higher rate than women (King etal.,, 2017).

Yet much remains unknown about the role of gender in scientific assessment. Prior scholarship has highlighted differences in field, topic choice, or research quality as playing an important role in explaining observed gaps between men and women (Leahey,, 2007; Key and Sumner,, 2019). While these factors are undoubtedly important, studies have also shown that even when accounting for field, topic choice, and quality, men’s and women’s research is evaluated differently (Hengel,, 2017; Hengel and Moon,, 2020). Studies further report gender gaps even in the presence of double-blind review (Kolev etal.,, 2020; Mahajan etal.,, 2020), suggesting that conscious or unconscious bias based on the identity of the author is likely not the determinative factor (vander Lee and Ellemers,, 2015; Hospido and Sanz,, 2019; Card etal.,, 2021; Card etal., 2020b, ).

In this article, we aim to contribute to the systematic understanding of gender differences in the evaluation of scientific and technical work using a linguistic lens. Specifically, we draw on a prior framework, developed by Biber, (1988), which contrasts linguistic features characteristic of a high information density (“informational” features) with linguistic features associated with more affective, interactional content (“involved” features). This framework allows us to take a systematic approach to feature selection and also to examine features which have been overlooked as indicators of variation in the previous literature on gender gaps, such as the use of determiners or pronouns. Biber, (1988) found in a corpus of epistolary writing that women used more “involved” features (e.g. increased use of pronouns) for the purpose of building a relationship with the audience, while men used more “informational” features (e.g. increased use of cardinal numbers in texts) for the purpose of conveying the factual information. Using this framework, we conduct a large-scale analysis of gender differences in scientific and technical writing, leveraging two distinctive document types—academic articles and patented inventions—which collectively span all major fields of science and technology.

Several noteworthy findings emerge from our study. We document that even in such restricted genres as academic writing and invention descriptions, female authors tend to use more “involved” features than male authors, a pattern that holds across all scientific fields and patent subcategories. In the patent data specifically, we find that the gender of the lawyer appears to have more impact on the writing than the gender of the inventor, suggesting both the importance of attorneys in crafting patent text and the universal nature of the patterns we observe, even across individuals with very different professional training. Finally, we show that work that uses a higher number of “involved” features is more likely to be cited by female authors.

Our study builds on, but also extends, a small but growing stream of research in the Science of Science on gender and language (c.f., Fortunato etal.,, 2018). First, while existing literature has identified a number of differences in men’s and women’s scientific writing (Lerchenmueller etal.,, 2019; Kolev etal.,, 2020; Kim etal.,, 2022), the choice of features have not been motivated by a linguistic framework, thereby complicating the interpretation of the results and making comparisons across studies difficult. Second, existing research has been done within particular fields and on particular kinds of text (e.g., scientific papers), and therefore little is known about whether the patterns observed thus far will generalize across diverse fields and document types (such as patent data), which have diverse norms.

Our study also adds to the linguistics literature. First, previous research on gender differences in language utilization has largely been done on informal or conversational text, typically focusing on speech patterns, and using corpora like emails (Thomson etal.,, 2001), essays (Mulac etal.,, 2006), social media (Garimella and Mihalcea,, 2016), blogs (Pennebaker,, 2011). There has been far less research on gender differences in formal writing, and the work that has been done has relied on relatively small samples (Argamon etal.,, 2003; Biber,, 1988). Second, we study a new source of text data—patented inventions—that has not been studied in previous linguistic analyses. Patents represent a more restricted genre in terms of stylistic expression than academic papers, therefore, finding stylistic differences by gender is especially notable.

2 Results

2.1 Data Overview

We conduct the analysis on two datasets: abstracts of academic papers fromWeb of Science111Web of Science: https://clarivate.com/webofsciencegroup/solutions/web-of-science/ and abstracts of patents from PatentsView222PatentsView: http://www.patentsview.org/. For the purposes of our study, we narrow our focus to solo-authored papers andpatents (i.e., we exclude papers and patents written by teams). In thepatent data, we also narrow our sample to include only those inventors with a single lawyer, in order to more cleanly control for the gender of the lawyer, as lawyers are frequently involved in the writing and editing of patent text. In total, we use 256,260 publications and 16,053 patents in the analysis. Additional details on the dataset construction can be found in the Materials and Methods section.

For each paper and patent in our sample, we construct a series of measures tracking theutilization of informational features (using past tense, determiners, and cardinal numbers as indicators) and involved features (using pronouns, questions, and non-phrasal coordination (“and”) as indicators). Figure 1 gives an overview of the specific features of interest as well as more details about the computation of these features.333We also released an open-source Python package to compute these linguistic features, available here: https://pypi.org/project/stylometer. We define three separate measures: involved rate, informational rate, and Involved-InformationalRatio. Because our corpus includes academic writing and invention descriptions, there is already a high utilization of informational features; therefore, in addition to involved and informational rates, we are also interested in seeing the relative rate at which writers use involved features in relation to informational features (Involved-Informational Ratio). More information on the developed measures is included in the Materials and Methods section.

//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (1)

2.2 Writing Examples

Before presenting our quantitative results, we first highlight several examples from our data that illustrate the differences between the informational and involved styles. Beginning with papers, consider the following excerpts from two abstracts, one written by a female author (1) and the other written by a male (2). Both papers were published in the same field (Business & Economics), in the same journal (Financial Management Journal), during the same time period (1990s).

(1) Why do companies in the United Kingdom pay scrip (i.e., in stock) dividends? Are tax savings the sole motive for this option? If so, are all tax-loss companies offering this option? Or is this option driven by other motives, such as cash savings, signaling, and agency conflicts? In this study, I provide some insights into the motivation for scrip-dividend payment by comparing the operational performance and other characteristics of all companies that distributed scrip dividends with those of a control group of non-scrip-paying, but otherwise similar, firms. Scrip dividends are offered by an increasing number of companies in the United Kingdom as an option whereby shareholders are able to choose between receiving dividends in cash or the equivalent in the form of shares (scrip). Companies stress tax savings when they offer this option because, unlike cash dividends, the scrip option is not subject to a payment of the advanced corporation tax (ACT).

(2) The rich variety of securities innovations in recent years continues to intrigue academicians and practitioners alike. The array of innovative securities includes a variety of equity-linked debt securities. On January 28, 1991, the Republic of Austria publicly offered $100 million principal amount of a new equity-linked debt security called stock index growth notes (“SIGNs”) in the United States. The SIGNs were scheduled to mature in approximately 5.5 years. They would make no payments of interest prior to maturity. If the value of the S&P 500 is below 336.69 on the maturity date, the holder will receive $10, the initial offering price. If the value exceeds 336.69, the holder will get $10 plus $10 multiplied by the percentage appreciation in the S&P 500 above 336.69. SIGNS may thus be characterized as a package consisting of (i) a 5.5-year triple-A-rated zero coupon note plus (ii) a 5.5-year European call option, or warrant, on the S&P 500 with a strike price of 336.69.

Although both abstracts address similar topics, they are presented in dramatically different ways. Abstract (2), written by the male author, is striking for its heavy utilization of cardinal numbers (e.g., “$100 million”, “5.5 years”, “above 336.69”), which are mostly avoided by the female author. Abstract (1), by contrast, stands out for its greater utilization of “connector words” (e.g., “and”) and questions, the latter of which are not used at all in Abstract (2). The female abstract reads a lot like a story, building a relationship with the reader through the use of pronouns, present tense, asking questions, and by stringing the clauses together with “and” in a loose, storytelling-like manner; the male abstract is focused on relaying the factual information, via the use of numbers and determiners, and past tense. Figure 2 shows the full text of both abstracts, zoomed out, with informational and involved features highlighted in different colors (blue for informational, yellow for involved).

While the gender differences highlighted in the cases above are somewhat unusual in degree, the general pattern is consistent across papers and patents in our larger sample. Figure 3 show the distribution of the Involved-Informational Ratio, stratified by author gender, for paper and patent abstracts, respectively. In both figures, we observe a clear pattern wherein, as the Involved-Informational Ratio increases, so too does the share of female abstracts (as indicated in the figure, the observed gender differences are statistically significant). Note in the patents, the number of bins is smaller because we observe a narrower range of Involved-Informational Ratio values for patents, perhaps reflecting the more constrained style of the genre.

//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (2)

2.3 Writing Analysis

Table 1 shows the breakdown of informational and involved features for academic papers and patents, respectively. The results there suggest that, in absolute terms, informational features are used significantly more often in both text sources, by both men and women. This distribution seems plausible given the nature of our corpus (i.e., scientific articles and patents are generally concerned with communicating information), and is also consistent with prior work; Biber, (1988), for example, situated academic prose closer to the informational dimension, and Argamon et al. (2013) noted the “greater quantification inherent to most non-fiction genres.” Table 2 also shows the mean and standard deviation of involved and informational features combined and the Involved-Informational Ratio in the academic papers and patents samples.

The results of our regression analysis are shown in Tables 3 (papers) and 4 (patents)444We evaluated our regression models for multicollinearity; all variance inflation factors were below the conventional threshold of 10, suggesting little multicollinearity among the covariates.. Beginning with papers, we find, as expected, a statistically significant association between author gender and the Involved-Informational Ratio of paper abstracts (β=0.02;p<0.001formulae-sequence𝛽0.02𝑝0.001\beta=0.02;p<0.001italic_β = 0.02 ; italic_p < 0.001, Model 2); relative to men, women tend to use a higher ratio of involved to informational features in their scientific writing. Holding all other variables at their means, the predicted Involved-Informational Ratio (based on Model 2 of Table 3) is about 6.7% greater for female than male authors. Models 4 and 6, also of Table 3, help to unpack this finding by estimating separate regressions predicting the utilization of involved and informational features, respectively. We find that relative to men, women use involved features at a significantly greater rate (β=0.23;p<0.001formulae-sequence𝛽0.23𝑝0.001\beta=0.23;p<0.001italic_β = 0.23 ; italic_p < 0.001, Model 4), and informational features at a significantly lower rate (β=0.18;p<0.001formulae-sequence𝛽0.18𝑝0.001\beta=-0.18;p<0.001italic_β = - 0.18 ; italic_p < 0.001, Model 6) in their writing. This suggests that the differences we detect between men and women in the Involved-Informational Ratio is likely driven by both the numerator and the denominator.

While the patterns we observe appear to hold universally across fields of science and technology, we find heterogeneity in the magnitude of the effect. Figure S9 in the Supplementary Materials shows the predicted gender differences by field. The differences are greater in the Social Sciences and Arts & Humanities and less pronounced in Physical Sciences and Technology, which may be explained by the fact that the former fields generally permit greater freedom of stylistic expression and higher use of involved features than the latter fields, which have more restricted stylistic rules (Argamon etal.,, 2003). Full regression results for each field are presented in Table LABEL:detailed_papers (for academic papers) and in Table LABEL:detailed_patents (for patents) in the Supplementary Materials.

Turning next to patents, we find similar patterns. There is a statistically significant association between inventor gender and the Involved-Informational Ratio of patent abstracts (β=0.00;p<0.01formulae-sequence𝛽0.00𝑝0.01\beta=0.00;p<0.01italic_β = 0.00 ; italic_p < 0.01, Model 2); relative to men, women tend to use a higher ratio of involved to informational features in their technical writing. Once again holding all other variables at their means, the predicted Involved-Informational Ratio (based on Model 2 of Table 4) is about 2.3% greater for female than male inventors. In Model 2 of Table 4, we evaluate the relationship between the gender of the patent lawyer and the Involved-Informational Ratio of the patent. Here, we find results that are similar to those we observed for inventor gender, with a statistically significant association between lawyer gender and Involved-Informational Ratio (β=0.01,p<0.01formulae-sequence𝛽0.01𝑝0.01\beta=0.01,p<0.01italic_β = 0.01 , italic_p < 0.01, Model 2). The magnitude of the association, however, is larger than what we observed for inventor gender; holding all other variables at their means, the predicted Involved-Informational Ratio (based on Model 2 of Table 4) is about 5.1% greater for patents with female than male attorneys.

Following our analytical strategy with papers, in Models 4 and 6 (Table 4), we run separate regressions predicting the rate of utilization for informational and involved features, respectively. Here, the findings are a bit more complex. We find that women use both informational (β<0.14,p<0.001formulae-sequence𝛽0.14𝑝0.001\beta<0.14,p<0.001italic_β < 0.14 , italic_p < 0.001, Model 4) and involved (β=0.41,p<0.001formulae-sequence𝛽0.41𝑝0.001\beta=0.41,p<0.001italic_β = 0.41 , italic_p < 0.001, Model 6) features at a significantly greater rate in their writing than men, suggesting that the association we observe between gender and the Involved-Informational Ratio (Model 2) is driven by the numerator. Female lawyers use involved features in their writing at a rate that is statistically indistinguishable from men (Model 4), but statistically fewer informational ones (β=0.62,p<0.001formulae-sequence𝛽0.62𝑝0.001\beta=-0.62,p<0.001italic_β = - 0.62 , italic_p < 0.001, Model 6).

Academic papers
Informational features
numbersdeterminerspast tense
meanSTDp-valuemeanSTDp-valuemeanSTDp-value
female1.521.80<0.019.933.07<0.014.502.46<0.01
male1.541.8310.143.084.452.40
Involved features
questions“and”pronoun
meanSTDp-valuemeanSTDp-valuemeanSTDp-value
female0.030.190.133.161.49<0.011.601.53<0.01
male0.030.192.941.471.581.51
Patents
Informational features
numbersdeterminerspast tense
meanSTDp-valuemeanSTDp-valuemeanSTDp-value
female1.302.230.7715.913.71<0.014.392.09<0.01
male1.292.1515.443.764.472.13
Involved features
questions“and”pronoun
meanSTDp-valuemeanSTDp-valuemeanSTDp-value
femaleNANANA2.751.50<0.010.330.620.52
maleNANA2.601.430.340.60
Academic papers
Involved (normalized)Informational (normalized)Ratio (normalized)
meanSTDp-valuemeanSTDp-valuemeanSTDp-value
female4.782.14<0.0115.964.04<0.010.330.21<0.01
male4.552.0916.134.020.310.20
Patents
Involved (normalized)Informational (normalized)Ratio (normalized)
meanSTDp-valuemeanSTDp-valuemeanSTDp-value
female3.081.60<0.0121.604.07<0.010.150.100.01
male2.941.5321.204.090.150.13
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Author is female (1 = Yes)0.02superscript0.02absent0.02^{***}0.02 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.02superscript0.02absent0.02^{***}0.02 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.23superscript0.23absent0.23^{***}0.23 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.23superscript0.23absent0.23^{***}0.23 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.18superscript0.18absent-0.18^{***}- 0.18 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.18superscript0.18absent-0.18^{***}- 0.18 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.00)0.00(0.00)( 0.00 )(0.00)0.00(0.00)( 0.00 )(0.01)0.01(0.01)( 0.01 )(0.01)0.01(0.01)( 0.01 )(0.01)0.01(0.01)( 0.01 )(0.01)0.01(0.01)( 0.01 )
Constant0.31superscript0.31absent0.31^{***}0.31 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.14superscript0.14absent0.14^{***}0.14 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT4.55superscript4.55absent4.55^{***}4.55 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.70superscript2.70absent2.70^{***}2.70 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT16.13superscript16.13absent16.13^{***}16.13 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT19.90superscript19.90absent19.90^{***}19.90 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.00)0.00(0.00)( 0.00 )(0.01)0.01(0.01)( 0.01 )(0.00)0.00(0.00)( 0.00 )(0.15)0.15(0.15)( 0.15 )(0.01)0.01(0.01)( 0.01 )(0.29)0.29(0.29)( 0.29 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520512520
r20.00250.00250.00250.00250.12710.12710.12710.12710.00300.00300.00300.00300.16920.16920.16920.16920.00050.00050.00050.00050.09910.09910.09910.0991
p<0.001superscript𝑝absent0.001{}^{***}p<0.001start_FLOATSUPERSCRIPT ∗ ∗ ∗ end_FLOATSUPERSCRIPT italic_p < 0.001; p<0.01superscript𝑝absent0.01{}^{**}p<0.01start_FLOATSUPERSCRIPT ∗ ∗ end_FLOATSUPERSCRIPT italic_p < 0.01; p<0.05superscript𝑝0.05{}^{*}p<0.05start_FLOATSUPERSCRIPT ∗ end_FLOATSUPERSCRIPT italic_p < 0.05
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.00superscript0.00absent0.00^{**}0.00 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.00superscript0.00absent0.00^{**}0.00 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.14superscript0.14absent0.14^{***}0.14 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.14superscript0.14absent0.14^{***}0.14 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.42superscript0.42absent0.42^{***}0.42 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.41superscript0.41absent0.41^{***}0.41 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.00)0.00(0.00)( 0.00 )(0.00)0.00(0.00)( 0.00 )(0.02)0.02(0.02)( 0.02 )(0.02)0.02(0.02)( 0.02 )(0.05)0.05(0.05)( 0.05 )(0.04)0.04(0.04)( 0.04 )
Lawyer is female (1 = Yes)0.02superscript0.02absent0.02^{***}0.02 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.01superscript0.01absent0.01^{**}0.01 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.020.020.020.020.020.020.020.021.21superscript1.21absent-1.21^{***}- 1.21 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.62superscript0.62absent-0.62^{***}- 0.62 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.00)0.00(0.00)( 0.00 )(0.00)0.00(0.00)( 0.00 )(0.03)0.03(0.03)( 0.03 )(0.04)0.04(0.04)( 0.04 )(0.09)0.09(0.09)( 0.09 )(0.09)0.09(0.09)( 0.09 )
Constant0.15superscript0.15absent0.15^{***}0.15 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.21superscript0.21absent0.21^{***}0.21 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.94superscript2.94absent2.94^{***}2.94 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.15superscript3.15absent3.15^{***}3.15 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT21.27superscript21.27absent21.27^{***}21.27 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT17.54superscript17.54absent17.54^{***}17.54 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.00)0.00(0.00)( 0.00 )(0.02)0.02(0.02)( 0.02 )(0.01)0.01(0.01)( 0.01 )(0.21)0.21(0.21)( 0.21 )(0.03)0.03(0.03)( 0.03 )(0.52)0.52(0.52)( 0.52 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N321063210632106321063210632106321063210632106321063210632106321063210632106321063210632106321063210632106321063210632106
r20.00130.00130.00130.00130.02970.02970.02970.02970.00210.00210.00210.00210.02190.02190.02190.02190.00800.00800.00800.00800.09890.09890.09890.0989
p<0.001superscript𝑝absent0.001{}^{***}p<0.001start_FLOATSUPERSCRIPT ∗ ∗ ∗ end_FLOATSUPERSCRIPT italic_p < 0.001; p<0.01superscript𝑝absent0.01{}^{**}p<0.01start_FLOATSUPERSCRIPT ∗ ∗ end_FLOATSUPERSCRIPT italic_p < 0.01; p<0.05superscript𝑝0.05{}^{*}p<0.05start_FLOATSUPERSCRIPT ∗ end_FLOATSUPERSCRIPT italic_p < 0.05

2.4 Citation Analysis

Next, we turn to our analysis of gender differences in citations, the results of which are shown in Tables 5 (academic papers) and 6 (patents). We also provide a visualization of our findings in Figure 4. To simplify the presentation, in this section, we only report models predicting citation as a function of the rate of informational and involved feature utilization (i.e., we do not report the results of models that use the Involved-Informational Ratio as a predictor). We found substantially similar results, however, using the ratio measure.

Overall, we observe significant gender hom*ophily in citation patterns. Papers written by female authors receive a greater share of citations from papers with a female first author (β=4.33,p<0.001formulae-sequence𝛽4.33𝑝0.001\beta=4.33,p<0.001italic_β = 4.33 , italic_p < 0.001) and papers with a female last author (β=3.57;p<0.001formulae-sequence𝛽3.57𝑝0.001\beta=3.57;p<0.001italic_β = 3.57 ; italic_p < 0.001). The opposite pattern holds for sample papers written by men.

Taking into account the linguistic features, we see a consistent and statistically significant pattern across the outcome measures: papers with higher rates of utilization of involved features in their abstracts tend to be cited at a higher rate by papers with a female first author (β=0.25,p<0.001formulae-sequence𝛽0.25𝑝0.001\beta=0.25,p<0.001italic_β = 0.25 , italic_p < 0.001, Model 1) and papers with a female last author (β=0.29,p<0.001formulae-sequence𝛽0.29𝑝0.001\beta=0.29,p<0.001italic_β = 0.29 , italic_p < 0.001, Model 3). More substantively, we can interpret the coefficient estimates as indicating that for each unit increase in the rate of involved features used, the rate of citation by papers with a female first and last author is predicted to increase by 0.25 and 0.29, respectively. When we consider the use of informational features, however, the pattern flips (β=0.08,p<0.001formulae-sequence𝛽0.08𝑝0.001\beta=-0.08,p<0.001italic_β = - 0.08 , italic_p < 0.001, Model 1 for citation by papers with a female first author; β=0.06,p<0.001formulae-sequence𝛽0.06𝑝0.001\beta=-0.06,p<0.001italic_β = - 0.06 , italic_p < 0.001, Model 3 for citation by papers with a female first author). Papers with higher rates of utilization of involved features are cited at a lower rate by papers with a male first author (β=0.18,p<0.001formulae-sequence𝛽0.18𝑝0.001\beta=-0.18,p<0.001italic_β = - 0.18 , italic_p < 0.001, Model 2) and papers with a male last author (β=0.30,p<0.001formulae-sequence𝛽0.30𝑝0.001\beta=-0.30,p<0.001italic_β = - 0.30 , italic_p < 0.001, Model 4). We find that the opposite holds for utilization of informational features, although only in models predicting citation by papers with a male first author (β=0.03,p<0.01formulae-sequence𝛽0.03𝑝0.01\beta=0.03,p<0.01italic_β = 0.03 , italic_p < 0.01, Model 2).

Turning to patents, we find results that are generally similar, but much weaker. While patents that use more informational features are less likely to be cited by patents with a female first (β=0.07,p<0.001formulae-sequence𝛽0.07𝑝0.001\beta=-0.07,p<0.001italic_β = - 0.07 , italic_p < 0.001, Model 1) or last (β=0.07,p<0.001formulae-sequence𝛽0.07𝑝0.001\beta=-0.07,p<0.001italic_β = - 0.07 , italic_p < 0.001, Model 3) inventor, none of the remaining coefficients tracking the utilization of informational or involved features are statistically significant. These results are not surprising, considering that patent descriptions, as a type of text, allow much less stylistic expression than academic papers do.

//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (3)
(1)(2)(3)(4)
DV: Rate of citationsfrom papers witha female first authorDV: Rate of citationsfrom papers witha male first authorDV: Rate of citationsfrom papers witha female last authorDV: Rate of citationsfrom papers witha male last author
Involved rate0.25***-0.18***0.29***-0.30***
(0.02)(0.02)(0.02)(0.02)
Informational rate-0.08***0.03**-0.06***-0.01
(0.01)(0.01)(0.01)(0.01)
Author is female (1 = Yes)4.33***-4.33***3.57***-3.64***
(0.07)(0.08)(0.07)(0.08)
Paper is cited 0 times (1 = Yes)-29.11***-51.19***-25.80***-55.35***
(0.08)(0.08)(0.08)(0.08)
Constant27.16***53.38***23.12***59.00***
(0.20)(0.22)(0.19)(0.22)
Field fixed effectsYesYesYesYes
Year fixed effectsYesYesYesYes
N512520512520512520512520
r20.30840.47200.27480.5244
  • Notes: Robust standard errors are shown in parentheses; p-values correspond to two-tailed tests.

  • *p<0.1; **p<0.05; ***p<0.01

(1)(2)(3)(4)
DV: Rate of citationsfrom patents witha female first inventorDV: Rate of citationsfrom patents witha male first inventorDV: Rate of citationsfrom patents witha female last inventorDV: Rate of citationsfrom patents witha male last inventor
Involved rate0.020.20+0.040.12
(0.05)(0.11)(0.05)(0.11)
Informational rate-0.07***0.04-0.07***0.04
(0.02)(0.04)(0.02)(0.04)
Inventor is female (1 = Yes)2.78***-3.09***2.75***-3.20***
(0.16)(0.34)(0.16)(0.34)
Lawyer is female (1 = Yes)0.310.210.030.50
(0.33)(0.69)(0.33)(0.68)
Patent is cited 0 times (1 = Yes)-5.02***-41.99***-5.52***-41.42***
(0.13)(0.29)(0.14)(0.29)
Constant5.13***42.39***5.41***42.25***
(0.48)(1.09)(0.50)(1.08)
Field fixed effectsYesYesYesYes
Year fixed effectsYesYesYesYes
N32106321063210632106
r20.05890.29630.05750.2964
  • Notes: Robust standard errors are shown in parentheses; p-values correspond to two-tailed tests.

  • *p<0.1; **p<0.05; ***p<0.01

3 Discussion

The norm of universalism in science and technology stipulates that the evaluation of scientific and technological contributions should be based on impersonal criteria (Merton and Storer,, 1973). Nevertheless, growing evidence suggests that gender gaps in scholarly assessment remain both prevalent and widespread (Moss-Racusin etal.,, 2012; Reuben etal.,, 2014; Oliveira etal.,, 2019; Card etal., 2020a, ). In this study, we examined variation in writing style by gender in scientific and technical texts, and considered the role this variation plays in the citation patterns. To do so, we developed an approach motivated by a larger linguistic framework, examining differences between “involved” and “informational” writing styles.

Based on our sample of single-authored academic papers and patents, we find differences in the writing styles of female and male authors, with female writers using involved features more frequently in their work (features associated with more affective, interactional content), a pattern that holds across all scientific fields and patent subcategories. These differences are greater in the Social Sciences and Arts & Humanities and less pronounced in Physical Sciences, likely because the former fields allow for more stylistic expression. In the patent data, we find that the gender of the lawyer is more predictive of this pattern, suggesting that lawyers have a greater influence on the editing of the patent description. The same pattern holds in the citation analysis, where papers with higher rates of utilization of involved features in their abstracts tend to be cited at a higher rate by papers with a female first author and papers with a female last author.

Our work is not without limitations. Importantly, our study inferred the gender of authors and inventors based on their names, which offers only a rough proxy. Moreover, our gender assignments were binary, corresponding to sex-based categories (female and male), which oversimplifies the true diversity of gender identities among authors and inventors. We also limited our attention to single-authored papers and patents and patents with a single lawyer. While this approach helps to make the interpretation of our results clearer, more work needs to be done to determine whether and how our findings apply to teams. Further work on analyzing writing styles of mixed-gender teams could help provide a more nuanced understanding of the observed pattern. Finally, our study is based on observational data, and therefore our results should not be interpreted as causal. As such, future behavioral studies, involving experimental assessment of the response of reviewers to highly involved and highly informational texts based on the reviewer’s gender, would complement our corpus study of academic papers and patents.

Notwithstanding these limitations, our study makes several contributions. First, our study adds to the Science of Science. Specifically, we propose an analysis with a set of linguistic features which is motivated by a an involved/informational framework, established in the linguistic research as indicative of differences in male and female communication styles, which allows us to take a systematic view on understanding gender differences in scientific and technical writing.

Second, our study adds new insights to the linguistics literature. Specifically, we use a larger sample of formal written text and utilize a new source of data (patent descriptions) which to our knowledge has not been used for the purposes of the analysis of gender differences in writing before. We also released an open-source Python package to compute the linguistic features used in this paper for future use of researchers, available here: https://pypi.org/project/stylometer.

Finally, our study also has several natural policy implications. In particular, our findings suggest that writing in science and technology is not completely devoid of personal markers and, therefore, could constitute a potential source of bias during the evaluation processes. However, establishing rigid guidelines for communication styles in science and technology may not be the right solution. Instead, we suggest that in order to mitigate bias in evaluation processes, the diversity among the contributors should be matched by the same diversity among the evaluators. For example, previous researchers noted the need to increase the number of female reviewers (Kolev etal.,, 2020), as well as to encourage citation diversity (Zurn etal.,, 2020) and continue raising awareness of language bias in established institutions, structures, and processes, including academia (Clements and Petray,, 2021). Such measures could help to uphold the norm of universalism in scientific contributions.

4 Materials and Methods

4.1 Framework

We draw on a previously established linguistic framework, known as “Informational versus Involved Production,” outlined by Biber, (1988), which characterizes the co-occurrence of certain linguistic features based on their communicative function. While the linguistics community has developed a number of potentially applicable frameworks, we chose Biber’s because it was created using a diverse set of corpora, including non-fiction, and was intended to capture universal features of English language; therefore, we anticipate that it would likely capture useful variation across a broad range of scientific and technological fields, encompassing potentially diverse norms of composition. We specifically chose the “informational”/“involved” dimension of Biber’s framework, as studies note (Argamon etal.,, 2003) that men tend to focus more on relaying factual information (“informational”), while women tend to focus on building a relationship with an audience in their communications (“involved”). Based on a corpus of e-mail communication, speech, and blogs, Pennebaker, (2011) stipulated that “males categorize their worlds by counting, naming, and organizing the objects they confront” (informational aspect), while women “personalize their topics” (involved aspect).

Biber outlines two factors that influence the choice of individuals to use a specific linguistic mode: the production circ*mstances of real-time constraints (speech or conversation) versus editing possibilities (academic writing), and the primary purpose of the writer (informational versus involved). With respect to the production circ*mstances, given the type of corpus that we are exploring in this study (formal written text), we can hold these constant, as both papers and patents are produced in settings where editing is possible and likely (i.e., the authors are under no real-time constraints, unlike the case of a conversation). With respect to the primary purpose of the writer (informational versus involved), Argamon etal., (2003) noted that formal written text is intended for a “broad unseen audience.” Biber etal., (1998) found that letters written by women and addressed to women were more involved and letters written by men and addressed to men were less involved. He also found that, overall, letters written by women are more involved than letters written by men. Even though in formal written text there is no specific addressee, and academic papers and patent descriptions are intended for a “broad unseen audience”, our goal is to understand if we are still able to observe the variation by the gender of the writers.

In addition to Biber’s framework , the specific set of “involved” and “informational” features used in our study was informed by the follow-up work of Pennebaker, (2011) and Argamon etal., (2003). We chose three distinctive features for “involved” and “informational” writing modes (Figure 1). Pronouns, questions, and non-phrasal coordination (“and” connector) are highly correlated as linguistic features and are reflective of the “involved” writing style. The use of pronouns indicates interaction between the writer and the reader—the writer assumes that the reader knows what they are referring to with a pronoun, as the reader is following the writer’s story in a text; pronouns “present things in a relational way” (Argamon etal.,, 2003, p. 7). We consider questions to be an important involved feature as well, as they point to the presence of a hypothetical addressee in a formal written text, and are therefore a marker of interaction (Biber,, 1986; Marckworth and Baker,, 1974). Non-phrasal coordination (“and”) is used “to string clauses together in a loose, logically unspecified manner, instead of integrating the information into fewer units” (Biber,, 1988, p. 106), and thus is more associated with a narrative type of communication (“involved” aspect). For “informational” writing mode indicators, we use past tense, determiners, and cardinal numbers. We use past tense as an “informational” feature, to indicate a “disconnection” with the audience, absence of interaction in the narrative. The other two “informational” features—determiners and cardinal numbers—are taken from Argamon etal., (2003) who used the Balanced Winnow algorithm to identify features that are strong indicators of either male or female writing. Argamon and colleagues note that, based on their examination of texts, “the use of determiners reflects that male writers are mentioning classes of things, in contrast to female writers, who are personalizing their messages” (Argamon etal.,, 2003, p. 12). The greater use of cardinal numbers (quantification) is also noted as a strong male indicator in both the work of Pennebaker, (2011) and Argamon etal., (2003).

4.2 Corpus

As noted previously, we draw on two different corpora: (1) abstracts of academic papers from Web of Science555Web of Science: https://clarivate.com/webofsciencegroup/solutions/web-of-science/ (“WoS data”) and (2) abstracts of patents from PatentsView (“PatentsView data”)666PatentsView: http://www.patentsview.org/. Both databases have been widely used in prior research in the Science of Science, and provide generally representative coverage of scientific papers and patented inventions, respectively (Funk and Owen-Smith,, 2017; Birkle etal.,, 2020; Jaffe and Trajtenberg,, 2002).

For the WoS data, we excluded document types that are not typically meant for the publication of scientific findings (e.g., book reviews, letters, music scores, and bibliographies). We further limited our attention to articles that included full-text abstracts, and that were written in English777The WoS data do not include accurate information on the language of paper abstracts. We identified English language abstracts using a commercial machine translation API.. For the PatentsView data, we limited our attention to utility patents, which encompass roughly 90% of the patents granted by the United States Patent and Trademark Office (USPTO), and have therefore been the central focus of prior work (Jaffe and Trajtenberg,, 2002). All patents granted by the USPTO are written in English, and more than 90% (going back to 1976) include full-text abstracts.

For the purposes of our study, we further narrow our focus to solo-authored papers and patents (i.e., we exclude papers and patents written by teams). Our motivation for doing so was to allow for a clearer identification of the relationship between gender and writing style, which would be more complex in the case of teams, particularly those of mixed gender. Note that in the patent data, we also narrowed our sample to include only those inventors with a single lawyer, in order to more cleanly control for gender. In addition, we require that abstracts have more than 100 words, in order to normalize frequency counts per 100 words, which is consistent with prior research in linguistics (Biber,, 1988). Additional details on the procedure used to narrow down our samples from the population of articles and patents, including the number of observations lost at each stage, are given in Figure S6 in the Supplementary Materials.

Women are underrepresented in the population of both authors and inventors. For perspective, in our data, roughly 19.6% of authors and 5.4% of inventors are women.888This is an estimate for academic papers with one author and for patents with one inventor, where the gender of the author/inventor was known/assigned. Moreover, men and women tend to publish in different areas, and their relative representation changes over time. Therefore, to help facilitate comparisons, we subset the WoS and PatentsView data by building a matched sample of papers and patents written by male and female authors and inventors. To do so, we identified, for each academic paper written by a single female author in a particular WoS subject area (e.g.,“Sociology”, “Cell Biology”, “Mathematics”) and in a particular year, a corresponding academic paper written by a single male author in the same subject area and year. We follow a similar procedure for patents, using National Bureau of Economic Research (NBER) patent subcategories to proxy for the field of technology, and the grant year of the patent as the year of publication.

For the WoS data, we subset the corpus to years starting in 1991, as abstracts are not well represented in earlier periods (Figure 5). After matching, most of the abstracts in our final analytical sample are from the Social Sciences, followed by Life Sciences and Biomedicine. (Figure S7 in the Supplementary Materials shows the distribution of fields in the original Web of Science corpus.) This is consistent with prior work, as women are well represented in the social sciences and biomedical domains (Ceci etal.,, 2014; Su and Rounds,, 2015). For the PatentsView data, our analysis begins in 1976, which is the first year in which machine readable patent records are available from the USPTO. After matching, most patents come from the “Others” NBER category, which includes “Miscellaneous” patent types. (Figure S8 in the Supplementary Materials shows the distribution of categories in the original PatentsView corpus.)

Gender is assigned using author and inventor names. For papers, author gender is coded based on first and (when available) middle names, as reported in the Web of Science database, using the genderize.io application programming interface (API). For patents, we assign inventor gender by using a coding provided by the United States Patent and Trademark Office, as included with the PatentsView data. More details about the gender assignment can be found in the Supplementary Text section of Supplementary Materials.

4.3 Linguistic Measures

For each paper and patent in our sample, we constructed a series of measures tracking the utilization of informational and involved features. We use pronouns, questions, and non-phrasal coordination (“and” connector) as indicators of the “involved” style of writing, and past tense, determiners, and cardinal numbers as indicators of “informational” style of writing. Figure 1 gives an overview of the specific features of interest; more details about the computation of these features are included in the Table S7 in the Supplementary Materials.999We also released an open-source Python package to compute these linguistic features, available here: https://pypi.org/project/stylometer.

We define three separate measures: involved rate, informational rate, and Involved-Informational Ratio. Because our corpus includes academic writing and invention descriptions, there is already a high utilization of informational features by both female and male authors; therefore, in addition to involved and informational rates, we are also interested in seeing the relative rate at which writers use involved features in relation to informational features (Involved-Informational Ratio). We define the involved and informational rates as the sum of three involved (or informational) linguistic features divided by the total number of tokens in a given abstract, and normalized to units of 100 words (Biber,, 1988). We define the Involved-Informational Ratio as the involved rate divided by the informational rate.

Involved Rate=Npron+Nand+NqNtokens×100Involved Ratesubscript𝑁𝑝𝑟𝑜𝑛subscript𝑁𝑎𝑛𝑑subscript𝑁𝑞subscript𝑁𝑡𝑜𝑘𝑒𝑛𝑠100\textrm{Involved Rate}=\frac{N_{pron}+N_{and}+N_{q}}{N_{tokens}}\times 100Involved Rate = divide start_ARG italic_N start_POSTSUBSCRIPT italic_p italic_r italic_o italic_n end_POSTSUBSCRIPT + italic_N start_POSTSUBSCRIPT italic_a italic_n italic_d end_POSTSUBSCRIPT + italic_N start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT end_ARG start_ARG italic_N start_POSTSUBSCRIPT italic_t italic_o italic_k italic_e italic_n italic_s end_POSTSUBSCRIPT end_ARG × 100(1)
Informational Rate=Ndet+Npast+NnumNtokens×100Informational Ratesubscript𝑁𝑑𝑒𝑡subscript𝑁𝑝𝑎𝑠𝑡subscript𝑁𝑛𝑢𝑚subscript𝑁𝑡𝑜𝑘𝑒𝑛𝑠100\textrm{Informational Rate}=\frac{N_{det}+N_{past}+N_{num}}{N_{tokens}}\times 100Informational Rate = divide start_ARG italic_N start_POSTSUBSCRIPT italic_d italic_e italic_t end_POSTSUBSCRIPT + italic_N start_POSTSUBSCRIPT italic_p italic_a italic_s italic_t end_POSTSUBSCRIPT + italic_N start_POSTSUBSCRIPT italic_n italic_u italic_m end_POSTSUBSCRIPT end_ARG start_ARG italic_N start_POSTSUBSCRIPT italic_t italic_o italic_k italic_e italic_n italic_s end_POSTSUBSCRIPT end_ARG × 100(2)
Inv.-Inf. Rate=Involved RateInformational RateInv.-Inf. RateInvolved RateInformational Rate\textrm{Inv.-Inf. Rate}=\frac{\textrm{Involved Rate}}{\textrm{Informational %Rate}}Inv.-Inf. Rate = divide start_ARG Involved Rate end_ARG start_ARG Informational Rate end_ARG(3)

To better understand the relationship between author gender and writing style, we estimated a series of linear regression models, predicting the utilization of informational and involved features in paper and patent abstracts. For models of both papers and patents, predictor variables included the gender of the author (or inventor), the year of academic paper (or patent grant), and the principal scientific (or technological) field. For models of patents, we also included as a predictor the gender of the lawyer, as patent attorneys often take part in writing and editing the patent text.

//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (4)

4.4 Citation measures

The possibility that men and women write in systematically different ways naturally raises questions about the relationship between writing styles and the reception of intellectual products. Specifically, differences in how men and women present their research in written text may help account for some of the disparities in rates of citation documented in prior work (Caplar etal.,, 2017; Potthoff and Zimmermann,, 2017; Dion etal.,, 2018; Dworkin etal.,, 2020; Zurn etal.,, 2020).

To evaluate this possibility, we conducted a series of analyses in which we decomposed the number of citations made by future papers and patents to those in our sample, according to the gender of the citing author or inventor. We emphasize that citation is merely a proxy for (unobservable) reading preferences—authors may cite prior work without reading it; scientists and inventors probably only cite a fraction of the prior work that they do read; and citation need not correspond to a favorable reading of the cited work. Keeping those caveats in mind, for each sample paper and patent, we computed two related measures: (1) citations from papers with a female (male) first author, (2) citations from papers with a female (male) last author. For each measure, we excluded self-citations, i.e., those made by the author of the sample paper to his or her own work. In addition, to facilitate interpretation, we transform each measure into a rate per 100 citations. Subsequently, we estimated a series of regression models, wherein the three measures are our outcomes of interest, and the predictors are the gender of the author or inventor (and lawyer for patents) and the ratio of informational and involved features. For papers and patents cited 0 times, we impute a rate of 0, and add an imputation indicator as a covariate. As with our prior regression, we control for the field and year of academic paper/patent.

5 Acknowledgements

We thank Dr. Rebecca Levitan and Dr. Martin Chodorow for guidance and feedback for this work.

6 Funding

National Science Foundation grants 1829168 (RJF) and 1932596 (RJF)

7 Competing interests

The authors declare that they have no competing interests.

8 Data and materials availability

The Python 3, MySQL 8, RStudio1.2, and Stata 16 code used to analyze and visualize the data for the current study will be available from the corresponding author. PatentsView data used in the study are publicly available directly from the publisher. Data from the Web of Science are available from Clarivate Analytics, but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. These data are however still available from the authors upon reasonable request and with permission from the publisher.

References

  • Argamon etal., (2003)Argamon, S., Fine, J., and Shimoni, A. (2003).Gender, genre, and writing style in formal written texts.Text, 23.
  • Biber, (1986)Biber, D. (1986).Spoken and written textual dimensions in english: Resolving the contradictory findings.Language, 62(2):384–414.
  • Biber, (1988)Biber, D. (1988).Variation across Speech and Writing.Cambridge University Press.
  • Biber etal., (1998)Biber, D., Conrad, S., and Reppen, R. (1998).Corpus Linguistics Investigating Language Structure and Use.Cambridge University Press.
  • Birkle etal., (2020)Birkle, C., Pendlebury, D.A., Schnell, J., and Adams, J. (2020).Web of Science as a data source for research on scientific and scholarly activity.Quantitative Science Studies, 1(1):363–376.
  • Bornmann, (2011)Bornmann, L. (2011).Scientific peer review.Annual Review of Information Science and Technology, 45(1):197–245.
  • Caplar etal., (2017)Caplar, N., Tacchella, S., and Birrer, S. (2017).Quantitative evaluation of gender bias in astronomical publications from citation counts.Nature Astronomy, 1(6).
  • (8)Card, D., DellaVigna, S., Funk, P., and Iriberri, N. (2020a).Are Referees and Editors in Economics Gender Neutral?*.The Quarterly Journal of Economics, 135(1):269–327.
  • (9)Card, D., DellaVigna, S., Funk, P., and Iriberri, N. (2020b).Gender neutrality in economics: The role of editors and referees.Technical Report 28942, VOX, CEPR Policy Portal.
  • Card etal., (2021)Card, D., DellaVigna, S., Funk, P., and Iriberri, N. (2021).Gender differences in peer recognition by economists.Working Paper 28942, National Bureau of Economic Research.
  • Ceci etal., (2014)Ceci, S.J., Ginther, D.K., Kahn, S., and Williams, W.M. (2014).Women in academic science: A changing landscape.Psychological Science in the Public Interest, 15(3):75–141.PMID: 26172066.
  • Clements and Petray, (2021)Clements, G. and Petray, M., editors (2021).Linguistic Discrimination in U.S Higher Education: Power, Prejudice, Impacts, and Remedies.Routledge, 1st edition.
  • Cole, (1992)Cole, S. (1992).Making Science: Between Nature and Society.Harvard University Press.
  • Dion etal., (2018)Dion, M.L., Sumner, J.L., and Mitchell, S.M. (2018).Gendered citation patterns across political science and social science methodology fields.Political Analysis, 26(3):312–327.
  • Dondio etal., (2019)Dondio, P., Casnici, N., Grimaldo, F., Gilbert, N., and Squazzoni, F. (2019).The “invisible hand” of peer review: The implications of author-referee networks on peer review in a scholarly journal.Journal of Informetrics, 13(2):708–716.
  • Dworkin etal., (2020)Dworkin, J.D., Linn, K.A., Teich, E.G., Zurn, P., Shinohara, R.T., and Bassett, D.S. (2020).The extent and drivers of gender imbalance in neuroscience reference lists.bioRxiv.
  • Fortunato etal., (2018)Fortunato, S., Bergstrom, C., Borner, K., Evans, J., Helbing, D., Milojevic, S., Petersen, A., Radicchi, F., Sinatra, R., Uzzi, B., Vespignani, A., Waltman, L., Wang, D., and Barabasi, A.-L. (2018).Science of science.Science, 359:eaao0185.
  • Funk and Owen-Smith, (2017)Funk, R.J. and Owen-Smith, J. (2017).A dynamic network measure of technological change.Management Science, 63(3):791–817.
  • Garimella and Mihalcea, (2016)Garimella, A. and Mihalcea, R. (2016).Zooming in on gender differences in social media.In Proceedings of the Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media (PEOPLES), pages 1–10, Osaka, Japan. The COLING 2016 Organizing Committee.
  • Ghiasi etal., (2018)Ghiasi, G., Mongeon, P., Sugimoto, C.R., and Larivière, V. (2018).Gender hom*ophily in citations.In STI 2018 Conference Proceedings. Centre for Science and Technology Studies (CWTS).
  • Hengel, (2017)Hengel, E. (2017).Publishing while Female. Are women held to higher standards? Evidence from peer review.Cambridge Working Papers in Economics 1753, Faculty of Economics, University of Cambridge.
  • Hengel and Moon, (2020)Hengel, E. and Moon, E. (2020).Gender and equality at top economics journals.
  • Honnibal and Montani, (2017)Honnibal, M. and Montani, I. (2017).spaCy 2: Natural language understanding with Bloom embeddings, convolutional neural networks and incremental parsing.
  • Hospido and Sanz, (2019)Hospido, L. and Sanz, C. (2019).Gender gaps in the evaluation of research: Evidence from submissions to economics conferences.Labor: Demographics & Economics of the Family eJournal.
  • Jaffe and Trajtenberg, (2002)Jaffe, A.B. and Trajtenberg, M. (2002).Patents, Citations, and Innovations: A Window on the Knowledge Economy.The MIT Press.
  • Jensen etal., (2018)Jensen, K., Kovacs, B., and Sorenson, O. (2018).Gender differences in obtaining and maintaining patent rights.Nature Biotechnology, 36:307–309.
  • Joshi, (2014)Joshi, A. (2014).By whom and when is women’s expertise recognized? the interactive effects of gender and education in science and engineering teams.Administrative Science Quarterly, 59(2):202–239.
  • Key and Sumner, (2019)Key, E.M. and Sumner, J.L. (2019).You research like a girl: Gendered research agendas and their implications.PS: Political Science & Politics, 52(4):663–668.
  • Kim etal., (2022)Kim, L., Smith, D.S., Hofstra, B., and McFarland, D.A. (2022).Gendered knowledge in fields and academic careers.Research Policy, 51(1):104411.
  • King etal., (2017)King, M.M., Bergstrom, C.T., Correll, S.J., Jacquet, J., and West, J.D. (2017).Men set their own cites high: Gender and self-citation across fields and over time.Socius, 3:2378023117738903.
  • Kolev etal., (2020)Kolev, J., Fuentes-Medel, Y., and Murray, F. (2020).Gender differences in scientific communication and their impact on grant funding decisions.AEA Papers and Proceedings, 110:245–49.
  • Lane etal., (2021)Lane, J.N., Teplitskiy, M., Gray, G., Ranu, H., Menietti, M., Guinan, E.C., and Lakhani, K.R. (2021).Conservatism gets funded? a field experiment on the role of negative information in novel project evaluation.Management Science, 0(0):null.
  • Larivière etal., (2013)Larivière, V., Ni, C., Gingras, Y., Cronin, B., and Sugimoto, C.R. (2013).Bibliometrics: Global gender disparities in science.Nature, 504(7479):211–213.
  • Leahey, (2007)Leahey, E. (2007).Not by productivity alone: How visibility and specialization contribute to academic earnings.American Sociological Review, 72(4):533–561.
  • Lerchenmueller etal., (2019)Lerchenmueller, M.J., Sorenson, O., and Jena, A.B. (2019).Gender differences in how scientists present the importance of their research: observational study.BMJ, 367.
  • Long and Fox, (1995)Long, J.S. and Fox, M.F. (1995).Scientific careers: Universalism and particularism.Annual Review of Sociology, 21(1):45–71.
  • Mahajan etal., (2020)Mahajan, U.V., Wadhwa, H., Fatemi, P., Xu, S., Shan, J., Benzil, D.L., and Zygourakis, C.C. (2020).Does double-blind peer review impact gender authorship trends? an evaluation of two leading neurosurgical journals from 2010 to 2019.Journal of Neurosurgery, 135(2):352–360.
  • Marckworth and Baker, (1974)Marckworth, M.L. and Baker, W.J. (1974).A discriminant function analysis of co-variation of a number of syntactic devices in five prose genres.American Journal of Computational Linguistics.Microfiche 11.
  • Merton and Storer, (1973)Merton, R. and Storer, N. (1973).The sociology of science : theoretical and empirical investigations.Chicago: University of Chicago Press.
  • Moss-Racusin etal., (2012)Moss-Racusin, C.A., Dovidio, J.F., Brescoll, V.L., Graham, M.J., and Handelsman, J. (2012).Science faculty’s subtle gender biases favor male students.Proceedings of the National Academy of Sciences, 109(41):16474–16479.
  • Mulac etal., (2006)Mulac, A., Bradac, J.J., and Gibbons, P. (2006).Empirical Support for the Gender-as-Culture Hypothesis: An Intercultural Analysis of Male/Female Language Differences.Human Communication Research, 27(1):121–152.
  • Mulkay, (1976)Mulkay, M.J. (1976).Norms and ideology in science.Social Science Information, 15(4-5):637–656.
  • Ni etal., (2021)Ni, C., Smith, E., Yuan, H., Larivière, V., and Sugimoto, C.R. (2021).The gendered nature of authorship.Science Advances, 7(36):eabe4639.
  • Oliveira etal., (2019)Oliveira, D., Ma, Y., Woodruff, T., and Uzzi, B. (2019).Comparison of national institutes of health grant amounts to first-time male and female principal investigators.JAMA - Journal of the American Medical Association, 321(9):898–900.
  • Pennebaker, (2011)Pennebaker, J. (2011).The Secret Life of Pronouns: What Our Words Say About Us.Bloomsbury USA.
  • Potthoff and Zimmermann, (2017)Potthoff, M. and Zimmermann, F. (2017).Is there a gender-based fragmentation of communication science? an investigation of the reasons for the apparent gender hom*ophily in citations.Scientometrics, 112(2):1047–1063.
  • Reuben etal., (2014)Reuben, E., Sapienza, P., and Zingales, L. (2014).How stereotypes impair women’s careers in science.Proceedings of the National Academy of Sciences, 111(12):4403–4408.
  • Sarsons, (2017)Sarsons, H. (2017).Recognition for group work: Gender differences in academia.American Economic Review, 107(5):141–45.
  • Su and Rounds, (2015)Su, R. and Rounds, J. (2015).All stem fields are not created equal: People and things interests explain gender disparities across stem fields.Frontiers in Psychology, 6:189.
  • Teplitskiy etal., (2018)Teplitskiy, M., Acuna, D., Elamrani-Raoult, A., Körding, K., and Evans, J. (2018).The sociology of scientific validity: How professional networks shape judgement in peer review.Research Policy, 47(9):1825–1841.
  • Thomson etal., (2001)Thomson, R., Murachver, T., and Green, J. (2001).Where is the gender in gendered language?Psychological Science, 12(2):171–175.PMID: 11340928.
  • Toole etal., (2021)Toole, A.A., Saksena, M.J., Degrazia, C. A.W., Black, K.P., Lissoni, F., Miguelez, E., and Tarasconi, G. (2021).Progress and Potential: 2020 update on U.S. women inventor-patentees.Working Papers hal-03098153, HAL.
  • vander Lee and Ellemers, (2015)vander Lee, R. and Ellemers, N. (2015).Gender contributes to personal research funding success in the netherlands.Proceedings of the National Academy of Sciences, 112(40):12349–12353.
  • Vásárhelyi etal., (2021)Vásárhelyi, O., Zakhlebin, I., Milojević, S., and Ágnes Horvát, E. (2021).Gender inequities in the online dissemination of scholars work.Proceedings of the National Academy of Sciences, 118(39):e2102945118.
  • Weischedel etal., (2013)Weischedel, R., Palmer, M., Marcus, M., Hovy, E., Pradhan, S., Ramshaw, L., Xue, N., Taylor, A., Kaufman, J., Franchini, M., El-Bachouti, M., Belvin, R., and Houston, A. (2013).OntoNotes Release 5.0 LDC2013T19. Web Download. Philadelphia: Linguistic Data Consortium.
  • Zurn etal., (2020)Zurn, P., Bassett, D.S., and Rust, N.C. (2020).The citation diversity statement: A practice of transparency, a way of life.Trends in Cognitive Sciences, 24(9):669–672.

Supplementary Materials

Gender coding

In this appendix, we describe our procedure for coding the gender of paper authors and patent inventors and lawyers. For papers, we coded author gender based on author first and (when available) middle names, as reported in Web of Science, using the genderize.io application programming interface (API). While similar approaches have been widely used in prior work, we recognize that author names offer only an imperfect proxy for gender, and this limitation should be kept in mind when interpreting the results of our study. Given a name, the genderize.io API returns the most likely gender and an associated probability score. The genderize.io API makes gender suggestions based on worldwide data from hundreds of millions of social media profiles; as such, the service makes a significant improvement over prior approaches, which are typically based on records from the United States Social Security Administration or similar government entities, and that therefore perform poorly on international samples. For our study, we used a probability cutoff of 0.9 for assigning gender; gender suggestions below that level were set to “unknown” and excluded from the study. When possible, we assigned gender by looking across the academic papers of an author. For example, if the same author had published one paper under the name “J. Smith” and the other under the name “John Smith”, we would assign the gender of the author as “male” in both cases, using the name from the latter paper to propagate a gender to the second.

For patents, we assign inventor gender by using the coding provided by the United States Patent and Trademark Office, via its Patents View database. Similar to our approach for papers, the Patents View database assigns gender based on inventor names. Details on the algorithm are given in (Toole etal.,, 2021). Patents View does not assign gender to patent attorneys. Therefore, we assigned gender ourselves, using an approach analogous to the one described above for papers.

//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (5)
//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (6)
//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (7)
//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (8)
TagSourceDescriptionFeatureDimension
CDspaCyCardinal numberNumberInformational
PDTspaCyPredeterminer (e.g. both, etc.)DeterminerInformational
WDTWh-determiner (e.g. which, etc.)
DTDeterminer
VBDspaCyVerb, past tensePast tenseInformational
VBNVerb, past participle
PRPspaCyPersonal pronounPronounInvolved
PRP$Possessive pronoun
WPWh-pronoun
WP$Possessive wh-pronoun
ANDstring matchConnector “and”ConnectorInvolved
?string matchQuestion markQuestionInvolved

Physical Sciences
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Author is female (1 = Yes)0.0049superscript0.0049absent0.0049^{***}0.0049 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.0046superscript0.0046absent0.0046^{***}0.0046 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1170superscript0.1170absent0.1170^{***}0.1170 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1136superscript0.1136absent0.1136^{***}0.1136 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2085superscript0.2085absent0.2085^{***}0.2085 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2145superscript0.2145absent0.2145^{***}0.2145 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0012)0.0012(0.0012)( 0.0012 )(0.0011)0.0011(0.0011)( 0.0011 )(0.0136)0.0136(0.0136)( 0.0136 )(0.0134)0.0134(0.0134)( 0.0134 )(0.0311)0.0311(0.0311)( 0.0311 )(0.0303)0.0303(0.0303)( 0.0303 )
Constant0.2314superscript0.2314absent0.2314^{***}0.2314 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1668superscript0.1668absent0.1668^{**}0.1668 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT3.6371superscript3.6371absent3.6371^{***}3.6371 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.2202superscript3.2202absent3.2202^{***}3.2202 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT17.1514superscript17.1514absent17.1514^{***}17.1514 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT19.0286superscript19.0286absent19.0286^{***}19.0286 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0008)0.0008(0.0008)( 0.0008 )(0.0540)0.0540(0.0540)( 0.0540 )(0.0096)0.0096(0.0096)( 0.0096 )(0.6325)0.6325(0.6325)( 0.6325 )(0.0220)0.0220(0.0220)( 0.0220 )(1.4339)1.4339(1.4339)( 1.4339 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N623586235862358623586235862358623586235862358623586235862358623586235862358623586235862358623586235862358623586235862358
r20.00030.00030.00030.00030.05440.05440.05440.05440.00120.00120.00120.00120.04840.04840.04840.04840.00070.00070.00070.00070.05760.05760.05760.0576
Technology
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Author is female (1 = Yes)0.0096superscript0.0096absent0.0096^{***}0.0096 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.0101superscript0.0101absent0.0101^{***}0.0101 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1404superscript0.1404absent0.1404^{***}0.1404 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1444superscript0.1444absent0.1444^{***}0.1444 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.01620.01620.01620.01620.00180.00180.00180.0018
(0.0016)0.0016(0.0016)( 0.0016 )(0.0015)0.0015(0.0015)( 0.0015 )(0.0179)0.0179(0.0179)( 0.0179 )(0.0169)0.0169(0.0169)( 0.0169 )(0.0384)0.0384(0.0384)( 0.0384 )(0.0371)0.0371(0.0371)( 0.0371 )
Constant0.2491superscript0.2491absent0.2491^{***}0.2491 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1602superscript0.1602absent0.1602^{***}0.1602 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.8901superscript3.8901absent3.8901^{***}3.8901 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.8286superscript2.8286absent2.8286^{***}2.8286 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT17.3202superscript17.3202absent17.3202^{***}17.3202 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT19.2278superscript19.2278absent19.2278^{***}19.2278 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0011)0.0011(0.0011)( 0.0011 )(0.0134)0.0134(0.0134)( 0.0134 )(0.0127)0.0127(0.0127)( 0.0127 )(0.1478)0.1478(0.1478)( 0.1478 )(0.0272)0.0272(0.0272)( 0.0272 )(0.3239)0.3239(0.3239)( 0.3239 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N436754367543675436754367543675436754367543675436754367543675436754367543675436754367543675436754367543675436754367543675
r20.00080.00080.00080.00080.11600.11600.11600.11600.00140.00140.00140.00140.11760.11760.11760.11760.00000.00000.00000.00000.08110.08110.08110.0811
Life Sciences & Biomedicine
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Author is female (1 = Yes)0.0185superscript0.0185absent0.0185^{***}0.0185 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.0186superscript0.0186absent0.0186^{***}0.0186 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2046superscript0.2046absent0.2046^{***}0.2046 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2052superscript0.2052absent0.2052^{***}0.2052 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1583superscript0.1583absent-0.1583^{***}- 0.1583 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1571superscript0.1571absent-0.1571^{***}- 0.1571 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0010)0.0010(0.0010)( 0.0010 )(0.0010)0.0010(0.0010)( 0.0010 )(0.0096)0.0096(0.0096)( 0.0096 )(0.0090)0.0090(0.0090)( 0.0090 )(0.0220)0.0220(0.0220)( 0.0220 )(0.0211)0.0211(0.0211)( 0.0211 )
Constant0.2893superscript0.2893absent0.2893^{***}0.2893 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1160superscript0.1160absent0.1160^{***}0.1160 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT4.1138superscript4.1138absent4.1138^{***}4.1138 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.2907superscript2.2907absent2.2907^{***}2.2907 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT15.9629superscript15.9629absent15.9629^{***}15.9629 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT19.3331superscript19.3331absent19.3331^{***}19.3331 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0007)0.0007(0.0007)( 0.0007 )(0.0134)0.0134(0.0134)( 0.0134 )(0.0068)0.0068(0.0068)( 0.0068 )(0.1259)0.1259(0.1259)( 0.1259 )(0.0156)0.0156(0.0156)( 0.0156 )(0.2943)0.2943(0.2943)( 0.2943 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541149541
r20.00230.00230.00230.00230.10070.10070.10070.10070.00300.00300.00300.00300.12300.12300.12300.12300.00030.00030.00030.00030.08920.08920.08920.0892
Arts & Humanities
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Author is female (1 = Yes)0.0275superscript0.0275absent0.0275^{***}0.0275 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.0276superscript0.0276absent0.0276^{***}0.0276 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2787superscript0.2787absent0.2787^{***}0.2787 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2779superscript0.2779absent0.2779^{***}0.2779 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.3846superscript0.3846absent-0.3846^{***}- 0.3846 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.3874superscript0.3874absent-0.3874^{***}- 0.3874 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0018)0.0018(0.0018)( 0.0018 )(0.0017)0.0017(0.0017)( 0.0017 )(0.0177)0.0177(0.0177)( 0.0177 )(0.0176)0.0176(0.0176)( 0.0176 )(0.0302)0.0302(0.0302)( 0.0302 )(0.0285)0.0285(0.0285)( 0.0285 )
Constant0.3860superscript0.3860absent0.3860^{***}0.3860 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.3551superscript0.3551absent0.3551^{***}0.3551 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT5.7840superscript5.7840absent5.7840^{***}5.7840 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT4.3604superscript4.3604absent4.3604^{***}4.3604 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT16.4391superscript16.4391absent16.4391^{***}16.4391 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT14.6273superscript14.6273absent14.6273^{***}14.6273 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0013)0.0013(0.0013)( 0.0013 )(0.0835)0.0835(0.0835)( 0.0835 )(0.0125)0.0125(0.0125)( 0.0125 )(0.8482)0.8482(0.8482)( 0.8482 )(0.0214)0.0214(0.0214)( 0.0214 )(1.3787)1.3787(1.3787)( 1.3787 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N643176431764317643176431764317643176431764317643176431764317643176431764317643176431764317643176431764317643176431764317
r20.00370.00370.00370.00370.05560.05560.05560.05560.00380.00380.00380.00380.02320.02320.02320.02320.00250.00250.00250.00250.11480.11480.11480.1148
Social Sciences
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Author is female (1 = Yes)0.0283superscript0.0283absent0.0283^{***}0.0283 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.0282superscript0.0282absent0.0282^{***}0.0282 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2959superscript0.2959absent0.2959^{***}0.2959 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2949superscript0.2949absent0.2949^{***}0.2949 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2911superscript0.2911absent-0.2911^{***}- 0.2911 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2899superscript0.2899absent-0.2899^{***}- 0.2899 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0010)0.0010(0.0010)( 0.0010 )(0.0010)0.0010(0.0010)( 0.0010 )(0.0097)0.0097(0.0097)( 0.0097 )(0.0095)0.0095(0.0095)( 0.0095 )(0.0173)0.0173(0.0173)( 0.0173 )(0.0169)0.0169(0.0169)( 0.0169 )
Constant0.3482superscript0.3482absent0.3482^{***}0.3482 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2189superscript0.2189absent0.2189^{***}0.2189 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT4.9205superscript4.9205absent4.9205^{***}4.9205 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.2420superscript3.2420absent3.2420^{***}3.2420 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT15.5635superscript15.5635absent15.5635^{***}15.5635 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT16.0783superscript16.0783absent16.0783^{***}16.0783 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0007)0.0007(0.0007)( 0.0007 )(0.0530)0.0530(0.0530)( 0.0530 )(0.0069)0.0069(0.0069)( 0.0069 )(0.5159)0.5159(0.5159)( 0.5159 )(0.0122)0.0122(0.0122)( 0.0122 )(0.9211)0.9211(0.9211)( 0.9211 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629192629
r20.00420.00420.00420.00420.04530.04530.04530.04530.00480.00480.00480.00480.05870.05870.05870.05870.00150.00150.00150.00150.04990.04990.04990.0499

Computers & Comm
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.00040.00040.00040.00040.00040.00040.00040.00040.05780.05780.05780.05780.05780.05780.05780.05780.3038superscript0.3038absent0.3038^{**}0.3038 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.3034superscript0.3034absent0.3034^{**}0.3034 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT
(0.0026)0.0026(0.0026)( 0.0026 )(0.0026)0.0026(0.0026)( 0.0026 )(0.0412)0.0412(0.0412)( 0.0412 )(0.0411)0.0411(0.0411)( 0.0411 )(0.1064)0.1064(0.1064)( 0.1064 )(0.1054)0.1054(0.1054)( 0.1054 )
Lawyer is female (1 = Yes)0.0102superscript0.01020.0102^{*}0.0102 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT0.00930.00930.00930.00930.03730.03730.03730.03730.03560.03560.03560.03560.7366superscript0.7366absent-0.7366^{***}- 0.7366 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.6249superscript0.6249absent-0.6249^{**}- 0.6249 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT
(0.0051)0.0051(0.0051)( 0.0051 )(0.0051)0.0051(0.0051)( 0.0051 )(0.0799)0.0799(0.0799)( 0.0799 )(0.0802)0.0802(0.0802)( 0.0802 )(0.2065)0.2065(0.2065)( 0.2065 )(0.2057)0.2057(0.2057)( 0.2057 )
Constant0.1363superscript0.1363absent0.1363^{***}0.1363 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1314superscript0.1314absent0.1314^{***}0.1314 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.6626superscript2.6626absent2.6626^{***}2.6626 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.6891superscript2.6891absent2.6891^{***}2.6891 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT20.9118superscript20.9118absent20.9118^{***}20.9118 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT20.9487superscript20.9487absent20.9487^{***}20.9487 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0019)0.0019(0.0019)( 0.0019 )(0.0178)0.0178(0.0178)( 0.0178 )(0.0296)0.0296(0.0296)( 0.0296 )(0.2812)0.2812(0.2812)( 0.2812 )(0.0766)0.0766(0.0766)( 0.0766 )(0.7211)0.7211(0.7211)( 0.7211 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N558055805580558055805580558055805580558055805580558055805580558055805580558055805580558055805580
r20.00070.00070.00070.00070.01560.01560.01560.01560.00040.00040.00040.00040.01210.01210.01210.01210.00370.00370.00370.00370.03110.03110.03110.0311
Electrical & Electronic
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.0076superscript0.0076absent0.0076^{**}0.0076 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.0076superscript0.0076absent0.0076^{**}0.0076 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.2715superscript0.2715absent0.2715^{***}0.2715 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2716superscript0.2716absent0.2716^{***}0.2716 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.8669superscript0.8669absent0.8669^{***}0.8669 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.8668superscript0.8668absent0.8668^{***}0.8668 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0026)0.0026(0.0026)( 0.0026 )(0.0026)0.0026(0.0026)( 0.0026 )(0.0436)0.0436(0.0436)( 0.0436 )(0.0435)0.0435(0.0435)( 0.0435 )(0.1176)0.1176(0.1176)( 0.1176 )(0.1156)0.1156(0.1156)( 0.1156 )
Lawyer is female (1 = Yes)0.00530.00530.00530.00530.00480.00480.00480.00480.01830.01830.01830.01830.00340.00340.00340.00340.6147superscript0.6147-0.6147^{*}- 0.6147 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT0.5997superscript0.5997-0.5997^{*}- 0.5997 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT
(0.0054)0.0054(0.0054)( 0.0054 )(0.0054)0.0054(0.0054)( 0.0054 )(0.0910)0.0910(0.0910)( 0.0910 )(0.0914)0.0914(0.0914)( 0.0914 )(0.2455)0.2455(0.2455)( 0.2455 )(0.2431)0.2431(0.2431)( 0.2431 )
Constant0.1340superscript0.1340absent0.1340^{***}0.1340 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1366superscript0.1366absent0.1366^{***}0.1366 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.7003superscript2.7003absent2.7003^{***}2.7003 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.9902superscript2.9902absent2.9902^{***}2.9902 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT21.2449superscript21.2449absent21.2449^{***}21.2449 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT22.2131superscript22.2131absent22.2131^{***}22.2131 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0018)0.0018(0.0018)( 0.0018 )(0.0120)0.0120(0.0120)( 0.0120 )(0.0312)0.0312(0.0312)( 0.0312 )(0.2046)0.2046(0.2046)( 0.2046 )(0.0843)0.0843(0.0843)( 0.0843 )(0.5441)0.5441(0.5441)( 0.5441 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N463246324632463246324632463246324632463246324632463246324632463246324632463246324632463246324632
r20.00210.00210.00210.00210.01860.01860.01860.01860.00830.00830.00830.00830.02350.02350.02350.02350.01280.01280.01280.01280.05580.05580.05580.0558
Mechanical
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.00120.0012-0.0012- 0.00120.00120.0012-0.0012- 0.00120.1130superscript0.11300.1130^{*}0.1130 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT0.1129superscript0.11290.1129^{*}0.1129 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT0.9326superscript0.9326absent0.9326^{***}0.9326 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.9312superscript0.9312absent0.9312^{***}0.9312 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0025)0.0025(0.0025)( 0.0025 )(0.0025)0.0025(0.0025)( 0.0025 )(0.0456)0.0456(0.0456)( 0.0456 )(0.0454)0.0454(0.0454)( 0.0454 )(0.1149)0.1149(0.1149)( 0.1149 )(0.1133)0.1133(0.1133)( 0.1133 )
Lawyer is female (1 = Yes)0.01040.01040.01040.01040.01000.01000.01000.01000.04770.04770.04770.04770.07110.07110.07110.07111.1050superscript1.1050absent-1.1050^{***}- 1.1050 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.8692superscript0.8692absent-0.8692^{***}- 0.8692 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0057)0.0057(0.0057)( 0.0057 )(0.0057)0.0057(0.0057)( 0.0057 )(0.1033)0.1033(0.1033)( 0.1033 )(0.1042)0.1042(0.1042)( 0.1042 )(0.2603)0.2603(0.2603)( 0.2603 )(0.2601)0.2601(0.2601)( 0.2601 )
Constant0.1445superscript0.1445absent0.1445^{***}0.1445 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1516superscript0.1516absent0.1516^{***}0.1516 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.9765superscript2.9765absent2.9765^{***}2.9765 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.2065superscript3.2065absent3.2065^{***}3.2065 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT21.5424superscript21.5424absent21.5424^{***}21.5424 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT22.2138superscript22.2138absent22.2138^{***}22.2138 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0018)0.0018(0.0018)( 0.0018 )(0.0100)0.0100(0.0100)( 0.0100 )(0.0326)0.0326(0.0326)( 0.0326 )(0.1816)0.1816(0.1816)( 0.1816 )(0.0822)0.0822(0.0822)( 0.0822 )(0.4531)0.4531(0.4531)( 0.4531 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N453645364536453645364536453645364536453645364536453645364536453645364536453645364536453645364536
r20.00080.00080.00080.00080.01290.01290.01290.01290.00140.00140.00140.00140.01950.01950.01950.01950.01800.01800.01800.01800.05520.05520.05520.0552
Drugs & Medical
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.00340.00340.00340.00340.00400.00400.00400.00400.1690superscript0.1690absent0.1690^{**}0.1690 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.1702superscript0.1702absent0.1702^{**}0.1702 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.09930.09930.09930.09930.06150.06150.06150.0615
(0.0069)0.0069(0.0069)( 0.0069 )(0.0068)0.0068(0.0068)( 0.0068 )(0.0588)0.0588(0.0588)( 0.0588 )(0.0583)0.0583(0.0583)( 0.0583 )(0.1651)0.1651(0.1651)( 0.1651 )(0.1464)0.1464(0.1464)( 0.1464 )
Lawyer is female (1 = Yes)0.0333superscript0.0333absent0.0333^{**}0.0333 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT0.00470.00470.00470.00470.10210.10210.10210.10210.04200.04200.04200.04202.0064superscript2.0064absent-2.0064^{***}- 2.0064 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.20360.2036-0.2036- 0.2036
(0.0112)0.0112(0.0112)( 0.0112 )(0.0117)0.0117(0.0117)( 0.0117 )(0.0950)0.0950(0.0950)( 0.0950 )(0.0995)0.0995(0.0995)( 0.0995 )(0.2667)0.2667(0.2667)( 0.2667 )(0.2495)0.2495(0.2495)( 0.2495 )
Constant0.1689superscript0.1689absent0.1689^{***}0.1689 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2024superscript0.2024absent0.2024^{**}0.2024 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT3.0063superscript3.0063absent3.0063^{***}3.0063 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.2853superscript3.2853absent3.2853^{***}3.2853 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT20.3865superscript20.3865absent20.3865^{***}20.3865 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT18.1602superscript18.1602absent18.1602^{***}18.1602 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0050)0.0050(0.0050)( 0.0050 )(0.0775)0.0775(0.0775)( 0.0775 )(0.0426)0.0426(0.0426)( 0.0426 )(0.6605)0.6605(0.6605)( 0.6605 )(0.1195)0.1195(0.1195)( 0.1195 )(1.6570)1.6570(1.6570)( 1.6570 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N305630563056305630563056305630563056305630563056305630563056305630563056305630563056305630563056
r20.00300.00300.00300.00300.04050.04050.04050.04050.00310.00310.00310.00310.03320.03320.03320.03320.01820.01820.01820.01820.23950.23950.23950.2395

Chemical
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.00040.0004-0.0004- 0.00040.00040.0004-0.0004- 0.00040.07430.07430.07430.07430.07390.07390.07390.07390.03440.0344-0.0344- 0.03440.03710.0371-0.0371- 0.0371
(0.0070)0.0070(0.0070)( 0.0070 )(0.0069)0.0069(0.0069)( 0.0069 )(0.0607)0.0607(0.0607)( 0.0607 )(0.0607)0.0607(0.0607)( 0.0607 )(0.1777)0.1777(0.1777)( 0.1777 )(0.1660)0.1660(0.1660)( 0.1660 )
Lawyer is female (1 = Yes)0.01250.01250.01250.01250.01010.01010.01010.01010.01210.0121-0.0121- 0.01210.01430.01430.01430.01431.4362superscript1.4362absent-1.4362^{***}- 1.4362 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT1.2652superscript1.2652absent-1.2652^{***}- 1.2652 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0118)0.0118(0.0118)( 0.0118 )(0.0118)0.0118(0.0118)( 0.0118 )(0.1017)0.1017(0.1017)( 0.1017 )(0.1036)0.1036(0.1036)( 0.1036 )(0.2974)0.2974(0.2974)( 0.2974 )(0.2831)0.2831(0.2831)( 0.2831 )
Constant0.1725superscript0.1725absent0.1725^{***}0.1725 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2024superscript0.2024absent0.2024^{***}0.2024 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.0471superscript3.0471absent3.0471^{***}3.0471 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT2.8248superscript2.8248absent2.8248^{***}2.8248 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT19.8252superscript19.8252absent19.8252^{***}19.8252 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT16.3100superscript16.3100absent16.3100^{***}16.3100 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0051)0.0051(0.0051)( 0.0051 )(0.0379)0.0379(0.0379)( 0.0379 )(0.0439)0.0439(0.0439)( 0.0439 )(0.3318)0.3318(0.3318)( 0.3318 )(0.1285)0.1285(0.1285)( 0.1285 )(0.9070)0.9070(0.9070)( 0.9070 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N252425242524252425242524252425242524252425242524252425242524252425242524252425242524252425242524
r20.00040.00040.00040.00040.04640.04640.04640.04640.00060.00060.00060.00060.01850.01850.01850.01850.00920.00920.00920.00920.15010.15010.15010.1501
Others
DV: Inv.-Inf. RatioDV: Involved rateDV: Informational rate
(1)(2)(3)(4)(5)(6)
Inventor is female (1 = Yes)0.0056superscript0.0056absent0.0056^{***}0.0056 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.0056superscript0.0056absent0.0056^{***}0.0056 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1542superscript0.1542absent0.1542^{***}0.1542 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1537superscript0.1537absent0.1537^{***}0.1537 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2651superscript0.2651absent0.2651^{***}0.2651 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.2603superscript0.2603absent0.2603^{***}0.2603 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0017)0.0017(0.0017)( 0.0017 )(0.0017)0.0017(0.0017)( 0.0017 )(0.0293)0.0293(0.0293)( 0.0293 )(0.0293)0.0293(0.0293)( 0.0293 )(0.0699)0.0699(0.0699)( 0.0699 )(0.0691)0.0691(0.0691)( 0.0691 )
Lawyer is female (1 = Yes)0.0084superscript0.00840.0084^{*}0.0084 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT0.0079superscript0.00790.0079^{*}0.0079 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT0.00740.0074-0.0074- 0.00740.01980.01980.01980.01980.7232superscript0.7232absent-0.7232^{***}- 0.7232 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.4835superscript0.4835absent-0.4835^{**}- 0.4835 start_POSTSUPERSCRIPT ∗ ∗ end_POSTSUPERSCRIPT
(0.0036)0.0036(0.0036)( 0.0036 )(0.0037)0.0037(0.0037)( 0.0037 )(0.0623)0.0623(0.0623)( 0.0623 )(0.0630)0.0630(0.0630)( 0.0630 )(0.1487)0.1487(0.1487)( 0.1487 )(0.1487)0.1487(0.1487)( 0.1487 )
Constant0.1495superscript0.1495absent0.1495^{***}0.1495 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT0.1559superscript0.1559absent0.1559^{***}0.1559 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.1071superscript3.1071absent3.1071^{***}3.1071 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT3.1456superscript3.1456absent3.1456^{***}3.1456 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT21.8632superscript21.8632absent21.8632^{***}21.8632 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT21.8371superscript21.8371absent21.8371^{***}21.8371 start_POSTSUPERSCRIPT ∗ ∗ ∗ end_POSTSUPERSCRIPT
(0.0012)0.0012(0.0012)( 0.0012 )(0.0085)0.0085(0.0085)( 0.0085 )(0.0209)0.0209(0.0209)( 0.0209 )(0.1451)0.1451(0.1451)( 0.1451 )(0.0499)0.0499(0.0499)( 0.0499 )(0.3427)0.3427(0.3427)( 0.3427 )
Field fixed effectsNoYesNoYesNoYes
Year fixed effectsNoYesNoYesNoYes
N117781177811778117781177811778117781177811778117781177811778117781177811778117781177811778117781177811778117781177811778
r20.00140.00140.00140.00140.00650.00650.00650.00650.00240.00240.00240.00240.00870.00870.00870.00870.00310.00310.00310.00310.03120.03120.03120.0312
//pypi.org/project/stylometer. Financial support from the National Science Foundation (grants 1932689 and 1932596) is gratefully acknowledged. Any errors are those of the authors alone. (2024)

References

Top Articles
Latest Posts
Article information

Author: Delena Feil

Last Updated:

Views: 6097

Rating: 4.4 / 5 (65 voted)

Reviews: 80% of readers found this page helpful

Author information

Name: Delena Feil

Birthday: 1998-08-29

Address: 747 Lubowitz Run, Sidmouth, HI 90646-5543

Phone: +99513241752844

Job: Design Supervisor

Hobby: Digital arts, Lacemaking, Air sports, Running, Scouting, Shooting, Puzzles

Introduction: My name is Delena Feil, I am a clean, splendid, calm, fancy, jolly, bright, faithful person who loves writing and wants to share my knowledge and understanding with you.