Sunday, March 26, 2023

Lifespans of the European Elite, 800–1800: Marked increases around 1400 and again around 1650

Lifespans of the European Elite, 800–1800. Neil Cummins. The Journal of Economic History, Volume 77, Issue 2, June 2017, pp. 406-439. https://doi.org/10.1017/S0022050717000468

Abstract: I analyze the adult age at death of 115,650 European nobles from 800 to 1800. Longevity began increasing long before 1800 and the Industrial Revolution, with marked increases around 1400 and again around 1650. Declines in violent deaths from battle contributed to some of this increase, but the majority must reflect other changes in individual behavior. There are historic spatial contours to European elite mortality; Northwest Europe achieved greater adult lifespans than the rest of Europe even by 1000 AD.

DISCUSSION

This study has characterized adult noble lifespans from 800 to 1800. The consistent and large association uncovered between sex and plague mortality for nobles runs counter to the indiscriminate reputation of the Black Death and counter to recent paleodemographic analysis on skeletons from fourteenth century London (DeWitte 2009). If plague killed more women than men, a simple supply-side effect increasing female agency in the marriage market could explain the origin of the European Marriage Pattern (Hajnal 1965; De Moor and Van Zanden 2010; Voigtländer and Voth 2013). Of course this is a premature speculation; the patterns reported here would have to be convincingly established for the population at large.

The sharp decline in the proportion of male nobles dying from battle, from over 600 years of a steady 30 percent, to less than 5 percent in the sixteenth century, predates the arrival of the Industrial Revolution by two centuries. The long-run decline in violence is cited as one of the principal correlates of the emergence of the modern world, with the “civilizing process” needing the transformation of warrior nobles into gentleman courtiers (Elias 1982).

One can perhaps ask why battlefield violence declined among European nobility. Nobility certainly did not lose its taste for military life. The Wars of Religion following 1500 were aristocratic feuds at least as much as earlier wars. However, the decline in battlefield death amongst nobles corresponds to the emergence of modern warfare: artillery, standing armies, and the replacement of privilege with merit. The power of hereditary warrior status declined in battle as modern and larger standing armies, led by increasingly wealthy princes, focused upon artillery and infantry (Keen 1984, pp. 1, 238–53). The decline of cavalry meant that nobility became officers, inherently a more administrative role than before (Keen 1984, p. 240). In war, nobility still led, but from the safety of the rear guard, not the front lines.

I estimate the time-trend of adult noble lifespan over the millennium between 800 and 1800. The findings on the timing of the modern rise in age at death agree almost exactly with de la Croix and Licandro (2012) (the birth cohort of 1640–1649). The nobility are, in general, forerunners of Europe's mortality transition as David, Johansson, and Pozzi (2010, figs. 3(a) and 3(b), p. 28) argue too. This may provide a clue for those who seek to explain why mortality declined. There could be an important role for individual behavior and a demonstration effect (e.g., hygiene and other behavioral traits) as this rise predates modern medicine or any public health measures. It also predates the Industrial Revolution. Whilst modern evidence suggests that life expectancy does not matter for economic growth (Acemoglu and Johnson 2007), the case has not been proven for the preindustrial era.

Unlike de la Croix and Licandro (2012), this study argues that lifespan was not a stationary trend before 1650. There are significant oscillations, most importantly the sharp Europe-wide rise in noble lifespan after 1400. The rise is stronger over the 1400–1600 interval in Ireland, Scotland, and in particular, England and Wales (Figures 11 and 12). This pattern has remained hidden. Only long and deep time series of at least a millennium in length could uncover it. For England, this result can be directly compared with existing estimates of adult mortality. The dramatic rise from the fourteenth to the fifteenth and sixteenth centuries revealed in Figure 12(a) is in broad accordance with Russell's estimates of life expectancy at age 25 (e25) for tenants-in-chief of the crown from the Inquisitions Post Mortem (Smith 2012, Figure 10, p. 79). However, recent re-estimates of e25 for these same data (Poos, Oeppen, and Smith 2012) suggest a much higher level and a flat trend, at about 30 years, during the fourteenth century. Monastic evidence from communities in Durham, Canterbury, and Westminster points to a decline in e25 from 1450 to 1500 (Poos, Oeppen, and Smith 2012, Figure 8.2, p. 162). This is not the pattern I find. Figure 12(a) reports the opposite trend for the English elite: a sharply rising trend in predicted average age at death, for those dying over 20, from 1450–1500. The evidence I have assembled and analyzed in this article strongly suggests a substantial improvement in lifespan in the fifteenth century for the English elite.

No conclusions can be drawn as to why adult noble lifespan increased so much after 1400. No known medical innovations in Europe before 1500 could be responsible. Nutrition, in terms of calories consumed, also cannot explain this rise. These elites could be expected to have always filled their bellies. For those who argue that the “modern rise of population” was a result of nutrition, the equality of aristocratic and peasant lifespans in the past has therefore presented a paradox (see Fogel (1986, pp. 480–84) and McKeown (1976, pp. 139–42)). Robert Fogel attributed this “peerage paradox” to the vast quantities of alcohol the English elite consumed (1986, p. 483). Perhaps diet changed in other ways. The late fourteenth century did witness an increase in the proportion of manuscripts on health. Works such as the Tacuinum Sanitatis, incorporating Arabic and Ancient knowledge, recommended moderation in food and alcohol, adequate rest, and exercise and, similar to modern medicine, emphasized the importance of vegetables and fruit to human health (Janick, Daunay, and Paris 2010). Of course, the actual effect of these manuscripts is speculation at this point.

The rise in elite adult age at death for those born after 1400 could also be the result of a Darwinian selection effect from the half century of recurring plague that returned in 1347. Plague killed those susceptible to plague but would also have purged the population of other frailties that may have been correlated with plague susceptibility. However, most people, even during the plague era, died from other causes. The real long-term demographic effect of the Black Death could have been through its effect on the disease climate. Noble lifespan in Figure 8 corresponds closely to the trend in real wages in England (Clark 2005, fig. 4, p. 1311) and to recent estimates of gross domestic product (GDP) per capita (Broadberry et al. 2015, p. 206). Improved nutrition amongst the general population, from higher real incomes via Malthusian dynamics, could have led to a reduction in the incidence of other infectious diseases among plague survivors and their offspring. Nutritional status did little to diminish plague lethality (see Fogel 1986, table 9.11, p. 481) but together with a “purging” effect, the Black Death could have led to an improved climate against infectious disease, especially in cities.

The cause of the 1400 rise in adult noble lifespan is unknown. Presently only speculations can be made. Future empirical work, perhaps linking estate account books (to reconstruct diet) to specific time and location (rural/urban) effects and genealogies of the kind analyzed here, will have great potential to answer this mystery.

This article documents a geographic pattern to European elite lifespans. The mortality gradient runs South-North and East-West, and has existed since before the Black Death. The long existence of such a geographic “effect,” and the factors which are causing it, may have implications for recent work which stresses the “little divergence” between Northwest Europe and the Southeast (Voigtländer and Voth 2013; Broadberry 2013; de Pleijt and van Zanden 2013). The Black Death is not the first turning point. There was something about Northwest Europe long before 1346 that led to nobles living longer lives.


People indulge in the belief that social justice is on a uniform path to betterment

Beliefs About Linear Social Progress. Julia D. Hur, Rachel L. Ruttan. Personality and Social Psychology Bulletin, March 23, 2023. https://doi.org/10.1177/01461672231158843

Abstract: Society changes, but the degree to which it has changed can be difficult to evaluate. We propose that people possess beliefs that society has made, and will make, progress in a linear fashion toward social justice. Five sets of studies (13 studies in total) demonstrate that American participants consistently estimated that over time, society has made positive, linear progress toward social issues, such as gender equality, racial diversity, and environmental protection. These estimates were often not aligned with reality, where much progress has been made in a nonlinear fashion. We also ruled out some potential alternative explanations (Study 3) and explored the potential correlates of linear progress beliefs (Study 4). We further showed that these beliefs reduced the perceived urgency and effort needed to make further progress on social issues (Study 5), which may ultimately inhibit people’s willingness to act.

Saturday, March 25, 2023

Replication failure... No effects of exposure to women's fertile window body scents on men's hormonal and psychological responses

No effects of exposure to women's fertile window body scents on men's hormonal and psychological responses. James R. Roney et al. Evolution and Human Behavior, March 22 2023. https://doi.org/10.1016/j.evolhumbehav.2023.03.003

Abstract: Do men respond to women's peri-ovulatory body odors in functional ways? Prior studies reported more positive changes in men's testosterone and cortisol after exposure to women's scents collected within the putative fertile window (i.e., cycle days when conception is possible) compared to comparison odors, and also psychological priming effects that were differentially larger in response to the fertile window odors. We tested replication of these patterns in a study with precise estimation of women's ovulatory timing. Both axillary and genital scent samples were collected from undergraduate women on six nights spaced five days apart. Here, we tested men's responses to a subset of these samples that were chosen strategically to represent three cycle regions from each of 28 women with confirmed ovulation: the follicular phase prior to the start of the fertile window, the fertile window, and the luteal phase. A final sample of 182 men were each randomly assigned to smell one scent sample or plain water. Saliva samples were collected before and after smelling to assess changes in testosterone and cortisol, and psychological measures of both sexual priming and social approach motivation were assessed after stimulus exposure. Planned comparisons of fertile window to other stimuli revealed no statistically significant effects for any dependent variable, in spite of sufficient power to detect effect sizes reported in prior studies. Our findings thus failed to replicate prior publications that showed potentially adaptive responses to women's ovulatory odors. Discussion addresses the implications of these findings for the broader question of concealed ovulation in humans.

Keywords: Scent attractiveness; Concealed ovulation; Testosterone; Cortisol; Human mating

4. Discussion

As a general summary, we found no compelling evidence that men exhibit differential hormonal or psychological responses to women's body odors collected near ovulation relative to their responses to body odors from other cycle regions (or to plain water). The nonsignificant findings occurred despite our study having sufficient power to detect effect sizes that have been reported in the prior literature. Furthermore, Bayes factors computed for each of our dependent variables suggested that the observed data were about 10 to 13 times more likely under null models than under models including the fertile window contrast, and Bayes factors in this range have been argued to provide strong evidence in favor of the null hypothesis (see Schonbrodt & Wagenmakers, 2018).
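To give a rough sense of what Bayes factors of "about 10 to 13" in favor of the null mean, the sketch below converts a Bayes factor into a posterior probability. It assumes equal prior odds for the null and alternative models, which is an illustrative choice and not something stated in the paper; the function name `posterior_null_probability` is likewise hypothetical.

```python
def posterior_null_probability(bf01: float, prior_null: float = 0.5) -> float:
    """Posterior probability of the null model given a Bayes factor BF01.

    bf01 is the ratio P(data | null) / P(data | alternative).
    prior_null is an assumed prior probability of the null (0.5 = equal odds).
    """
    prior_odds = prior_null / (1.0 - prior_null)
    posterior_odds = bf01 * prior_odds  # Bayes' rule in odds form
    return posterior_odds / (1.0 + posterior_odds)


# The range quoted in the discussion: roughly 10 to 13 in favor of the null.
for bf in (10.0, 13.0):
    p = posterior_null_probability(bf)
    print(f"BF01 = {bf:4.1f}: P(null | data) = {p:.3f}")
```

Under that equal-prior assumption, a Bayes factor of 10 corresponds to a posterior probability of 10/11 for the null and a Bayes factor of 13 to 13/14; a reader with different priors would reach different numbers.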

4.1. Hormone responses to scent stimuli

Our results did not replicate prior findings of more positive testosterone or cortisol changes after exposure to scents collected near ovulation relative to comparison scents (Cerda-Molina et al., 2013; Miller & Maner, 2010). Our findings for testosterone were more similar to those of Roney and Simmons (2012), who reported no significant differences in hormone changes after exposure to peri-ovulatory scents vs. after exposure to plain water. Among prior studies, only Cerda-Molina et al. (2013) measured differential cortisol responses to women's peri-ovulatory body odors, and they reported a complex pattern whereby cortisol rose above basal concentrations at 15 min. post-exposure for peri-ovulatory stimuli and at 30 min. post-exposure for luteal vulvar stimuli, but fell below baseline concentrations for luteal axillary stimuli at 15 and 60 min. post-exposure. Our results at 15 min. post-exposure did not replicate those patterns.

What may account for differences between results of the current study and those of prior studies that have reported significant hormone responses to women's peri-ovulatory body scents? Miller and Maner (2010) used whole T-shirts as scent stimuli as opposed to our use of gauze pads; although Cerda-Molina et al. (2013) employed similar collection methods to those in the present study, we cannot rule out the possibility that hormone responses may be more reliable in response to shirt stimuli. A possible limitation of our method was the longer time that we stored frozen samples before use in testing (up to a year, as opposed to samples being used within a week in Miller and Maner (2010) and Cerda-Molina et al. (2013)), although studies that have varied length of storage have provided evidence that responses to human body scents are not affected by long freezing times (Gomes et al., 2020; Lenochova et al., 2009). We estimated ovulatory timing more precisely via use of LH tests than did Miller and Maner (2010), who used highly error-prone counting methods (see Gangestad et al., 2016), and this should have increased our probability of finding true effects. Cerda-Molina et al. (2013) cited two factors that might explain discrepancies between their results and the null effects reported by Roney and Simmons (2012), namely the longer stimulus collection time in their study and evidence that men in their study were aware that they were smelling women, but both of these differences were eliminated in the present study, in which women collected scents overnight and male participants were explicitly told that they were smelling odors from women.

A salient difference between our methods and those of Cerda-Molina et al. (2013) was their use of a nebulizer containing scent stimuli (or plain air) in order to forcefully project odors into participants' nasal passages. It is possible that this method produces hormone responses in perceivers that are absent after taking deep sniffs from jars containing scent stimuli. The ecological validity of the nebulizer delivery method is uncertain. On the one hand, it may deliver stimuli of supra-normal intensity that are not encountered under real-world conditions. On the other hand, it is possible that this method approximates the greater intensity of odor exposure that might occur during some forms of sexual contact. In any case, this difference in scent delivery method presents a possible reason for the discrepancy in findings across the two studies.

It is also possible that prior positive findings for men's hormone responses to women's peri-ovulatory body odors were false positive results. The patterns described in Cerda-Molina et al. (2013) were particularly striking in that men generally responded with testosterone increases after smelling peri-ovulatory stimuli but testosterone decreases after exposure to luteal stimuli. That pattern suggested that men's hormone responses might be strong enough that they could accurately diagnose women's ovulatory timing from scent cues alone. The current findings cast at least some doubt on the robustness of those findings. Future research would ideally provide additional evidence.

4.2. Psychological responses to scent stimuli

Our results also failed to replicate prior findings suggesting the priming of sexual concepts after exposure to peri-ovulatory scent stimuli relative to comparison stimuli. For two dependent variables, we employed measures verbatim from Miller and Maner (2011): the word stem completion task, and a measure of attribution of sexual arousal to the scent donor. A difference in data analysis between studies was the addition of the Chemical Sensitivity Scale (CSS; Nordin, Millqvist, Lowhagen, & Bende, 2003) to the data analyses in Miller and Maner (2011). The scale measures participants' conscious awareness of odors in their environment. For the word stem task, Miller and Maner (2011) added controls for main and interaction effects for scores on this scale in the model testing effects of scent exposure condition. For the sexual arousal attribution task, they reported no main effect of scent exposure condition but a significant interaction between scent condition and CSS scores such that only among men with high smell sensitivity was greater sexual arousal attributed to the peri-ovulatory scents relative to luteal scents. We did not administer this scale, and this difference in method could help to explain discrepant findings for these variables. However, simulation data show that the addition of covariates and testing for interactions with individual difference variables are practices that can inflate type I error rates (Simmons, Nelson, & Simonsohn, 2011), which adds some doubt to the positive findings for the word stem and arousal attribution variables. Furthermore, if sexual priming effects were specialized adaptations for responding to cues of women's ovulatory timing, one would not expect their expression to be restricted only to men with highly sensitive senses of smell. Thus, the overall data pertaining to these variables, including the non-significant findings in the present study, appear to provide weak evidence for adaptations that produce sexual priming effects in response to ovulatory scent cues.

As a more direct measure of sexual priming, we also queried how much sexual desire men felt after exposure to scent cues. There were no significant effects of cycle phase for this variable (see Table 1 and Fig. 4c). Cerda-Molina et al. (2013) administered an “interest in sex” scale and reported higher scores after exposure to peri-ovulatory scent stimuli, but the scale was quite heterogeneous and included trait-like items (e.g., “[how high do] you think that your sexual desire normally is?”) in addition to measures of current states. As with the hormone responses, it is possible that a nebulizer delivery of scents would produce stronger fertile window effects on men's self-reported sexual desire than those found here.

We did find a main effect of stimulus type on sexual desire such that men who smelled the armpit stimuli reported higher desire than those who smelled the pantyliner stimuli. Additional data analyses supported subjective scent attractiveness ratings as mediating this effect of stimulus type. The positive correlation between scent attractiveness ratings and sexual desire supports the possibility that desire responds to odor attractiveness in general even if it does not respond reliably to scents produced during the fertile window. Odor attractiveness may be related to variables like health (e.g., Olsson et al., 2014) or immune compatibility (e.g., Thornhill et al., 2003), and thus responding to it with desire may have functions aside from ovulation detection.

We also administered a custom social approach motivation scale but scores on it were not differentially higher after exposure to scents from the fertile window (see Table 1 and Fig. 4d). Tan and Goldman (2015) used an indirect behavioral measure to provide evidence that men exposed to peri-ovulatory scents were motivated to sit closer to women, and it is possible that our findings would have differed with such a measure. Oren and Shimone-Tsoory (2019) provided evidence that single but not paired men exhibited greater social perception abilities after exposure to peri-ovulatory scents. Although we did not measure social perception, we did assess the possible moderating influence of relationship status for the effects of cycle phase on our dependent measures, in part motivated by the findings of the social perception study. Results presented in SOM provide no compelling evidence that hormonal or psychological responses to fertile window stimuli were consistent with prior positive findings in the subset of single men.

4.3. Implications for concealed ovulatory timing

Our findings argue against the possibility that human ovulatory timing is detectable from body odors. Mei et al. (2022) recently used signal detection analyses to show that increased scent attractiveness during the fertile window was not substantial enough to reliably diagnose ovulatory timing. That finding left open the possibility that diagnostic cues of ovulatory timing might be revealed via adaptive patterns of responses to scents, such as reactive hormone changes. The present results failed to detect any such putatively adaptive responses, however, and thus argue against that possibility.

The present study addressed only odor cues of ovulatory timing. Cues from other sensory modalities could in principle provide more information, or a combination of cues across modalities could prove more diagnostic. With respect to the latter possibility, Miller and Maner (2011) provided evidence that men who interacted in person with a woman confederate were more likely to mimic her movements and to increase their risk-taking when her estimated conception risk was higher. Perhaps in cases like that, a combination of odor, voice, face, and behavioral cues might more accurately cue fertile window timing.

The strongest tests of multi-modal cuing of ovulatory timing should in principle come from studies that measure the responses of women's long-term romantic partners to the women's cycle phases. Such partners should have the most intimate and detailed information regarding changes in any perceptible stimuli, and would also have clear functional reasons to respond to cues of ovulatory timing for the purpose of ensuring paternity confidence. A recent study of nearly 400 couples with preregistered data analyses and many thousands of observations found no significant effects of women's estimated fertile window timing on male partners' ratings of the women's attractiveness, sexual desire for their partners, feelings of jealousy, or levels of attention to and desire to have contact with the women (Schleifenbaum et al., 2022). Those findings corroborate earlier studies that have generally found that men's rates of sexual initiation are flat across phases of their partners' menstrual cycles (Adams, Gold, & Burt, 1978; Caruso et al., 2014; Van Goozen, Wiegant, Endert, Helmond, & VandePoll, 1997; cf. Harvey, 1987). Likewise, and pertinent to the hormonal responses tested in the current study, studies have failed to find significant shifts in men's testosterone concentrations across different phases of their romantic partners' menstrual cycles (Ström, Ingberg, Druvefors, Theodorsson, & Theodorsson, 2012; Ström, Ingberg, Slezak, Theodorsson, & Theodorsson, 2018). Collectively, these patterns are unexpected if women's body odors provide diagnostic information regarding their ovulatory timing, or if multi-modal stimulus cues jointly reveal fertile window timing.

50-70% of all dreams include residue from the previous day, especially in the early stages of sleep, while later stages refer to more distant memories

Memory reactivations during sleep: a neural basis of dream experiences? Claudia Picard-Deland et al. Trends in Cognitive Sciences, March 22 2023. https://doi.org/10.1016/j.tics.2023.02.006


Abstract: Newly encoded memory traces are spontaneously reactivated during sleep. Since their discovery in the 1990s, these memory reactivations have been discussed as a potential neural basis for dream experiences. New results from animal and human research, as well as from the rapidly growing field of sleep and dream engineering, provide essential insights into this question, and reveal both strong parallels and disparities between the two phenomena. We suggest that, although memory reactivations may contribute to subjective experiences across different states of consciousness, they are not likely to be the primary neural basis of dreaming. We identify important limitations in current research paradigms and suggest novel strategies to address this question empirically.


Systematic review of all published fMRI research on psychopathy: No reproducible evidence suggests that psychopathy is associated with a functional neurobiological profile

Jalava, J., Griffiths, S., & Larsen, R. R. (2023). How to keep unreproducible neuroimaging evidence out of court: A case study in fMRI and psychopathy. Psychology, Public Policy, and Law, 29(1), 1–18. Feb 2023. https://doi.org/10.1037/law0000383

Abstract: The amount of neuroimaging evidence introduced in courts continues to increase. Meanwhile, neuroimaging research is in the midst of a reproducibility crisis, as many published findings appear to be false positives. The problem is mostly due to small sample sizes, lack of direct replications, and questionable research practices. There are concerns that a significant proportion of neuroimaging evidence introduced in court may therefore be unreliable. Guidelines governing the admissibility of scientific evidence—Frye and Daubert—are not designed to weed out such data. We propose supplementing Frye and Daubert with minimal reproducibility criteria that allow judges to make informed admissibility decisions about neuroimaging research. To demonstrate how this could work, we subjected functional magnetic resonance imaging (fMRI) findings on psychopathy—evidence that has been admitted in court—to a minimal reproducibility test. A systematic PRISMA search found 64 relevant studies but no sufficiently powered, directly replicated evidence of a psychopathy-related neurobiological profile. This illustrates two things: (a) the probability of false positives in this data set is likely to be unacceptably high and (b) the reproducibility of similar neuroimaging evidence can be evaluated in a straightforward way. Our findings suggest an urgent need to modify admissibility guidelines to exclude low-quality neuroimaging data.

Check also Is the Psychopathic Brain an Artifact of Coding Bias? A Systematic Review. Jarkko Jalava et al. Front. Psychol., April 12 2021. https://www.bipartisanalliance.com/2021/04/is-psychopathic-brain-artifact-of.html

Friday, March 24, 2023

People may not be able to tell if they are envied by another person at a particular moment, but they know who the notoriously envious ones are among the people they have known for a longer time

Lange, Jens, Birk Hagemeyer, Thomas Lösch, and Katrin Rentzsch. 2019. “Accuracy and Bias in the Social Perception of Envy.” OSF Preprints. June 16. doi:10.31219/osf.io/8jc7x

Abstract: Research converges on the notion that when people feel envy, they disguise it towards others. This implies that a person’s envy in a given situation cannot be accurately perceived by peers, as envy lacks a specific display that could be used as a perceptual cue. In contrast to this reasoning, research supports that envy contributes to the regulation of status hierarchies. If envy threatens status positions, people should be highly attentive to identify enviers. The combination of the two led us to expect that (a) state envy is difficult to accurately perceive in unacquainted persons and (b) dispositional enviers can be accurately identified by acquaintances. To investigate these hypotheses, we used actor-partner interdependence models to disentangle accuracy and bias in the perception of state and trait envy. In Study 1, 436 unacquainted dyad members competed against each other and rated their own and the partner’s state envy. Perception bias was significantly positive, yet perception accuracy was non-significant. In Study 2, 502 acquainted dyad members rated their own and the partner’s dispositional benign and malicious envy as well as trait authentic and hubristic pride. Accuracy coefficients were positive for dispositional benign and malicious envy and robust when controlling for trait authentic and hubristic pride. Moreover, accuracy for dispositional benign envy increased with the depth of the relationship. We conclude that enviers might be identifiable but only after extended contact and discuss how this contributes to research on the ambiguous experience of being envied.


Whether intelligence can be achieved without any agency or intrinsic motivation is an important philosophical question; equipping LLMs with agency & intrinsic motivation is a fascinating & important direction for future work

Sparks of Artificial General Intelligence: Early experiments with GPT-4. Sebastien Bubeck et al. Mar 22 2023. https://arxiv.org/pdf/2303.12712.pdf

Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4 [Ope23], was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT4 is part of a new cohort of LLMs (along with ChatGPT and Google’s PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and more, without needing any special prompting. Moreover, in all of these tasks, GPT-4’s performance is strikingly close to human-level performance, and often vastly surpasses prior models such as ChatGPT. Given the breadth and depth of GPT-4’s capabilities, we believe that it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system. In our exploration of GPT-4, we put special emphasis on discovering its limitations, and we discuss the challenges ahead for advancing towards deeper and more comprehensive versions of AGI, including the possible need for pursuing a new paradigm that moves beyond next-word prediction. We conclude with reflections on societal influences of the recent technological leap and future research directions.

---
For example, whether intelligence can be achieved without any agency or intrinsic motivation is an important philosophical question. Equipping LLMs with agency and intrinsic motivation is a fascinating and important direction for future work. With this direction of work, great care would have to be taken on alignment and safety per a system’s abilities to take autonomous actions in the world and to perform autonomous self-improvement via cycles of learning. We discuss a few other crucial missing components of LLMs next.

Thursday, March 23, 2023

Experimental evidence that core intertemporal choice anomalies—like extreme short-run impatience, structural estimates of present bias, hyperbolicity & transitivity violations—are driven by complexity rather than time or risk preferences

Complexity and Time. Benjamin Enke, Thomas Graeber & Ryan Oprea. NBER Working Paper 31047. Mar 2023. DOI 10.3386/w31047

Abstract: We provide experimental evidence that core intertemporal choice anomalies -- including extreme short-run impatience, structural estimates of present bias, hyperbolicity and transitivity violations -- are driven by complexity rather than time or risk preferences. First, all anomalies also arise in structurally similar atemporal decision problems involving valuation of iteratively discounted (but immediately paid) rewards. These computational errors are strongly predictive of intertemporal decisions. Second, intertemporal choice anomalies are highly correlated with indices of complexity responses including cognitive uncertainty and choice inconsistency. We show that model misspecification resulting from ignoring behavioral responses to complexity severely inflates structural estimates of present bias.


Female participants who interacted with a female chatbot gave the lowest ratings for goodwill and likeability among all groups

Gender identity and influence in human-machine communication: A mixed-methods exploration. Weizi Liu, Mike Yao. Computers in Human Behavior, March 20 2023, 107750. https://doi.org/10.1016/j.chb.2023.107750

Abstract: The advancement of conversational technologies stimulates new research agenda on the patterns, norms, and social impacts of human-machine communication (HMC) as a novel process. Conversational agents (CAs), a prevalent example of machines that communicate with users directly, are usually depicted as females in assisting roles. This study intends to explore empirical evidence of how “gendered” technologies might influence HMC and potentially reinforce gender stereotyping in human-human communication. We applied a mixed-methods approach to explore users' gender-related responses and evaluations in the interaction with CAs. First, we observed unrestricted interactions between 36 human participants and Amazon Alexa in a laboratory and qualitatively analyzed the transcripts to detect gendered communication cues. We then conducted a 2 × 3 (participant gender: female vs. male; CA gender: female vs. male vs. neutral) online experiment where 250 participants interacted with a customized chatbot created by the researcher. Results showed participants’ different emotions/tones, engagement, (non)accommodation, as well as credibility, attraction, and likeability evaluations between human-CA gender pairs.


Expressions of Pain, Pleasure, and Fear Are Consistently Rated Due to Chance

“Eye can’t see the difference”: Facial Expressions of Pain, Pleasure, and Fear Are Consistently Rated Due to Chance. Silvia Boschetti, Hermann Prossinger, Tomáš Hladký, Kamila Machová, Jakub Binter. Human Ethology, Volume 37, 046-072, Nov 26, 2022. https://doi.org/10.22330/he/37/046-072

Abstract: Our research consisted of two studies focusing on the probability of humans being able to perceive the difference between faces expressing pain versus pleasure. As controls, we included: smile, neutral facial expression, and expression of fear. The first study was conducted online and used a large sample (n = 902) of respondents. The second study was conducted in a laboratory setting and involved a stress induction procedure. For both, the task was to categorize whether the facial expression was rated positive, neutral or negative. Stimuli were faces extracted from freely downloadable online videos. Each rating participant (rater) was presented with five facial expressions (stimuli) of five females and of five males. All raters were presented with the stimuli twice so as to evaluate the consistency of the ratings. Beforehand, we tested for stimuli differences using specialized software and found decisive differences. Using a Bayesian statistical approach, we could test for consistencies and due-to-chance probabilities. The results support the prediction that the results are not repeatable but are solely due to chance, decreasing the communication value of the expressions of pain and pleasure. The expression of fear was also rated due to chance, but neither neutral nor smile. Stress induction did have an impact  on the perception of pleasure.


Keywords: perception, emotion, facial expression, visual stimuli, BDSM, pain and pleasure, Dirichlet distribution, Bayesian statistical approach, Cold Pressor Task


Rolf Degen summarizing... There is a fine line between neuroticism and high sensitivity, and the self-diagnosis of high sensitivity brings considerable redemption

On the feeling of being different–an interview study with people who define themselves as highly sensitive. Marcus Roth, Danièle A. Gubler, Tobias Janelt, Banous Kolioutsis, Stefan J. Troche. PLoS ONE, March 17, 2023. https://doi.org/10.1371/journal.pone.0283311

Abstract: The construct of “sensory processing sensitivity” has become an extremely popular concept outside the scientific literature under the term “high sensitivity” (HS), reflected in a variety of self-help guides and media reports. Therefore, the present study aimed to investigate this phenomenon by examining in-depth individuals who consider the label HS essential to their self-definition. In semi-structured interviews, 38 individuals described their understanding of HS and its perceived manifestations and impact on their lives (among other topics). Subsequently, the data were content-analytically evaluated, i.e., categorized and quantified. One key finding was that HS individuals feel relief following self-attribution or self-diagnosis. Moreover, this self-attribution replaced the feeling of being somehow different from the others, which almost all interviewees mentioned, with positive attributes. The main negative features of HS mentioned were feeling overwhelmed by sensory and emotional stimuli. The results are discussed with regard to the significance of the label HS for this group on the one hand, and with regard to alternative approaches for future research on the other hand.

Discussion

As described in the introduction, the construct known variously as sensory processing sensitivity (SPS) or high sensitivity (HS) has gained enormous importance in everyday psychology, going far beyond the construct’s scientific foundation. As the abundance of popular scientific literature and self-help guides shows, many people identify with and feel that they fall under the category of SPS. Therefore, the present study sought to find out how these people define HS, what manifestations they perceive, and what impact HS has on their lives. For this purpose, we conducted interviews with individuals who strongly define themselves as highly sensitive. Of course, it can be assumed that the definition of HSP and the self-perception of individuals is strongly influenced by social media and popular scientific works. Therefore, it is not surprising that the definitional elements we found are very much reflective of the popular scientific literature [see e.g., 7, 32, 40].

Summarizing the interview statements, the following picture emerges: People see the main characteristic of HS as increased and more intensive perception of emotional and sensory stimuli as well as longer processing of these stimuli. In addition, many subjects describe that they have stronger emotional empathy and are better able to recognize the perspectives of others. This is seen as positive by many, whereas the resulting feeling of exhaustibility and overstimulation is evaluated as negative by most. These data correspond with previous empirical findings that showed that HS is associated with global symptom load [18, 19], stress [41, 42], and anxiety [43]. Overall, as stated also in the scientific literature cited above, the feeling of being overwhelmed is essential to defining HS [e.g., 5, 17].

Despite these sometimes stressful experiences, almost all of the participants interviewed reported predominantly positive feelings when they first heard about HS. For many, this amounted to an attestation of “being normal”. Many saw themselves as part of a larger community and no longer as outsiders. The feeling of being somehow different from others, which almost all interviewees mentioned, was replaced with positive attributes. Thus, identification with HS can be described as "liberation" from the feeling of being deficient for most participants in this study. Correspondingly, a majority reported greater self-acceptance, especially since many explicitly described HS as a special ability. One participant summarized this connection in a particularly impressive way: “So, if you have always this stamp on your forehead that you are different, then it is very nice to hear that there is a cause for it–that it is not a disorder, but actually a special ability.”

At this point, we would like to make a first attempt to relate this pattern of results to the lack of separability between HS and neuroticism [e.g., 6, 18, 31]: Neuroticism is commonly evaluated negatively, as seen when individuals are asked to report their personality traits under “faking good” instructions [e.g., 44, 45]. This is likely reinforced by terms such as "emotional lability". Here, only the negative side of high neuroticism is considered, while positive features related to increased emotionality are not included—neither in the description of this personality trait nor in the items measuring it. In contrast to traits like “neurotic” or “introverted”, the term “highly sensitive” appears to be positively connotated. Furthermore, this term not only describes deficits, but also includes strengths of high neuroticism. In this way, the concept of HS might be a (quite desirable) way to free neuroticism from its purely deficit-based characterization. As shown by our results, HS people described suffering as a result of the pathologization of their emotionality and therefore experienced the label “highly sensitive” as liberating. In principle, a neutral label for a basic personality trait seems necessary. However, the problem with HS could be that the same mistake, namely judgmental labelling, is now made in the reverse direction: HS is posited as a positive trait by the flower metaphor [1, 29, 46], for example, according to which people are divided into “dandelions” (i.e. low sensitivity), “tulips” (medium sensitivity), and “orchids” (i.e. high sensitivity). Here, it seems useful to find a middle ground in terminology–something between “disturbed neurotics” and “the elected few of the human race” (to put it in rather pointed terms).

Interestingly, a recent study was able to demonstrate links between SPS and both vulnerable and grandiose narcissism [47].

In addition to highlighting people’s need to receive a neutral or positive description of their personality in order to be able to accept themselves, the present study can also advance scientific research. Of course, it remains possible that SPS actually exists as a trait but has so far been insufficiently conceptualized and measured. As mentioned above, it is currently difficult to distinguish HS from neuroticism, introversion and openness. Undoubtedly, one reason for this is the HSPS, which contains a large number of items measuring neuroticism, extraversion, and openness. However, this should not be surprising given how the HSPS items were generated. To extract the basic characteristics of HS people, Aron and Aron [5] asked students from university psychology classes to interview “‘highly sensitive people’—that is, those who are ‘either highly introverted (for example, preferring the company of one or two people) or easily overwhelmed by stimulation (such as noisy places or evocative or shocking entertainment)”. When manifestations of introversion and neuroticism are used as inclusion criteria, it is not surprising that items measuring introversion and neuroticism emerge as a result. It is possible that the “wrong people” were interviewed through this procedure. In contrast, the present study takes a more neutral approach and could serve as a start point for the development of an alternative scale with items that do not measure neuroticism and introversion, but refer primarily to what is specific to HSP.

Nevertheless, the biased sample characteristics can be viewed as limitations of the present study: The vast majority of participants were female and highly educated. These tendencies may not be uncommon in psychological studies, but were especially strong in the current study. However, this is not really surprising due to the recruitment procedure. Furthermore, although N = 38 is considerable for a qualitative sample, this sample size lacks representativeness and therefore must be viewed critically when it comes to generalizability. However, the consistent pattern of our results allows us to assume that the present study’s findings do allow a certain degree of generalization. Of course, such a generalization can only be valid for the German cultural area. Since this is the first study that explores people who define themselves as highly sensitive, information on cultural differences is unfortunately not available. However, the specific ways in which HS manifests “as a blessing and a burden” [47] in different cultures should be an interesting question for future research.


Tuesday, March 21, 2023

Adult men view criminal records as less of a hindrance to partner selection than adult women

She’s Not That into You: Speed Dating with a Criminal Record. Douglas N. Evans & Noreen Ali. Corrections, Mar 13 2023. https://doi.org/10.1080/23774657.2023.2190550

Abstract: Prosocial relationships are beneficial to post-conviction reintegration, but criminal stigma may limit romantic relationship access. This study implements an experimental audit of speed dating, which allows people to meet several potential partners in a brief time, to explore how conviction disclosure, offense type, and attractiveness and personality ratings affect dating interest. Three women and three men confederates of different races/ethnicities were randomly assigned to a control or one of three offense conditions before interacting one-on-one with 64 participants in 4-minute Zoom Q&A speed dating sessions. Following each interaction, participants rated one another on attractiveness, personality dimensions, and interest in dating. Findings indicate that disclosure of property offense conviction significantly reduced women’s willingness to date men confederates while assault and drug convictions did not negatively affect women’s dating interest. Women confederate disclosures of convictions did not affect men’s interest in dating them. Researching the effects of prior convictions on romantic relationship interest is challenging but important in revealing how criminal stigma varies by offense type to affect relationship capital.


Keywords: Speed dating, criminal history disclosure, race, attractiveness, stigma


Worth the Risk? Greater Acceptance of Instrumental Harm Befalling Men than Women

Worth the Risk? Greater Acceptance of Instrumental Harm Befalling Men than Women. Maja Graso, Tania Reynolds & Karl Aquino. Archives of Sexual Behavior, March 17 2023. https://link.springer.com/article/10.1007/s10508-023-02571-0

Abstract: Scientific and organizational interventions often involve trade-offs whereby they benefit some but entail costs to others (i.e., instrumental harm; IH). We hypothesized that the gender of the persons incurring those costs would influence intervention endorsement, such that people would more readily support interventions inflicting IH onto men than onto women. We also hypothesized that women would exhibit greater asymmetries in their acceptance of IH to men versus women. Three experimental studies (two pre-registered) tested these hypotheses. Studies 1 and 2 granted support for these predictions using a variety of interventions and contexts. Study 3 tested a possible boundary condition of these asymmetries using contexts in which women have traditionally been expected to sacrifice more than men: caring for infants, children, the elderly, and the ill. Even in these traditionally female contexts, participants still more readily accepted IH to men than women. Findings indicate people (especially women) are less willing to accept instrumental harm befalling women (vs. men). We discuss the theoretical and practical implications and limitations of our findings.

General Discussion

The current investigation sought to examine whether people were more willing to endorse interventions when IH was borne by men than women. Our first two studies supported this premise. Importantly, however, our results showed that this asymmetry was driven primarily by women, but not men, being more likely to accept IH to men than to women across a variety of contexts (i.e., supporting Hypothesis 2). Study 3 tested a boundary condition to this gender bias in harm tolerance: stereotypically female caregiving contexts. When instrumental harm benefitted vulnerable individuals (e.g., infants, young children, sick, or the elderly), both men and women exhibited a bias in their willingness to accept IH to men versus women (i.e., supporting Hypothesis 1; not supporting Hypothesis 3). That is, contrary to what might be expected by historical gender roles (Eagly & Wood, 1999), people believed men ought to bear greater costs, even in traditionally female sacrificial domains.

Theoretical and Practical Implications

Our findings offer four contributions. First, we extended the literature on gender and harm endorsement, which has primarily emphasized high-conflict sacrificial dilemmas involving questions of life or death (e.g., FeldmanHall et al., 2016; Skulmowski et al., 2014). The current findings revealed this gender bias persists in highly consequential, yet understudied domains: assessments of beneficial interventions carrying negative externalities across a variety of contexts: medical, psychological, educational, sexual, and caregiving. Second, we demonstrated that when evaluating interventions, female participants were more likely than male participants to accept IH borne by men than women. This pattern lends further support to the well-documented finding that women have a stronger in-group bias than men (e.g., Glick et al., 2004; Rudman & Goodwin, 2004) and are more likely to perceive one another as victims than perpetrators (Reynolds et al., 2020). This disparity suggests women may prioritize one another’s welfare over men’s in the construction or approval of social, educational, medical, and occupational interventions. If so, female policymakers might be especially wary of advancing policies or initiatives risking harm to other women, but less so when they risk harming men.

Third, we tested a boundary condition to this gender bias by investigating contexts previously unstudied in sacrificial dilemmas: stereotypically female caregiving roles. Although consideration of gender stereotypes and role congruence (Eagly & Wood, 1999) might predict a greater tolerance for female sacrifice in such contexts, men and women alike were more tolerant of IH incurred by men (versus women). These patterns suggest that although women traditionally fill and sacrifice in these roles, people may not necessarily endorse that ought to be the case. Rather, our results align with emerging evidence documenting diminished concern for men’s suffering due to a greater tendency to stereotype men as perpetrators rather than victims (Reynolds et al., 2020).

Fourth, our findings identified individual-level factors that contribute to asymmetries in harm tolerance. Namely, Studies 2 and 3 revealed that individuals more strongly endorsing egalitarian, feminist, or liberal ideologies exhibited greater disparities in their acceptance of instrumental harm, such that they more readily tolerated instrumental harm borne by men. These patterns suggest those most concerned about rectifying historical injustices might most ardently oppose exploratory interventions potentially providing long-term benefits to women.

Limitations, Emerging Questions, and Future Directions

Although the current investigation has its strengths (e.g., consistent results across varied contexts, within and between-person designs, diverse beneficiaries, pre-registrations), it is not without limitations. First, future investigations might profit, for example, from examining contexts that explicitly signal one’s willingness to sacrifice on behalf of others (e.g., voluntary military service or blood donation) to determine the generalizability of these patterns. Second, our conclusions are limited by our reliance on American MTurk and CloudResearch users. Thus, our results might not generalize to other contexts and cultures. Indeed, changes in stereotypes over time (Charlesworth & Banaji, 2022), and cultural differences in norms surrounding masculinity and femininity might shift beliefs about the value of IH incurred by men versus women (see Glick et al., 2004 for a cross-cultural comparison of attitudes toward men and women). Examining whether the reluctance to expose women to instrumental harm emerges across cultures remains an open avenue for future work. Moreover, our data were collected during the earlier days of COVID-19, which could have influenced the composition or motivations of our samples (Arechar & Rand, 2021). Thus, replication is warranted before strong conclusions can be inferred.

Fourth, although the results of Studies 1 and 2 consistently revealed women’s gender bias in instrumental harm acceptance, their methods could not disentangle whether the bias more strongly emerged from an aversion toward harming women or a desire to benefit women. That is, because both studies pit harm to one sex against the benefit to the other, it is unclear which more strongly contributed to these findings. That Study 3’s female participants (along with male) more readily tolerated men’s (versus women’s) suffering in contexts benefitting vulnerable individuals (rather than women) suggests the possibility Studies 1 and 2’s results reflected women’s greater aversion to harming fellow women, rather than a motivation to benefit them per se. Nonetheless, future research might examine interventions whereby only one sex is benefitted or harmed to adjudicate the relative contribution of these two factors.

Altogether, our findings point to potentially consequential implications for laypeople’s perceptions of exploratory interventions and programs. The asymmetry we documented may place disparate pressures on researchers and policymakers to intervene experimentally on men’s versus women’s afflictions in ways that minimize instrumental harm to women. The biases uncovered here suggest the possibility that women were excluded historically from exploratory research due to an aversion toward inflicting instrumental harm onto women, such as in medicine (Holdcroft, 2007). This ultimately proved costly to women, as men’s overrepresentation in medical research yielded treatments more effective among men than women (Holdcroft, 2007). Thus, although such an aversion may have benefitted women in the short term because women were spared incidental harm imposed by risky experiments, in the long run, experimentation on men unearthed medical and safety advancements better suited for male bodies. Experimental examinations and interventions carry both costs and benefits. If, as our results suggest, people are less willing to accept instrumental harm befalling women, women might lose out on the long-term benefits of such experimental endeavors.

Throughout history, countless male lives have been sacrificed on the battlefield, ostensibly to promote the greater good (Baumeister, 2010). Our findings suggest that these sentiments persist beyond the field of combat. For many people, accepting instrumental harm to men is perceived as worth the cost to advance other social aims. We invite researchers to further investigate how individuals appraise the value of suffering and whether those appraisals differ across target characteristics. A deeper understanding of the biases embedded in such calculations may minimize the unforeseen and unintended consequences of those preferences, thereby reducing harm to men and women alike.

We found a significant increase in pseudo-event coverage, expressing a more positive tone than genuine event coverage; moreover, political pseudo-event coverage shows quadrennial cycles with peaks in each presidential election year

Pseudo-events: Tracking mediatization with machine learning over 40 years. Mengyao Xu, Lingshu Hu, Amanda Hinnant. Computers in Human Behavior, Volume 144, July 2023, 107735. https://doi.org/10.1016/j.chb.2023.107735

Abstract: Using automated content analysis, this research explores the phenomenon of pseudo-events coverage in The New York Times (N = 70,370 articles) from 1980 to 2019. By clarifying the operationalization of pseudo-events, this study introduces pseudo-events as a valuable tool to index how different social subsystems perpetuate mediatization (which is when institutions absorb and abide by media logic). Machine-learning classifiers were constructed to measure pseudo-events, which provides historicity, specificity, and measurability — three tasks set forth for new mediatization research. We found a significant increase in pseudo-event coverage, expressing a more positive tone than genuine event coverage. Moreover, political pseudo-event coverage shows quadrennial cycles with peaks in each presidential election year. Our findings reveal the expansion of mediatization since 1980 and show how media logic has been internalized in different ways by the social subsystems of politics, culture, and economics. Institutions and their social actors need efficient tools to abide by media logic in seeking publicity and commanding authority, and pseudo-events have matured into one of the most dominant tools, especially for political actors. This study offers an innovative approach to capture complex phenomena and shows promises of broader application of machine learning to empirically quantify and identify patterns using theoretical concepts.