Saturday, November 12, 2022

Citizens express higher trust in less politicized scientific fields (mathematics, astronomy, etc.) than politicized ones (economics & climate science), & these trust gaps stem from different ideological biases against the politicized fields

Don't Tell Me What I Don't Want to Hear! Politicization and Ideological Conflict Explain Why Citizens Have Lower Trust in Climate Scientists and Economists Than in Other Natural Scientists. Thor Bech Schrøder. Political Psychology, November 11 2022. https://doi.org/10.1111/pops.12866

Abstract: Studies suggest that citizens have higher trust in some groups of scientists than in others. However, we still know little about the causes of these trust gaps. The current study fills this knowledge gap by examining Norwegian citizens' trust in climate scientists, economists, and so-called “less politicized natural scientists.” I argue that trust in climate scientists and economists is lower than trust in less politicized natural scientists because the former fields are politicized, while the latter are not. Politicization strengthens ideological conflicts between citizens' ideology and research produced by climate scientists and economists, which leads to lower trust in these groups of scientists. I test this argument by running regression analyses on data from a representative survey of the Norwegian population. The results support the argument: Citizens have significantly higher trust in less politicized natural scientists than in both climate scientists and economists, and these differences can be explained by ideological biases in trust. Citizens with a proeconomic growth ideology have significantly lower trust in climate scientists than in less politicized natural scientists, and citizens with a left-wing economic ideology have significantly lower trust in economists than in less politicized natural scientists.

Discussion

The study provides evidence for the argument that citizens express lower trust in scientific areas which are politicized and ideologically dissonant. However, the study also has certain limitations that should be discussed. First, while the study only investigates ideological explanations of trust gaps in one country, Norway has several characteristics that makes it a good empirical case for investigating ideological biases in trust in scientists. Because science is only partly politicized in Norway (some scientific fields are politicized and others are not), the country provides a good case for investigating how politicization affects trust in scientists. By showing that citizens do in fact express higher trust in less politicized scientific fields (mathematics, astronomy etc.) than politicized ones (economics and climate science), and that these trust gaps stem from different ideological biases against the politicized fields, the study provides strong support for the politicization hypothesis. Further, given that we generally observe lower trust in scientists across the board in the United States, where most scientific fields are politicized (Blank & Shaw, 2015), the findings are likely generalizable beyond the Norwegian case. However, future research should test whether these findings can be replicated in other Western countries. The study also underlines the importance of science politicization beyond the Norwegian setting because Norway provides a strong test of the politicization hypothesis due to its' low levels of political polarization (Knudsen, 2021; Lindqvist & Östling, 2010). Citizens with more extreme attitudes are more likely to engage in motivated reasoning, and motivated reasoning has been shown to increase attitude polarization (Taber & Lodge, 2006). Therefore, we should be less likely to observe ideological biases in trust in scientists in countries with low political polarization, like Norway. Since we still find ideological biases in trust in different scientists in Norway under these conditions, we should also expect to find ideological biases in citizens' trust in scientists in countries with higher political polarization. Second, my measure of environmentalist ideology has some weaknesses because it forces citizens to make a trade-off between environmental protection and economic growth and a high standard of living. However, this trade-off is a theoretical construction that not all citizens necessarily agree with, as many mainstream European parties argue that technology and “green” economic growth is the solution to the climate crisis (Nisbet, 2009). Thus, citizens with a moderate environmentalist ideology might not feel that either ends of the scale accurately reflect their attitudes. This could force them to give an answer that does not accurately reflect their attitudes or to answer “do not know.” Third, the environmentalist (growth-protection) and economic ideology dimensions used in the study are conceptually related. Having a right-wing economic ideology (supporting the free market) is associated with prioritizing economic growth with other environmental protection (Crawley, 2021), and this association is theoretically attributed to ideological conflicts between economic growth-oriented right-wing economic ideology and environmentalist policies of market regulation (Crawley, 2021). Therefore, the association between right-wing economic ideology and progrowth environmentalist ideology and trust in climate scientists is likely related to the same mechanism (conflict between climate science and free market/economic growth ideologies). This study cannot directly assess whether this is case, but future research should try to solve this conundrum. Fourth, it might affect the results that the data were collected during the COVID-19 pandemic. Political crises are known to lead to high levels of political and government trust through so-called “rally ‘round the flag” effects (Mueller, 1970; Oneal & Bryan, 1995). Citizens feel that the crisis calls for national unity, and therefore they express trust in the government even if they would not do so in a no-crisis situation. Because the COVID-19 pandemic was a global health crisis, where scientists played an important role in the crisis management (Christensen & Lægreid, 2020a), citizens might have reacted with similar rally effects and expressed higher-than-normal levels of trust in scientists out of a sense of public duty. Since public health scientists were the most visible experts in the COVID-19 crisis management in Norway (Christensen & Lægreid, 2020a), they are more likely to have experienced a rally ‘round-the-flag trust boost than climate scientists or economists. This could lead to an overestimation of the size of the trust gaps between economists and climate scientists and less politicized natural scientists if citizens expressed abnormally high levels of trust in health scientists but maintained their prepandemic trust in economists and climate scientists. Fifth, because the study relies on cross-sectional survey data, I cannot directly test the motivated-reasoning mechanisms that I argue to be the causal mechanisms through which politicization and ideology leads to trust gaps between more politicized and less politicized groups of scientists. The study can only show that environmentalist and economic ideology statistically explain variations in the trust gaps between less politicized natural science and economists and climate scientists respectively, which still is an important contribution to the literature. Future studies should use experimental methods to better test the argument that trust gaps between different groups of scientists stem from motivated processing of scientific information.

Moderate alcohol consumption over the adult life-course was associated with the most favorable social, psychological, lifestyle and health characteristics, while stable abstaining trajectory or decreasing consumption) didn't.

Social, psychological and health characteristics associated with stability and change in adult alcohol consumption. Martin Ekholm Michelsen,Marie Grønkjær,Erik Lykke Mortensen,Cathrine Lawaetz Wimmelmann. PLoS One, November 11, 2022. https://doi.org/10.1371/journal.pone.0277511

Abstract

Background: Many studies analyzing health effects of alcohol consumption have operationalized alcohol intake from a single baseline measure without further follow-up. Consequently, there is a lack of knowledge about stability and change in alcohol consumption over the life course and the social, psychological, lifestyle, and health characteristics associated with different alcohol consumption trajectories.

Objectives: The aims of the study were to describe the prevalence of different adult-life alcohol consumption trajectories among Danish men and to analyze social, psychological, lifestyle and health characteristics associated with these trajectories.

Methods: For 2510 Danish men, retrospective decade-based information on alcohol consumption during life period 26–60 years was obtained in late midlife and information on individual characteristics was obtained in young adulthood, late midlife and from national hospital registries. The men were allocated to one of six a priori defined alcohol consumption trajectories.

Results: About 65% of Danish men had a stable moderate consumption, drinking 1–21 units weekly while the five other consumption trajectories were comparatively rare: 3% stable abstainers, 4.7% stable high-risk drinkers, 10.9% with increasing and 12.7% with decreasing consumption. Moderate consumption over the adult life-course was associated with the most favorable social, psychological, lifestyle and health characteristics while the other trajectories were generally associated with less favorable characteristics to varying degrees–e. g. this was the case for the stable abstaining trajectory and in particular the trajectory with decreasing consumption.

Conclusion: The findings suggest that the majority of Danish men drink moderately in the life period from young adulthood to late midlife, and deviance from this ‘normal’ moderate consumption trajectory is associated with less favorable social, psychological, lifestyle and health characteristics. Some of these characteristics may influence alcohol consumption patterns, but for some of the trajectories, alcohol consumption may influence health as well as social and psychological functioning.


Discussion

Most of the study sample belonged to the moderate consumption trajectory while the five other consumption trajectories were comparatively rare. The abstaining trajectory included the smallest number of men (n = 76), but this trajectory is important in relation to the large literature on abstaining, alcohol, and health. The analyses of sociodemographic factors, psychological characteristics, lifestyle, and health revealed several significant differences among men in the different consumption trajectories.

Moderate consumption trajectory

About two out of three men showed a stable trajectory of moderate drinking within 1–21 units per week. This “normal” consumption trajectory was associated with favorable characteristics on most of the investigated variables. Thus, stable moderate alcohol consumption was associated with better social adjustment, desirable mental characteristics, healthy lifestyle, and good health. Even though the range of weekly units was quite broad for the moderate trajectory, the large number of men with this trajectory indicates considerable stability in alcohol habits for the majority of the study sample, and this is in line with similar British findings [4]. The average consumption levels in Table 1 also suggest substantial stability.

Abstaining trajectory

The relatively few stable abstainers during adulthood deviate negatively on several social, psychological, lifestyle and mental health characteristics. Among the 76 abstainers, 31 individuals reported weekly moderate consumption before age 26. However, none of them reported high-risk drinking, and consequently, alcohol consumption in adolescence is unlikely to explain the associations with health. These findings are important since it has been suggested that negative deviation for abstainers primarily reflects the inclusion of former drinkers in the abstainer category [62425]. In line with our findings, a previous study of young adult Danes observed low education, low intelligence, and personality deviance in abstainers [15] while a broad range of cardiovascular risk factors have shown to be more prevalent among nondrinkers than among drinkers after adjustment for age and sex [26].

High-risk consumption trajectory

This trajectory was observed in 119 men, indicating stable consumption of more than 21 units per week over the adult life span. An increasing and very high average level of consumption in late midlife was observed. However, only 72 (60.5%) men had higher consumption in late midlife (about 58 units) than in young adulthood (about 36 units per week) while the remaining 47 men showed a decrease in average consumption of about 4 units from young adulthood to late midlife. For the high-risk trajectory, it may be particularly informative to compare draft board data and late midlife follow-up data because the high consumption level may have influenced functioning and health over the life course. Compared with the moderate drinkers, the high-risk drinkers did not differ significantly with respect to young adult intelligence (in contrast to a previous Danish study [15]) and BMI, but in late midlife they showed the largest decrease in intelligence score and the largest increase in BMI of all consumption trajectories. The decrease in intelligence did not differ significantly from moderate drinkers, but a previous study using a different methodology observed significantly more decline in intelligence for adult-life consumption of more than 28 units per week [17]. The high-risk consumption trajectory was associated with the highest prevalence of several health problems, which may not only reflect the consistently high level of alcohol consumption, but also the high levels of extreme binge drinking and smoking (in line with [27]) as well as a generally unhealthy lifestyle [28]. The social functioning of high-risk drinking men has perhaps also been influenced by the high level of alcohol consumption as a significantly smaller percentage had a job, a partner, and children. A previous study of young adult Danes observed that high-risk drinking was associated with higher scores on neuroticism and a number of other personality traits [15], and a recent Danish study found that neuroticism prospectively predicts alcohol and substance abuse [29]. In line with these results, we observed significantly higher prevalence of adjustment and personality disorders among men with high-risk consumption trajectory.

Increasing consumption trajectory

The increasing consumption trajectory was characterized by moderate drinking in young adulthood, but on average increasing about five weekly units for each age decade and reaching a high-risk average in the life-period 51–60 years. Participants with this trajectory were comparable on many characteristics to participants with a moderate consumption trajectory. Increased statistical power due to the relatively large group of men with this trajectory may partly explain some of the statistically significant differences. This seems to be the case for the significantly lower intelligence score and higher BMI at the late midlife follow-up, while the higher levels of smoking, extreme binge drinking, treatment for alcohol problems and alcohol-related diagnoses may reflect the drinking trajectory. The increasing trajectory also differed significantly from the moderate trajectory regarding self-reported health, self-reported depression and personality disorders. However, on several health variables, the men in the increasing consumption trajectory showed the second most favorable health status, only surpassed by participants with the moderate consumption trajectory, suggesting that the ability to increase alcohol consumption over the life span may require relatively good physical health.

Decreasing consumption trajectory

The decreasing consumption trajectory was associated with low intelligence and educational level both in young adulthood and in late midlife. Moreover, suboptimal social functioning, personality deviance and a less healthy lifestyle were observed in late midlife. Generally, the decreasing consumption trajectory was associated with remarkably poor physical and mental health characteristics, and this was confirmed by higher prevalence of all categories of mental disorder diagnoses. Particularly important was the high prevalence of alcohol-related diagnoses comparable to the prevalence for the high-risk trajectory as this suggests that the decreasing consumption trajectory includes a substantial number of men with previous high-risk consumption. This was also supported by the high mean consumption levels in the age period 26–40 years and the remarkably low consumption level in the age period 51–60. In fact, among the men with decreasing consumption trajectory, 70% reported abstaining from alcohol in the latter age period. Thus, a study defining abstainers by self-report in late midlife will tend to mix the relatively few life-long abstainers with a larger number of former high-risk drinkers, confounding social, psychological, lifestyle and health characteristics associated with abstaining [5625]. A related issue is why previous high-risk drinkers are motivated and able to limit or abstain from alcohol intake in late midlife. Compared with the high-risk trajectory, the men with the decreasing trajectory seemed to have more health problems, and the substantial health problems may be a crucial factor both directly and indirectly influencing the experience and health consequences of alcohol consumption [930]. Finally, low intelligence and educational level in young adulthood have been observed to predict both physical and mental health problems [153132], and thus, the health problems associated with the decreasing consumption trajectory are not necessarily a consequence of excessive alcohol consumption in early life periods.

Fluctuating consumption trajectory

This trajectory (comprising the second smallest part of the sample) is characterized by fluctuations in level of consumption over the adult life span even though men with this trajectory had the same level of consumption in young adulthood (26–30 years) and in late midlife (51–60 years). Among the 86 men, only six were abstainers in the two defining age periods, while eight men were high-risk drinkers in both periods, and closer analysis of consumption trajectories showed that most of the men had been high-risk drinkers at some point during the age period 31–50 years. Thus, the fluctuating trajectory includes a very heterogeneous group of men, and it may be difficult to interpret the observed associations. However, the trajectory was associated with the second highest smoking level, with several years of extreme binge drinking, and high levels of alcohol-related diagnoses and self-reported treatment for alcohol problems. The trajectory was also associated with significantly poorer physical and mental health. Most of the men had a decrease in consumption from age period 31–50 years to age period 51–60 years, and some of the findings are comparable to the results for the decreasing consumption trajectory.

Strengths and limitations

There are several strengths in the present study, including the large study sample of men with detailed information on alcohol consumption trajectories and many different social, psychological, lifestyle and health characteristics, which both include self-reported information and information from national registries. The adult-life alcohol consumption information made it possible to describe different categories of abstainers and to analyze six groups with different consumption trajectories.

A major limitation is the fact that alcohol consumption during adult life was self-reported and therefore potentially inaccurate due to recall errors and bias. However, the associations between alcohol consumption trajectories and alcohol-related hospital diagnosis corroborate the validity of the self-reported assessments. Moreover, the self-reported information was reliable enough to identify distinct social, psychological, lifestyle and health characteristics for six a priori defined consumption trajectories. Furthermore, the self-report method is supported by a previous British study using the retest method to evaluate the reliability of decade-based life-course self-report questions on alcohol consumption [33]. The study demonstrated not only reasonable reliability, but also correlations with prospective data on alcohol consumption.

Another limitation is the selection of the study sample. First, the LiKO-15 study only invited individuals living in a specific area in Copenhagen who had completed the BPP at draft board examinations, thus excluding men with disqualifying diseases and all women. Second, only 13% of the invited men participated in the study and these participants comprised a selected group with relatively high education and high intelligence in young adulthood [17]. Third, the invited population included a relatively large proportion of men with previous psychiatric admission and diagnosis, including alcohol-related hospital diagnoses [34]. This has obviously influenced the prevalence of mental disorders in the full sample and the prevalence of the six consumption trajectories as well as the social, psychological and health characteristics associated with the trajectories. Thus, it is likely that a more representative Danish sample would have a larger prevalence of both the abstaining and the moderate consumption trajectories as well as a lower prevalence of high-risk consumption. However, irrespective of the selected sample, the prevalence of the different consumption categories obviously reflects the Danish alcohol culture and may not be generalizable to other countries and cultures. In fact, some findings are dramatically different even when differences in sample and methodology are considered. Thus, a study of American military veterans observed about 65% to be abstinent or rare alcohol drinkers, while about 30% were moderate drinkers [14].

While our sample included a large number of men with a moderate consumption trajectory, the number of men with the other trajectories was relatively small, resulting in large standard errors and low statistical power. In particular, this was the case for the abstaining, the high-risk and the fluctuating trajectories, but in spite of potential power problems, we observed a number of significant differences between these trajectories and the moderate consumption trajectory.

Finally, we did not analyze the whole adult life span, but only the age period 26–60 years. Adolescence and young adulthood were excluded on purpose while the age of the participants prohibited analysis of consumption in old adulthood. However, average consumption seems to decline after age 60 [8], and changes in consumption in this life period may reflect both health status and social factors [9].

Parasocial breakup: Grief, distress, & loneliness are experienced by viewers following the departure of a favorite character or the cancellation or completion of a television series

The Contrasting Roles of Enjoyment and Appreciation in Predicting Viewer Distress Following the Dissolution of a Series. K. Maja Krakowiak. Journal of Media PsychologyAdvance Articles, November 04, 2022. https://doi.org/10.1027/1864-1105/a000361

Abstract: In recent years, entertainment scholars have investigated the extent to which negative emotions such as grief, distress, and loneliness are experienced by viewers following the departure of a favorite character or the cancellation or completion of a television series – also referred to as parasocial breakup (PSB). An online questionnaire (N = 814) was administered to understand predictors of PSB following the end of a series as they relate to character involvement and holistic enjoyment and appreciation experiences. Findings revealed that a parasocial relationship (PSR) with a favorite character and liking of and identification with the character positively enhanced enjoyment of the series. PSR and identification also facilitated appreciation of the series. Moreover, while PSR and appreciation increased PSB, character liking and enjoyment of the series decreased it. Theoretical implications for the contrasting roles of hedonic and eudaimonic media experiences in predicting PSB are discussed.


Friday, November 11, 2022

Our findings suggest that doing nothing can be just as costly—if not more costly—than exerting effort

Wu, R., Ferguson, A. M., & Inzlicht, M. (2022). Do humans prefer cognitive effort over doing nothing? Journal of Experimental Psychology: General. Nov 2022. https://doi.org/10.1037/xge0001320

Abstract: Humans and other animals find mental (and physical) effort aversive and have the fundamental drive to avoid it. However, doing nothing is also aversive. Here, we ask whether people choose to avoid effort when the alternative is to do nothing at all. Across 12 studies, participants completed variants of the demand selection task, in which they repeatedly selected between a cognitively effortful task (e.g., simple addition, Stroop task, and symbol-counting task) and a task that required no effort (e.g., doing nothing, watching the computer complete the Stroop, and symbol-viewing). We then tabulated people’s choices. Across our studies and an internal meta-analysis, we found little evidence that people choose to avoid effort (and hints that people sometimes prefer effort) when the alternative was doing nothing. Our findings suggest that doing nothing can be just as costly—if not more costly—than exerting effort.


Thursday, November 10, 2022

Selection in utero (spontaneuous abortion) against male twins in the United States early in the COVID-19 pandemic confirms reproductive suppression (natural selection aborts fetuses unlikely to thrive)

Selection in utero against male twins in the United States early in the COVID-19 pandemic. Tim A. Bruckner, Brenda Bustos, Claire Margerison, Alison Gemmill, Joan Casey, Ralph Catalano. American Journal of Human Biology, November 5 2022. https://doi.org/10.1002/ajhb.23830

Abstract
Objectives: We aim to contribute to the literature reporting tests of selection in utero. The theory of reproductive suppression predicts that natural selection would conserve mechanisms, referred to collectively as selection in utero, that spontaneously abort fetuses unlikely to thrive as infants in the prevailing environment. Tests of this prediction include reports that women give birth to fewer than expected male twins, historically among the frailest of infants, during stressful times. The onset of the COVID-19 pandemic in the United States in Spring 2020 demonstrably stressed the population. We test the hypothesis that conception cohorts in gestation at the onset of the pandemic in the United States yielded fewer than expected live male twin births.

Methods: We retrieved deidentified data on the universe of live births in the United States from the National Center for Health Statistics birth certificate records. We applied Box-Jenkins time-series methods to the twin secondary sex ratio computed for 77 monthly conception cohorts spanning August 2013 to December 2019 to detect outlying cohorts in gestation at the onset of the pandemic.

Results: The twin secondary sex ratio fell below expected values in three conception cohorts (i.e., July, September, and October 2019, all p < .05) exposed in utero to the onset of the pandemic.

Conclusions: Our results add to prior findings consistent with selection in utero. The role of selection in utero in shaping the characteristics of live births cohorts, especially during the COVID-19 pandemic, warrants further scrutiny.

4 DISCUSSION

Whereas much literature finds changes in live birth outcomes following the initial phase of COVID-19, less work examines the possibility that the stressful nature of the early pandemic induced selection in utero. We used conception cohorts and focused on male twin gestations, a subgroup which much theory and empirical work identifies as a sensitive gauge of selection in utero. Results using the universe of births in the United States indicate fewer than expected male twin births among three monthly conception cohorts in utero at the time of COVID-19 “shelter-in-place” policies in March 2020. We hope these findings will encourage further research into the extent to which selection in utero accounts for unexpected patterns of perinatal outcomes in months immediately after the onset of the COVID-19 pandemic.

The magnitude of our results (in terms of percentage reduction of male twins during COVID-19 shelter-in-place policies) appears smaller than that reported for Norway, where odds of male twin births decreased by 27% (Catalano et al., 2021). We, however, caution against direct comparisons for two reasons. First, our US study examined conception, rather than birth, cohorts. Second, we focused on cohorts conceived before January 2020 because earlier research reports these exhibited characteristics (e.g., fewer than expected preterm births) consistent with selection in utero. By contrast, the largest declines in male twinning in Norway occurred among birth cohorts likely conceived after December 2019. We expect that the large observed reductions in fertility, as well as reduced availability of assisted reproductive technology in March 2020 (Vermeulen et al., 2020), could similarly reduce male twinning in the United States among cohorts conceived in 2020.

Strengths of the study involve the conception cohort approach, use of the universe of twin births in the United States, and application of rigorous time-series methods to rule out confounding by factors that affect male and female twin births equally. Well-developed theory of selection in utero, combined with a narrow time window specified a priori in which the onset of the pandemic likely affected male twin gestations, further enhances internal validity. In addition, although twin births appear more frequent among pregnancies conceived using assisted reproductive technologies (ART) (Maalouf et al., 2014; Supramaniam et al., 2019), our findings cannot arise from COVID-induced disruptions in ART service provision that occurred in March and April of 2020, since the affected cohorts we identified were conceived in July, September, and October of 2019. Likewise, changes in fertility behaviors caused by the pandemic cannot drive our findings because the affected cohorts were conceived prior to the pandemic.

Limitations include the lack of direct measures of selection in utero such as fetal loss and/or spontaneous pregnancy losses before the second trimester. Our study relies solely on the live birth data and the gestational age of delivery of live births. Very few countries (e.g., Denmark), however, routinely collect data describing early pregnancy loss at the population level and these data were not available to us. We also do not have information on the zygosity of twins, which precludes examination of whether selection against male twins occurred more for monozygotic than dizygotic twin gestations. We are currently pursuing the feasibility of both analyses using Scandinavian data.

The U.S. data include only month and year of birth thereby requiring us to randomly assign parturition to days of the birth month. Although this strategy may lead to error in estimation of conception date, we know of no reason to infer that such errors biased counts of twin male births toward pre-pandemic conception cohorts. Finally, our analyses assess the United States in aggregate. Different regions of the United States may have responded differently to COVID-19 pandemic exposure or experienced stressors at different time points (e.g., New York City faced the brunt of the pandemic early on) (Van Dorn, Dorn et al., 2020). Future studies may wish to assess regional impacts of the COVID-19 pandemic on male twin births.

Our results add to the empirical research consistent with the argument that reproductive suppression in humans includes selection in utero. They also suggest that selection in utero may account for some as yet unknown, but worth estimating, fraction of the unusual characteristics (e.g., low frequency of preterm births) of infants born during the early months of the COVID-19 pandemic. Further attempts to quantify selection in utero during the pandemic would benefit from arraying putatively frail sets of gestations by conception, rather than by birth, cohorts.

The Emotional Rewards of Prosocial Spending Are Robust and Replicable in Large Samples

The Emotional Rewards of Prosocial Spending Are Robust and Replicable in Large Samples. Lara B. Aknin et al. Current Directions in Psychological Science, November 9, 2022. https://doi.org/10.1177/09637214221121100

Abstract: Past studies show that spending money on other people—prosocial spending—increases a person’s happiness. However, foundational research on this topic was conducted prior to psychology’s credibility revolution (or “replication crisis”), so it is essential to ask whether the evidence supporting this claim is robust and replicable. Here, we consider all 15 published preregistered experiments on prosocial spending to evaluate whether there is causal evidence for the idea that spending money on other people promotes happiness. Although the evidence appears somewhat mixed, we argue that the emotional benefits of prosocial spending are robust and replicable in large samples. These benefits are particularly likely when people have some choice about whether or how to give and when they understand how their generosity makes a difference. This review provides renewed support for the idea that prosocial spending promotes happiness and offers a template for revisiting phenomena that were established prior to the credibility revolution.

---
In 2021, during the second year of the pandemic, responses to a global survey revealed that more than one third (36.7%) of people had donated to charity in the past month, a figure nearly 6% higher than global estimates from before the pandemic (Helliwell et al., 2022). What accounts for such high levels of generosity, even during a global crisis? One possibility is that using financial resources to help other people—called prosocial spending—feels good. Indeed, in the first experiment on this topic, people randomly assigned to spend $5 or $20 on others were significantly happier than people randomly assigned to spend on themselves (Dunn et al., 2008). Since then, the emotional benefits of generosity have been observed in diverse areas of the world, from South Africa to Vanuatu (Aknin et al., 20132015), as well as among toddlers (Aknin et al., 2012; see Dunn et al., 2014, for a review). These findings have been cited widely in the literature, discussed in popular culture (e.g., Grant, 2014), and used to inform government policy (e.g., the 2013 report on charitable giving by the United Kingdom Cabinet Office Behavioural Insight Team). But how robust and replicable is the evidence that spending money on other people improves happiness?
Here, we review all relevant experiments implementing the current best practice of preregistration to answer the question of whether spending money on other people causally increases well-being. We conclude that the causal impact of prosocial spending on well-being is robust and replicable in large samples (n ≥ 200 per condition). We also extract key lessons on when it appears that these emotional benefits are most likely to be detected: when paradigms involve actual behavior, when participants have a choice in deciding who or how to help, and when participants receive information on how their actions help other people. These lessons underscore the hedonic benefits of actual generous behaviors that provide a sense of choice and the feeling that one has created positive change.

Wednesday, November 9, 2022

Estimates suggest high-speed wireless internet significantly increased teen girls' mental health diagnoses--by 90%--relative to teen boys over the period when visual social media became dominant among teenagers

Social Media and Teenage Mental Health: Quasi-Experimental Evidence. Nov 2022. Elaine Guo. Working Paper. https://www.dropbox.com/s/5v9c6ajvmu2of8d/Guo_social_media.pdf

Abstract: Teenage mental health has been a source of growing concern over the past decade, with recent whistleblower testimony pointing to the mental health risks of spending time on social media platforms, especially for girls. This paper investigates the extent to which social media are harmful for teenagers, leveraging rich administrative data from the Canadian province of British Columbia and quasi-experimental variation related to the rollout of wireless internet there. I show neighbourhoods covered by high-speed wireless internet have significantly higher social media use, based on Google search volumes. In the main analysis, I link federally-collected broadband data with 20 years of student records that provide detailed information about individual student health. I then estimate a triple-difference model using this novel data linkage, comparing teen girls to teen boys in terms of school-reported mental health diagnoses, before and after major social media launches, and across neighbourhoods with and without access to high-speed wireless internet. Estimates suggest high-speed wireless internet significantly increased teen girls' mental health diagnoses -- by 90\% -- relative to teen boys over the period when visual social media became dominant among teenagers. I find similar effects across all subgroups, indicating they are not driven by differences in confounding characteristics.  When applying the same strategy, I find null effects for placebo health conditions -- ones for which there is no clear channel for social media to operate. The evidence points to adverse effects of visual social media, in light of large gender gaps in visual social media use and documented risks. In turn, the analysis calls attention to policy interventions that could mitigate the harm to young people due to their online activities.


Despite Google's filtering/ranking, users still appear to be able to tailor their information exposure to maintain their prior beliefs, hence defying that algorithmic impact

Adapting the Selective Exposure Perspective to Algorithmically Governed Platforms: The Case of Google Search. Laura Slechten et al. Communication Research, April 30, 2021, Volume 49, Issue 8. https://doi.org/10.1177/00936502211012154

Abstract: Experimental research on selective exposure on online platforms is generally limited by a narrow focus on specific parts of the information selection process, rather than integrating the entire sequence of user-platform interactions. The current study, focusing on online search, incorporates the entire process that stretches from formulating an initial query to finally satisfying an information need. As such, it comprehensively covers how both users and platforms exercise agency by enabling and constraining each other in progressively narrowing down the available information. During a tailored online experiment, participants are asked to search for social and political information in a fully tracked, manipulated Google Search environment. Although the results show a structural impact of varying search result rankings, users still appear to be able to tailor their information exposure to maintain their prior beliefs, hence defying that algorithmic impact. This corroborates the need to conceptually and methodologically expand online selective exposure research.


Portfolio outcomes: Higher IQ is not associated with enhanced investment performance

The effects of personality and IQ on portfolio outcomes. Chris Firth et al. Finance Research Letters, November 4 2022, 103464. https://doi.org/10.1016/j.frl.2022.103464

Highlights

• We investigate the effects of personality (Big Five traits) and IQ on individuals’ stock portfolio outcomes (activity, biases and performance)

• We use 1238 customers’ responses to a self-report survey and matched with their portfolio trading data.

• Traits have small but significant effects; higher IQ is not associated with enhanced investment performance.

• Other factors, such as customer age, portfolio size and portfolio risk, much better explain outcomes. Financial literacy shows little effect.

Abstract: We use responses to a self-report survey and matched administrative data to investigate the effects of personality (Big Five traits) and IQ on individuals’ stock trading portfolios. Traits have small but significant effects: openness and extraversion are associated with undesirable outcomes whereas conscientiousness is associated beneficially. Higher IQ is associated with lower trading activity but not enhanced investment performance. We postulate these factors influence outcomes in a complex manner, and exert over long timeframes. With portfolio size held constant, financial literacy has little effect. Other factors, such as customer age, portfolio size and portfolio risk, better explain outcomes.


JEL: D14 D91 G11

Keywords: individual decision-makingpersonality traitsBig Fiveinvestment biasesfinancial literacyIQ


3.1. Big Five personality traits

Internal consistency, measured by Cronbach's alpha, ranges from 0.31 to 0.72 for the five TIPI-derived traits and is consistent with (Gosling et al. 2003) (who also show strong correlations from a much longer inventory).

Coefficient sizes from the regressions for trait variables are consistently small7. Typical is the coefficient of 0.05 on conscientiousness (Big5_C) for the regression on Sharpe (investment performance). We interpret this as saying that on average a one standard deviation increase in conscientiousness improves performance by 5% of a standard deviation of the variable Sharpe. Despite the weak effects, the presence or not of statistical significance and the signs of coefficients offer useful insights. In summary: openness (Big5_O) has a negative association with login rate and positive associations with overconfidence and idiosyncratic risk; extraversion (Big5_E) has positive associations with overconfidence and negative associations with idiosyncratic risk and investment performance. We could explain this by suggesting that extraverts – being easily bored – take excessive, unrewarded risk. We might also speculate they would trade more and be more vulnerable to the disposition effect but that does not feature in our results.

Conscientiousness (Big5_C) has positive associations with overconfidence and investment performance. Roberts et al. (2011) report that conscientious people earn more money, which is comparable to our finding of a positive link between conscientious individuals and better returns. Kleine et al. (2016) report that openness tends to have a negative impact on portfolios; we too see that openness is associated with less desirable portfolio outcomes.

3.2. IQ

The IQ scores in our sample are as expected for a general population (see online appendix); we may reasonably state that the brokerage customers are not exceptionally intelligent. The regressions indicate that IQ very weakly predicts lower portfolio activity8. We observe no significant associations between IQ and biases of disposition effect and idiosyncratic risk, though there is a lessening effect for overconfidence. We report no statistically significant effect of IQ on the economically relevant measure (Sharpe). Although these coefficients are directionally consistent with results in (Grinblatt et al. 2012), their lack of statistical significance is somewhat surprising. However, there could be plausible reasons this. Firstly, we have a much smaller sample size than do Grinblatt et al. (2012), and, secondly, the IQ measure we use – from Raven and Court (1998) – is arguably independent of formal education (Almlund et al. 2011) which avoids potentially confounding IQ with numeracy9. Numeracy is known to be an important predictor of success in everyday risk and financial decisions (Smith et al. 2010Cokely et al. 2012Peters 2012).

3.3. Financial literacy

Financial literacy (FinLit) shows no statistically significant effects across six of the regressions, the main exception being the (undesirable) positive coefficient on overconfidence (OverCon). There is also weak evidence of a reduction in the disposition effect (Disp). Fernandes et al. (2014) find that effects of financial literacy diminish when variables for psychological traits are added. Instead, we observe an analogous role for portfolio size (Size). In regressions with Size excluded, financial literacy shows significant effects (see online appendix). These coefficients moderate sharply with Size included (expressed alternatively: with portfolio size held constant, financial literacy does not affect outcomes). For example, excluding Size, a one standard deviation increase in financial literacy lowers idiosyncratic risk by 8% of a standard deviation of the variable Idio. With Size, the effect of literacy is not distinguishable from zero and the coefficient for Size is 17% (see Table 4). We record a correlation of 0.21 (significant at 1%) between IQ and financial literacy (see online appendix). Muñoz-Murillo et al. (2020) likewise find that individuals with higher cognitive abilities are more financially literate.

Taken together, these results hint at a nuanced causal relationship in which those with larger portfolios are more financially literate—either because those more literate are confident enough to hold larger portfolios or because those who plan to hold larger portfolios deliberately acquire a higher level of literacy—but that variance in financial literacy over and above that due to portfolio size has no further effect.

We repeat our earlier caution that the high but uniform financial literacy of our sample customers – in itself noteworthy – could hinder inferences. Moreover, financial literacy could be endogenous, leading to biased parameter estimates. We attempt to overcome this by conducting age-instrumented estimates of the effects of financial literacy (see online appendix) based on the premise that customer age is exogenous, together with evidence from Agarwal et al. (2009) that financial mistakes decrease with age (until the onset of mental impairment). While our IV results are consistent with the overall pattern in our main analysis, we conclude that the methodological hurdles surrounding our financial literacy variable leave the question of its effects unsettled.

Government, trusts, and the making of better roads in early nineteenth century England and Wales

Government, trusts, and the making of better roads in early nineteenth century England and Wales. Alan Rosevear, Dan Bogart & Leigh Shaw-Taylor. Univ. of California at Irvine, Aug 31 2022. https://www.economics.uci.edu/files/docs/workingpapers/2021-2022/turnpike_road_quality%20after_31_Aug_2022.pdf

Abstract: This paper introduces new data to explain how public, private, and non-profit sectors sponsored, financed, and technically developed better roads in early 19th century England and Wales. Our findings show that central Government direct financing was limited to sections of the politically important London-Holyhead Mail Road. By comparison non-profit organizations, known as turnpike trusts, built more new roads by attracting private investors and capable surveyors. We also show the Government Mail Road had the highest quality. Nevertheless, most turnpike trust roads were good quality, indicating their practical achievements. The  broader implications for infrastructure and state capacity are also explained. 

Keywords: State capacity, infrastructure, roads, turnpike trusts, non-profit sector, public private partnerships

JEL: N7, N4, H1, R4


Adolescent females: Rape victimization was predicted by greater curvaceousness and weight but was unrelated to interviewer-rated attractiveness

Association between Dimensions of Physical Attractiveness and Risk of Rape Victimization among Adolescent Females. Anthony W. Hoskin &Daniel Moody. Journal of Aggression, Maltreatment & Trauma, Nov 6 2022. https://doi.org/10.1080/10926771.2022.2142180

Abstract: This study addresses the following question: “Does female physical attractiveness predict the risk of rape victimization?” Data were analyzed from a large sample of females who participated in the National Longitudinal Study of Adolescent to Adult Health (Add Health). Three conceptualizations of physical attractiveness were operationalized: overall attractiveness, curvaceousness, and weight. Bivariate analysis revealed that rape victimization was predicted by greater curvaceousness and weight but was unrelated to interviewer-rated attractiveness. Respondent age, Hispanic ethnicity, race, general health, physical maturation, and number of sexual partners were added to multivariate models in a stepwise fashion, but physical attractiveness and rape victimization were found to be unrelated in all models, and the positive association between weight and victimization fell to statistical non-significance when general health was added to the model. By contrast, curvaceousness consistently predicted greater victimization risk across all models. Among control variables, females who are older, less healthy, and who have more sexual partners face a higher risk of victimization. Add Health data do not present a straightforward picture of the relationship between various dimensions of physical attractiveness and rape victimization. Findings, study limitations, and theoretical implications are discussed.


Keywords: Sexual assaultrapevictimizationphysical attractiveness


Individualization and the decline of homicide: England 1250–1750

Individualization and the decline of homicide: England 1250–1750. Mark Cooney, Jeffery Patterson. Journal of Criminal Justice, November 3 2022, 101997. https://doi.org/10.1016/j.jcrimjus.2022.101997

Abstract: A key issue in criminology is to account for variation in rates of violence across time and place. An important variable largely neglected in the literature is individualism. Building on theoretical ideas proposed by Durkheim, Black, and Baumgartner, we illustrate the role of increased individualism with a case study: the decline of homicide in England, 1250–1750. The qualitative historical materials we present reveal the growth of more individualized conflicts evident in less third-party partisan intervention and a reduced concern with honor. More individualized conflicts were, in turn, a product of a more individualized society, one characterized by increased social distance and mobility. As conflicts individualized they became less lethal, resulting in declining aggregate rates of homicide. Although the case study is historical, our argument has implications for understanding contemporary criminal violence.


Tuesday, November 8, 2022

People in historically rice-farming areas are less happy and socially compare more than people in wheat-farming areas

Lee, C.-S., Talhelm, T., & Dong, X. (2022). People in historically rice-farming areas are less happy and socially compare more than people in wheat-farming areas. Journal of Personality and Social Psychology, Nov 2022. https://doi.org/10.1037/pspa0000324

Abstract: Using two nationally representative surveys, we find that people in China’s historically rice-farming areas are less happy than people in wheat areas. This is a puzzle because the rice area is more interdependent, and relationships are an important predictor of happiness. We explore how the interdependence of historical rice farming may have paradoxically undermined happiness by creating more social comparison than wheat farming. We build a framework in which rice farming leads to social comparison, which makes people unhappy (especially people who are worse off). If people in rice areas socially compare more, then people’s happiness in rice areas should be more closely related to markers of social status like income. In two studies, national survey data show that income, self-reported social status, and occupational status predict people’s happiness twice as strongly in rice areas than wheat areas. In Study 3, we use a unique natural experiment comparing two nearby state farms that effectively randomly assigned people to farm rice or wheat. The rice farmers socially compare more, and farmers who socially compare more are less happy. If interdependence breeds social comparison and erodes happiness, it could help explain the paradox of why the interdependent cultures of East Asia are less happy than similarly wealthy cultures.


Partisan bias in false memories for misinformation about the 2021 Capitol riot: For both true and false events, participants remembered more events that favoured their political party

Partisan bias in false memories for misinformation about the 2021 U.S. Capitol riot. Dustin P. Calvillo, Justin D. Harris & Whitney C. Hawkins. Memory, Sep 28 2022. https://doi.org/10.1080/09658211.2022.2127771

Abstract: Memory for events can be biased. For example, people tend to recall more events that support than oppose their current worldview. The present study examined partisan bias in memory for events related to the January 6, 2021, Capitol riot in the United States. Participants rated their memory for true and false events that were either favourable to their political party or the other major political party in the United States. For both true and false events, participants remembered more events that favoured their political party. Regression analyses showed that the number of false memories that participants reported was positively associated with their tendency to support conspiracy beliefs and with their self-reported engagement with the Capitol riot. These results suggest that Democrats and Republicans remember the Capitol Riot differently and that certain individual difference factors can predict the formation of false memories in this context. Misinformation played an influential role in the Capitol riot and understanding differences in memory for this event is beneficial to avoiding similar tragedies in the future.

Keywords: False memoryfake newsmemory biaspolitical ideology


Most people feel others’ social lives are richer and livelier than theirs

Keeping Up With the Joneses: How Cognitive Availability Biases Everyday Social Comparisons. Sebastian Marc Deri. PhD dissertation, Cornell University 2022. https://ecommons.cornell.edu/bitstream/handle/1813/111944/Deri_cornellgrad_0058F_13102.pdf?sequence=1

Abstract: This dissertation documents the role that cognitive availability plays in distorting the conclusions that people reach about how they measure up to others in domains of everyday concern. The first chapter provides a review of the social comparison literature and an explanation of how my account of social comparison is novel. The second chapter (N=3,293, 11 studies, 3 pre-registered) documents the fact that most people feel others’ social lives are livelier than theirs, and that this is because they can’t help but to bring to mind highly social exemplars when making such comparisons. The third chapter (N= 2,747, 12 studies, 4 pre-registered) documents a robust tendency to compare to above average standards, which cannot solely be explained by motivational factors like social desirability or self-enhancement—adding a wrinkle to the standard above average effect literature by showing that, although people tend to think of themselves as above average in many domains, they also hold and compare themselves to above average standards. The fourth chapter (N=1,703, 3 studies, 1 pre-registered) documents the fact that people feel they are financially worse off than others when thinking about positive instances of wealth (e.g. having a lot in savings) and that this effect can be reversed if people are made to think of positive instances of low economic standing (e.g. having a lot of debt). The fifth and final chapter synthesizes these empirical findings, summarizes my cognitive availability account of social comparison, reviews why it is a novel contribution, and addresses any outstanding concerns.

FDA Deregulation Increases Safety and Innovation and Reduces Prices

Regulating the Innovators: Approval Costs and Innovation in Medical Technologies. Parker Rogers, October 27, 2022. https://parkerrogers.github.io/Papers/RegulatingtheInnovators_Rogers.pdf


Abstract: How does FDA regulation affect innovation and market concentration? I examine this question by exploiting FDA deregulation events that affected certain medical device types but not others. I use text analysis to gather comprehensive data on medical device innovation, device safety, firm entry, prices, and regulatory changes. My analysis of these data yields three core results. First, these deregulation events significantly increase the quantity and quality of new technologies in affected medical device types relative to control groups. These increases are particularly strong among small and inexperienced firms. Second, these events increase firm entry and lower the prices of medical procedures that use affected medical device types. Third, the rates of serious injuries and deaths attributable to defective devices do not increase measurably after these events. Perhaps counterintuitively, deregulating certain device types lowers adverse event rates significantly, consistent with firms increasing their emphasis on product safety as deregulation exposes them to more litigation.

 

---

After moving from Class III (high regulation) to II (moderate), device types exhibited a 200% increase in patenting and FDA submission rates relative to control groups. Patents filed after these events were also of significantly higher quality, as measured by a 200% increase in received citations and market valuations. These effects do not spill over into similar device types.1 For Class II to I deregulations, the rate of patent filings increased by 50%, though insignificantly, and the quality of patent filings exhibited a significant 10-fold improvement, suggesting that litigation better promotes innovation.

[...]

Down-classification yields considerable benefits, as the proponents of deregulation would predict, but what of product safety? Perhaps counterintuitively, I find that deregulation can improve product safety by exposing firms to more litigation. Despite some adverse event rates increasing after Class III to II events (albeit insignificantly), Class II to I events are associated with significantly lower adverse event rates.3 My analysis of patent texts also reveals that inventors focus more on product safety after deregulation. These results suggest that litigation encourages product safety more than regulation [...]

Monday, November 7, 2022

People who are more prone to cry also place a higher focus on morality in their judgements and actions

Only the Good Cry: Investigating the Relationship Between Crying Proneness and Moral Judgments and Behavior. Janis H. Zickfeld et al. Social Psychological Bulletin, Volume 17, Nov 3 2022. https://doi.org/10.32872/spb.6475

Abstract: People cry for various reasons and in numerous situations, some involving highly moral aspects such as altruism or moral beauty. At the same time, criers have been found to be evaluated as more morally upright—they are perceived as more honest, reliable, and sincere than non-criers. The current project provides a first comprehensive investigation to test whether this perception is adequate. Across six studies sampling Dutch, Indian, and British adults (N = 2325), we explored the relationship between self-reported crying proneness and moral judgments and behavior, employing self-report measures and actual behavior assessments. Across all studies, we observed positive correlations of crying proneness with moral judgments (r = .27 [.17, .38]) and prosocial behavioral tendencies and behaviors (r = .20 [.12, .28]). These associations held in three (moral judgment) or two (prosocial tendencies and behaviors) out of five studies when controlling for other important variables. Thus, the current project provides first evidence that crying is related to moral evaluation and behavior, and we discuss its importance for the literature on human emotional crying.


Those with the lowest ability to evaluate scientific evidence overestimate their skill the most

Calibration of scientific reasoning ability. Caitlin Drummond Otten, Baruch Fischhoff. Journal of Behavioral Decision Making, November 4 2022. https://doi.org/10.1002/bdm.2306

Abstract: Scientific reasoning ability, the ability to reason critically about the quality of scientific evidence, can help laypeople use scientific evidence when making judgments and decisions. We ask whether individuals with greater scientific reasoning ability are also better calibrated with respect to their ability, comparing calibration for skill with the more widely studied calibration for knowledge. In three studies, participants (Study 1: N = 1022; Study 2: N = 101; and Study 3: N = 332) took the Scientific Reasoning Scale (SRS; Drummond & Fischhoff, 2017), comprised of 11 true–false problems, and provided confidence ratings for each problem. Overall, participants were overconfident, reporting mean confidence levels that were 22.4–25% higher than their percentages of correct answers; calibration improved with score. Study 2 found similar calibration patterns for the SRS and another skill, the Cognitive Reflection Test (CRT), measuring the ability to avoid intuitive but incorrect answers. SRS and CRT scores were both associated with success at avoiding negative decision outcomes, as measured by the Decision Outcomes Inventory; confidence on the SRS, above and beyond scores, predicted worse outcomes. Study 3 added an alternative measure of calibration, asking participants to estimate the number of items answered correctly. Participants were less overconfident by this measure. SRS scores predicted correct usage of scientific information in a drug facts box task and holding beliefs consistent with the scientific consensus on controversial issues; confidence, above and beyond SRS scores, predicted worse drug facts box performance but stronger science-consistent beliefs. We discuss the implications of our findings for improving science-relevant decision-making.

5 GENERAL DISCUSSION

Across three studies, we find that people with greater ability to evaluate scientific evidence, as measured by scores on the SRS, have greater metacognitive ability to assess that skill. Using a confidence elicitation paradigm common to studies of confidence in knowledge, we found that individuals with low SRS scores greatly overestimate their skill, while those with high skills slightly underestimate their skill, a pattern previously found with confidence in beliefs (e.g., Kruger & Dunning, 1999; Lichtenstein et al., 1982; Lichtenstein & Fischhoff, 1977; Moore & Healy, 2008).

Study 2 replicated these patterns with confidence in SRS skills and with calibration for another skill-based task, the CRT, which assesses the ability to avoid immediately appealing but incorrect “fast lure” answers and then find correct ones (Attali & Bar-Hillel, 2020; Frederick, 2005; Pennycook et al., 2017). As a test of external validity, Study 2 also found that people with better SRS scores had better scores on the DOI, a self-report measure of avoiding negative decision outcomes (Bruine de Bruin et al., 2007); confidence on the SRS, controlling for knowledge, was associated with worse DOI scores.

Study 3 replicated the results of Studies 1 and 2, asking participants how confident they are in each SRS answer, now called “local calibration.” It also assessed “global calibration,” derived from asking participants how many items they thought they answered correctly. Overconfidence was much smaller with the global measure, as found elsewhere (e.g., Ehrlinger et al., 2008; Griffin & Buehler, 1999; Stone et al., 2011). This finding suggests that global calibration may be more appropriate for situations where individuals reflect and act on a set of tasks, rather than act on tasks one by one. However, in an experimental setting, it may also convey a demand characteristic, with an implicit challenge to be less confident (if interpreted as, “How many do you think that you really got right?”), artifactually reducing performance estimates and overconfidence.

Study 3 also included additional measures of construct and external validity. As a test of construct validity, we found that global confidence, controlling for scores, was unrelated to a self-report measure of intellectual humility (Leary et al., 2017), and local confidence, controlling for scores, was unexpectedly positively related to self-reported intellectual humility. These findings may reflect the limitations of self-report measures, including a desirability bias in reporting.

Study 3 further found that SRS scores predicted performance on two science-related tasks: extracting information from a drug facts box (Woloshin & Schwartz, 2011) and holding beliefs consistent with the scientific consensus (as in previous work, Drummond & Fischhoff, 2017). However, confidence, controlling for knowledge, played different roles for these outcomes: It was negatively associated with scores on the drug facts box test, but positively associated with holding beliefs consistent with science on controversial issues. These findings suggest that while those with greater confidence in their scientific reasoning ability may also be more confident in their beliefs on scientific issues, confidence that is out of step with knowledge may hinder decision-making. Neither scores nor confidence was related to self-reported adoption of pandemic health behaviors, perhaps reflecting partisan divisions that reduce the role of individual cognition (e.g., Bruine de Bruin et al., 2020). Future work could examine the role of confidence, above and beyond knowledge, in other science-relevant judgments and decisions, including falling sway to pseudoscientific claims or products.

Individuals' metacognitive understanding of the extent of their knowledge has been related to life events in many domains (Bruine de Bruin et al., 2007; Parker et al., 2018; Peters et al., 2019; Tetlock & Gardner, 2015). Overall, we find that unjustified confidence (Parker & Stone, 2014) in scientific reasoning ability, as reflected in self-reported confidence in the correctness of one's answer adding predictive value to SRS scores (Drummond & Fischhoff, 2017), is associated with reduced avoidance of negative outcomes and worse performance on tasks that require using scientific information, but greater acceptance of the scientific consensus on controversial issues. Unlike Peters et al. (2019), who found that mismatches between skill and confidence were associated with worse outcomes, we found that unjustified confidence (measured both locally and globally) was associated similarly with outcomes at all levels of reasoning ability. These findings may reflect differences between numeracy and scientific reasoning, differences between the studies' measures of confidence and outcomes, or interactions too weak to be detected with the statistical power of the present research. Our findings may also reflect our measures' range restrictions: Here, confidence was elicited as expected performance, thus restricting the extent to which participants with very low or high performance could display underconfidence or overconfidence, respectively. Future work could seek other measures that could further separate the respective contributions of scientific reasoning ability and metacognition about it, such as a subjective scientific reasoning ability scale similar to the Subjective Numeracy Scale (Fagerlin et al., 2007).

Overall, we observed patterns of metacognition for cognitive skills similar to those observed for beliefs, using conventional confidence elicitation methods with known artifacts. Prior research has proposed a variety of methods for measuring overconfidence, with varying strengths and limitations. We discuss several key limitations below; for further discussion of these measurement issues, we refer readers to Lichtenstein and Fischhoff (1977), Erev et al. (1994), Moore and Healy (2008), Fiedler and Unkelbach (2014), Parker and Stone (2014), and Yates (1982).

The dramatically poor performance of the lowest quartile, for both SRS and CRT, is notable. As the groups were identified based on SRS and CRT scores, some of the spread is artifactual (as noted by Lichtenstein & Fischhoff, 1977, and others). One known artifact is the truncated 50–100% response mode, which precludes perfect calibration for participants who answer fewer than 50% of the SRS questions correctly (N = 444 [43% of respondents] in Study 1; N = 34 [34%] in Study 2; and N = 148 [45%] in Study 3). In a post hoc analysis, we treated these respondents as though they had answered 50% of questions correctly. Even with this change, they were still overconfident, by, on average, 29.8% in Study 1 (SD = 10), 31.5% in Study 2 (SD = 11), and 30.5% in Study 3 (SD = 10).

One limitation of these results is that the SRS or CRT tests might have demand effects atypical of real-life tests of cognitive skills, such that participants assume that an experimental task would not be as difficult as these proved to be or want to appear knowledgeable in this setting (Fischhoff & Slovic, 1980). A second possible limitation is that reliance on imperfectly reliable empirical measures somehow affects the patterns of correlations and not just the differences between the groups (Erev et al., 1994; Lichtenstein & Fischhoff, 1977). Attempts to correct for such unreliability have had mixed results (Ehrlinger et al., 2008; Krueger & Mueller, 2002; Kruger & Dunning, 2002). Third, task incentives were entirely intrinsic; conceivably, if clearly explained, calibration-based material rewards might have improved performance. Here, too, prior results have been mixed (Ehrlinger et al., 2008; Mellers et al., 2014). Fourth, our measure of science education, whether participants had a college course, may have been too poor to detect a latent relationship. Fifth, for Study 2, some participants may have seen the CRT items before (Haigh, 2016; Thomson & Oppenheimer, 2016), potentially increasing their scores (Bialek & Pennycook, 2018), with uncertain effects on confidence and calibration. Finally, our version of the CRT, which asked participants to choose between the fast lure and the correct answer, produced higher scores that the usual open-ended response mode and hence might not generalize to other CRT research.

If our results regarding the similarity between calibration for cognitive skill and knowledge prove robust, future work might seek to improve public understanding of science (e.g., Bauer et al., 2007; Miller, 198319982004) by addressing separately the ability to think critically and the need to stop and think critically. If people are as overconfident in their scientific reasoning ability as many participants were here, it may not be enough to correct erroneous beliefs through science communication and education (e.g., Bauer et al., 2007; Miller, 198319982004). People may also need help in reflecting on the limits to their ability to evaluate evidence and their potential vulnerability to manipulation by misleading arguments as well as by misleading evidence. Mental models approaches to science communication offer one potential strategy, by affording an intuitive feeling for how complex processes work (e.g., Bruine de Bruin & Bostrom, 2013; Downs, 2014). The inoculation approach to combating misinformation (Cook et al., 2017; van der Linden et al., 2017) offers another potential strategy, refuting misinformation in advance, so that people have a better feeling for when and how to think about the issues and when and how they can be deceived. Developing effective interventions requires research examining the separate contributions of scientific reasoning ability and metacognition to improving science-relevant judgments and decisions.