200K US congressional speeches + 5K presidential communications related to immigration from 1880 to the present
political speech about immigration is much more positive on average than the past
contextual embeddings of text
nationality mentioned changed the tone of speeches (Mexican, Chinese) still a major factor in how immigrants are spoken of in Congress
anti-Chinese fearmongering in 1880s
Southern and Eastern European immigrants in 1920s
antiimmigration rhetoric of Trump (2017 to 2020)
"Certain types of immigrants can never truly join American society?"
identify relevant speeches
curated and applied a set of lexicons for analyzing relevant frames with semi-automated method
neural contextual embedding models to quantify implicit dehumanizing metaphors
political speeches about immigration today are more positive than the past
being net positive on average since early 1950s
Trump is the first president to express sentiment toward immigration more negative than the average member of his own party
two parties have become increasingly polarized over time
today, Democrats are unprecedentedly positive
generic political polarization observed in Gentzkow et al. by more than a decade
nationality of immigrants continues to matter greatly
there remains a string and growing strain of antiimmigration speech among Replblicans
17M congressional speeches from 1880 to 2020
human annotations and trained ML classifiers to detece immigration related speech with accompanying tone (pro, con, neutral)
applied same models to all presidential communications by American Presidency Project (Bottom)
Fig 1
average sentiment is negative throughout the late 19th and early 20th centuries (Chinese Exclusion Act(1882) to strint immigration quotas (1920s))
the attitude became more positive around the start of WW2
beginning about a dacade after 1965, an overall decline in sentiment amone Republicans and incline among Democrats is observed
Trends for presedential attitudes should be treated more cautiously as there is less text
involves a slight domain shift (the model is trained on congressional speeches)
found a similar pattern
in recent years, presidents are uniformly more proimmigration even the Republican (Ronald Reagan) and the Democrats (Jimmy Carter)
the tone was varied dramatically depending on which groups of immigrants are being discussed
Mexican and Chinese (Italian is Identifying Groups)
Speech mentioning Chinese immigrants were overwhelmingly negative during Chinese exclusion (1882 to 1943)
Attitute toward all groups improved from 1940 to 1970
since the late 1970s, the gap between Italian and Mexican is large as the gap in tone that exists between Replublicans and Democrats today.
this pattern is mirrored in broader regional trends
antiimmigration terms contains the words representing threats (dangerous, cheap), control (permit, violation), and the targets of early antiimmigration legislation (undesirable, Chinese)
proimmigration terms contain the words representing desirable characteristics (industrious), land (property, agriculture), and service (gave, served)
Despite the relatively negative tone toward Mexican in the modern period, Hispanic and Latino had strong positive associations
almost no difference in the frames by two parties in the earlier time period
today, they use strongly divergent use of different frames
these patterns are robust to the exclusion of any individual term as well as to automated lexicon expansion
the most salient aspects
economy is the most uncommon in speeches about immigration the lease salient in both era
only flood and tide metaphor emerge dfrom the semiautomated frame construction process
measure the metaphors based on how probable such terms are as substitutes according to contextual embedding models
Republican used more dehumanizing metaphors
this trend mirrored by congressional tone toward immigration
eventually becoming net positive on average in 1950s
possibly by the humanitarian concerns
43rd to 111th Congress : digitized copy of the Congressional Record
112th to 116th Congress : congressional-record tool by @unitedstates project
data with speaker, party, state and date
Procedural speeches were identified and excluded
presidential communication : all presidential documents from The American Presidency Project
Immigration statistics : Historical Statistics of the United States Millennial Edition Online + census data by the Migration Policy Institute
the mose prominent immigrant nationalities historical data on the countries of origin of the foreign-born US population
45 countries that accounted for at least 1% of the foreitn-born population in at least 1 decade
manually modified the country name and nationality
used L1-regularized LR models to fit the predicted tone labels on all congressional segments classified as relevant
words in the vocab : at lease 20 times used / excluding numbers, punctuation, stop words / counts were binarized
Shapley values computed (reflected in Table 1)
collected direct mentions + group terms + more generic person references with nationality
used to measure dehumanizing metaphorical language for each group
included slang and derogatory terms to identify groups
trained on MLM task
fine-tuning to act as a classifier
to train implicit metaphorical language, began with the representative of that category