Old Bailey Voices: gender, speech and outcomes in the Old Bailey

R
ggplot
OBV
womens history
Author

Sharon Howard

Published

16 December 2018

Introduction

The Old Bailey Voices data is the result of work I’ve done for the Voices of Authority research theme for the Digital Panopticon project. The research theme is exploring reported speech in the Old Bailey Proceedings and the experience of the trial from the late 18th to the late 19th century.

The most personal piece of evidence we have about the women and men who are the subject of the Digital Panopticon is their words, as recorded in the Old Bailey Proceedings. But, what was lost in translation in the space between the spoken and the printed account? And how did the defendant experience the trial process?

This includes 3D modelling of the courtroom and reconstruction of trials, as well as linguistic analysis of the speech corpus. I’ve been focusing on the relationship between defendants’ spoken words and their trials and outcomes.

Last year I gave an initial conference paper. This will be the first of a few blog posts in which I start to dig deeper into the data. First I’ll review the general trends in trials, verdicts and speech, and then I’ll look a bit more closely at defendants’ gender.

The data

The Old Bailey Voices dataset (OBV) brings together the Old Bailey Online (OBO) Proceedings data and the Old Bailey Corpus (OBC). (The data is explained in more detail in the downloadable documentation and on Github.

I’ve already blogged a bit at this blog about the Proceedings data. The Old Bailey Corpus, created by Magnus Huber, enhanced a large sample of the OBO data (407 sessions, c. 14 million spoken words) between 1720 and 1913 for linguistic analysis, including tagging of direct speech and tagging about speakers.

OBV re-combines the OBC’s linguistic tagging with OBO’s structured trials tagging (including verdicts and sentences). In OBC, individual speakers’ unique OBO identifies are not linked to their tagged utterances, and we needed this information for defendants to make it possible to do a number of things:

  • identify which defendants spoke in court and, just as important, which did not
  • what kind of utterances defendants made (did they ask questions and answer questions? did they make defence statements?)
  • how much they spoke (both word counts and utterance counts)
  • how they interacted with other speakers

We can then try to correlate these pieces of information with other known data for the defendant: gender, age, occupation, offence, verdict, sentence; and, eventually, with the life archives data from Digital Panopticon which includes their actual punishments and later life courses.

OBV consists of single-defendant trials for the period 1780-1880, amounting to c.21000 trials in 227 Old Bailey court sessions; c.15850 of the trials contain first-person speech tagged by the OBC. Trials with multiple defendants have been excluded from the dataset because of the added complexity of matching speaker to utterances (and they aren’t always individually named, so it can actually be impossible to do this).

Trials have also been simplified: if there are multiple offences, verdicts or sentences only the most “serious” is retained.

So I’d emphasise that this isn’t a full picture of OBO; there are also aspects of the sampling methodology used by the OBC that are potentially problematic (though I think not in a major way) for my analysis, but I haven’t investigated these properly yet. As a result, this work is quite provisional and likely to be subject to future revisions. And as you’ll see, at this stage I seem to be generating more questions than answers…

Code
# most frequently used r packages 

library(tidyverse)

theme_set(theme_minimal()) # set preferred ggplot theme 

library(lubridate)  # nice date functions
library(scales)  # additional scaling functions mainly for ggplot (eg %-ages)
library(knitr)  # for kable (nicer tables)
library(kableExtra)  


# additional packages 
library(patchwork) # for multiple ggplots in one plot


# obv summary data 
obv_trials_data <- read_tsv(here::here("site_data/obv/obv_defendants_trials.tsv"), na=c("NULL",""," "))

## process_data

# dates: make sess_date a proper date; add year, decade (1-10), quarter
# since the focus here is on gender, filter out unknown from the start (there are hardly any)
# speech2: abbreviated version of speech for convenience
# recategorise offcats: use offcat unchanged: breakingPeace, damage, miscellaneous, royalOffences, sexual, violentTheft, kill; deception: fraud, forgery, other; theft: subcats with >160 cases -> subcat, anything less -> 'other'
# recategorise verdicts: special and misc -> misc; guilty noncomposmentis and ng insane -> insane; notguilty unchanged; guilty "" unchanged; reduced verdicts -> "part"; recommendation -> rece
# verdict type pleaded guilty | found guilty | not guilty
# obc_type  tagged/untagged trials
# age groups
# exclude: 1784 (single trial) and 1780, null gender
# guilty/not guilty verdicts only (ie remove any misc/special)

obv_trials <- obv_trials_data %>%
  mutate( 
    #dates
    sess_date = parse_date(as.character(sess_date), format = "%Y%m%d") ,
    year = year(sess_date),
    decade = ifelse( year %% 10 == 0,  (year-10) - (year %% 10), year - (year %% 10) ),
    decs = ifelse(between(decade, 1780,1820), "1781-1830", "1831-1880"),
    # speech
    speech = recode(speech,
                     "deft_speaks" = "speaks",
                     "deft_silent" = "silent",
                     .default = "none"),
    obc_type = recode(speech,
                      "speaks" = "tagged",
                      "silent" = "tagged",
                      .default = "untagged"),
    speech= as_factor(speech)
  ) %>%
  # nb: a few guilty pleas with speech = found guilty not pleaded - I think this is technically incorrect. see t18530613-723 - judge refused to allow a guilty plea after a jury had been sworn. (only about 20 trials)
  mutate(
    verdict_type = case_when(
        deft_vercat == "notGuilty" ~ deft_vercat,
        str_detect(deft_versub, "plead") & speech !="none" ~ "foundGuilty",
        str_detect(deft_versub, "plead") ~ "pleadedGuilty",
        deft_vercat == "guilty" ~ "foundGuilty",
        TRUE ~ NA_character_ 
      ),
    verdict_type = fct_relevel(verdict_type, "notGuilty"),
    # gender
    deft_gender = as_factor(deft_gender)
  ) %>%
  # tidy up names
  rename(
    def_spk = obv_def_spk, 
    gender = deft_gender, age = deft_age, 
    offcat = deft_offcat, vercat = deft_vercat, puncat = deft_puncat) %>% 
  # filtering
  filter(year != 1784, year !=1780, !is.na(gender)) %>% 
  # only include trials with a guilty/not guilty verdict (this is nearly all)
  filter(!is.na(verdict_type)) %>%  # (2017 conference obv2_f_gng)
  droplevels() # for vcd work 

# + only trials with tagged speech
# filter(trial_tagged == 1) # (obv2_f_gng_speech)



# manual colour schemes (using colorbrewer palettes) 

# more precise control over colours 
# not just pretty :-) use based on chosen scale fill/colour
# will get an error message if there aren't enough colours in the list...
# use colorbrewer directly for larger number of colours; available:
#Diverging  BrBG, PiYG, PRGn, PuOr*, RdBu*, RdGy, RdYlBu*, RdYlGn, Spectral
#Qualitative  Accent, Dark2, Paired, Pastel1, Pastel2, Set1, Set2, Set3
#Sequential   Blues, BuGn, BuPu, GnBu, Greens, Greys, Oranges, OrRd, PuBu, PuBuGn, PuRd, Purples, RdPu, Reds, YlGn, YlGnBu, YlOrBr, YlOrRd

# speech PuOr
speech_col3 <- c("#b35806","#fdb863","#8073ac")

# verdict RdBu
verdict_col3 <- c('#ca0020','#f4a582','#0571b0') 

# gender RdYlBu 
gender_col2 <- c("#f46d43","#4575b4")



# plots must have unique chunk names else the images won't save... 

Overview

This and following posts use the data from 1781-1880 (decades = eg 1781-90); I omit a handful of ‘special’/miscellaneous verdicts and any unknown/indeterminate gender defendants. This gives a total of 20711 trials.

A random slice of the data (selected variables):

Code
kable( 
  sample_n(obv_trials, 10) %>%
    select(obo_trial, sess_date, decade, speech, trial_speech_wc, deft_total_wc, gender, age, offcat, vercat, verdict_type) %>%
    rename(trial_wordcount = trial_speech_wc, deft_wordcount = deft_total_wc) %>%
    arrange(sess_date), format = "html"
  ) %>%
  kable_styling(bootstrap_options = "striped")
obo_trial sess_date decade speech trial_wordcount deft_wordcount gender age offcat vercat verdict_type
t18070114-95 1807-01-14 1800 speaks 233 116 male 38 miscellaneous guilty foundGuilty
t18150405-1 1815-04-05 1810 speaks 2486 132 male 29 kill guilty foundGuilty
t18180909-323 1818-09-09 1810 silent 40 0 male NA theft notGuilty notGuilty
t18250113-46 1825-01-13 1820 silent 203 0 male 34 theft guilty foundGuilty
t18261026-221 1826-10-26 1820 none NA NA male NA theft notGuilty notGuilty
t18261026-142 1826-10-26 1820 speaks 127 7 male 25 theft guilty foundGuilty
t18430821-2521 1843-08-21 1840 speaks 125 10 male 14 theft guilty foundGuilty
t18510616-1387 1851-06-16 1850 none NA NA male 60 sexual guilty foundGuilty
t18620106-196 1862-01-06 1860 none NA NA male 60 theft guilty pleadedGuilty
t18640411-422 1864-04-11 1860 silent 1487 0 male 38 kill guilty foundGuilty

Looking at trials by decade intriguingly suggests three distinct phases. Overall numbers rose dramatically in the early 19th century, peaked in the 1830s-40s and then dropped even more rapidly in the 1850s to return to levels similar to the first three decades. This is unlikely to have very much at all to do with actual levels of crime. (The 1850s drop is very easy to explain: legislation in 1855 channelled a high proportion of relatively minor thefts to lower courts.)

Code
(obv_trials_decade_plot <- 
  obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  count(decade) %>%
  ggplot(aes(x=factor(decade), y=n)) +
  geom_col(fill="#003c30")  +
  labs(title="OBV: trials by decade 1781-1880", x="decade", y="count") 
)

Code
#BrailleR::VI(obv_trials_decade_plot)

Verdicts

Code
#obv_trials %>% #filter(verdict_type !="pleadedGuilty") %>%    add_tally() %>% rename(tot=n) %>%  filter(vercat=="guilty") %>%  count(vercat, tot) %>%  mutate(pc = round(n/tot*100, 1), pc_txt = paste0(pc, "%")) 

obv_trials_conv_by_decade <- obv_trials %>% #filter(verdict_type !="pleadedGuilty")  %>%    
  add_count(decade) %>% rename(cd=n) %>%  
  count(vercat, cd, decade) %>%  mutate(pc = round(n/cd*100,1)) %>%  
  filter(vercat=="guilty") 

obv_trials_conv_excpg_by_decade <- 
  obv_trials %>% filter(verdict_type !="pleadedGuilty")  %>%    
  add_count(decade) %>% rename(cd=n) %>%  
  count(vercat, cd, decade) %>%  mutate(pc = round(n/cd*100,1)) %>%  
  filter(vercat=="guilty") 

It’s important to note from the start that conviction was by far the most likely outcome for any defendant. Overall, 78.1% of all defendants were convicted.

But conviction rates fluctuated considerably over the decades; at their lowest during the 1780s at 66.1% and peaking during the 1840s at 83%:

Code
obv_trials %>%
  add_count(decade) %>% rename(cd=n) %>%
  count(vercat, cd, decade) %>%
  mutate(pc = n/cd*100) %>%
  filter(vercat=="guilty") %>%
  ggplot(aes(x=factor(decade),  y=pc)) +
  geom_col(fill="#d6604d") +
  ylim(0,100) +
  labs(x="decade", y="% of defendants", title="OBV: conviction rates by decade 1781-1880") 

The spike in the 1790s is a bit misleading; for two years reporting of acquittals was suppressed, so conviction rates are somewhat over-stated in that decade. Without archival research, we can’t know exactly how much impact this had, but we can get a pretty good idea. First, compare with adjacent years:

Code
obv_trials %>%
  filter(between(year, 1785,1796)) %>%
  add_count(year) %>% rename(cd=n) %>%
  count(vercat, cd, year) %>%
  mutate(pc = n/cd*100) %>%
  filter(vercat=="guilty") %>%
  ggplot(aes(x=factor(year),  y=pc)) +
  geom_col(fill="#d6604d") +
  ylim(0,100) +
  labs(x="decade", y="% of defendants", title="OBV: conviction rates by year 1785-96")

It’s pretty obvious which years were censored! And simply excluding these two years makes their impact on the 1790s’ conviction rate very clear:

Code
obv_trials %>%
  filter(!between(year, 1791, 1792)) %>%
  add_count(decade) %>% rename(cd=n) %>%
  count(vercat, cd, decade) %>%
  mutate(pc = n/cd*100) %>%
  filter(vercat=="guilty") %>%
  ggplot(aes(x=factor(decade),  y=pc)) +
  geom_col(fill="#d6604d") +
  ylim(0,100) +
  labs(x="decade", y="% of defendants", title="OBV: conviction rates by decade, excluding 1791-2") 

(In the rest of this post I’ll exclude 1791-92 where it seems more appropriate.)

Convictions

However, simply looking at conviction rates hides an important distinction. There were actually two types of conviction: defendants could be found guilty by a jury, or they could plead guilty. In fact, the growth of guilty pleas during the 19th century may well be the single most significant trend in the verdicts data.

Defendants very rarely pleaded guilty in felony trials during the 17th and 18th centuries (sometimes they wanted to and were actually discouraged from doing so by judges): very often their lives were at stake, and the trial itself was their only certain opportunity to present to a judge mitigating circumstances and character references which would be considered in post-conviction pardoning decisions. But the end of the Bloody Code in the 1820s lowered the stakes in the majority of trials, while the growing use of imprisonment from the late 18th century gave judges much more fine-grained sentencing options. Guilty pleas and plea bargaining rapidly - very rapidly! - became much more significant.

Code
obv_trials %>%
  filter(!between(year, 1791,1792)) %>%
  count(decade, verdict_type) %>%
  ggplot(aes(x=factor(decade), y=n, fill=verdict_type)) +
  geom_col(position = "fill") +
  scale_y_continuous(labels=percent_format()) +
  scale_fill_manual(values = verdict_col3) +
  labs(x="decade", y="% of defendants", title="OBV: verdicts, by decade 1781-1880 (ex 1791-2)", fill="verdict") 

If we look only at defendants who pleaded not guilty (ie, they were tried by a jury), the overall conviction rate was 74.5%. In the 1780s, the conviction rate was 66.1%, rising to peak at 78.7% in the 1830s and then dropping to 63.5% in the 1870s.

Code
obv_trials %>%
  filter(!between(year, 1791,1792)) %>%
  filter(verdict_type !="pleadedGuilty")  %>%
  add_count(decade) %>% rename(cd=n) %>%
  count(vercat, cd, decade) %>%
  mutate(pc = n/cd*100) %>%
  filter(vercat=="guilty") %>%
  ggplot(aes(x=factor(decade),  y=pc)) +
  geom_col(fill='#d6604d') +
  ylim(0,100) +
  labs(x="decade", y="% of defendants", title="OBV: jury conviction rates, by decade (ex 1791-2)") 

How might we explain this rise and fall in juries’ tendency to convict defendants? One thing to note is that the rise of plea bargaining was very likely to filter out more hopeless cases, so defendants tried by juries in the 1850s-70s may actually have been more innocent, on average, than those tried in the 1780s-1820s. But there are other factors at work that don’t have much to do with actual guilt or innocence.

The first is the pressure of court business and speed of trials. For jury trials only, the four decades with the highest trial counts had the highest conviction rates (by a considerable margin); and the six decades with the lowest trial counts had the lowest conviction rates. The difference is very clear:

Code
obv_trials %>%
  filter(!between(year, 1791,1792)) %>%
  filter(verdict_type !="pleadedGuilty")  %>%
  add_count(decade) %>% rename(nd=n) %>%
  count(vercat, decade, nd) %>% rename(guilty = n) %>%
  mutate(pc = round(guilty/nd,2)) %>% 
  filter(vercat=="guilty") %>%
  ggplot(aes(x=nd, y=pc,  label=paste0(decade,"s") )) +
  geom_label(size=3) +
  scale_y_continuous(labels = percent_format(), limits = c(0.6,0.85)) +
  labs(x="number of trials in decade", y="convicted %", colour="decade", title="OBV: conviction rates and trial counts by decade (ex 1791-2), jury trials only")

And what that looks like when guilty pleas are included:

Code
obv_trials %>%
  filter(!between(year, 1791,1792)) %>%
  #filter(verdict_type !="pleadedGuilty")  %>%
  add_count(decade) %>% rename(nd=n) %>%
  count(vercat, decade, nd) %>% rename(guilty = n) %>%
  mutate(pc = round(guilty/nd,2)) %>% 
  filter(vercat=="guilty") %>%
  ggplot(aes(x=nd, y=pc,  label=paste0(decade,"s") )) +
  geom_label(size=3) +
  scale_y_continuous(labels = percent_format(), limits = c(0.6,0.9)) +
  labs(x="number of trials in decade", y="convicted %", colour="decade", title="OBV: conviction rates and trial counts by decade (ex 1791-2), all trials")

Looking at the whole picture highlights (again) that in many respects making comparisons of outcomes in jury trials over the whole of the century can be misleading: the transformations in punishments and development of guilty pleading, major changes in trials brought by the growing significance of both defence lawyers and professional police officers, and the transfer of much of the court’s business to lower courts in the 1850s, all mean that being tried at the Old Bailey was a completely different experience (for both defendants and jurors) in 1781 and in 1880.

All of which makes the correlation between juries’ tendency to convict and the number of trials dealt with all the more noteworthy. At the same time, it’s not entirely surprising, given the nature of trials during the period. Juries heard numerous trials in a sessions (rather than modern practice of one jury per trial), and

[in the 18th century] trials were very short, averaging perhaps half an hour per case… With the abolition of the death penalty for many crimes in the 1820s, trials became even shorter: in 1833 one commentator calculated that the average trial took only eight and a half minutes.

Speech

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  mutate(speech = ifelse(speech!="speaks", "silent", "speaks") ) %>%
  add_count(decade) %>% rename(cd=n) %>%
  count(speech, cd, decade) %>%
  mutate(pc = n/cd*100) %>%
  filter(speech=="speaks") %>%
  ggplot(aes(x=factor(decade),  y=pc)) +
  geom_col(fill="#8073ac") +
  ylim(0,100) +
  labs(x="decade", y="% of defendants", title="OBV: % of defendants who speak in court by decade 1781-1880") 

But, as with verdicts, the picture is a bit more complicated than a simple “defendant speaks”/“defendant silent” choice. In terms of defendant speech, there are three types of trial:

  • trials in which there is reported speech by the defendant (labelled “speaks” below)
  • trials in which there is reported speech by other participants, but the defendant is (apparently) silent (“silent”)
  • trials in which there is no reported speech at all (“none”)
Code
obv_trials  %>% 
  #filter(!between(year, 1791,1792)) %>%
  count(decade, speech) %>%
  ggplot(aes(x=factor(decade), y=n, fill=speech)) +
  geom_col(position="fill") +
  scale_fill_manual(values=speech_col3) +
  scale_y_continuous(labels = percent_format()) +
  labs(x="decade", y="% of trials", title="OBV: defendant speech in trials, by decade, 1781-1880", fill="defendant\nspeech") 

This repeats my earlier findings about changes in patterns of speech between the 1780s and 1870s: overall, there was a massive decline in defendants’ speaking in trials, and this encompassed two distinct trends: more trials in which there was speech but the defendant was silent and more trials without any reported speech at all (which I’ll call “no-speech trials” for conciseness, but emphasise that it means no reported speech; the Proceedings never, even at their most detailed, published all speech). And also there are instances of trials in which the defendant spoke but their words were not directly reported; I haven’t (yet) found a way to work out how often this happened.

There are three main causes of no-speech trials:

  • The defendant pleaded guilty, so there was no trial as such; the largest category in this data (61% of no-speech trials)
  • Trials which didn’t proceed because the prosecutor or key witnesses failed to turn up, and so the defendant was acquitted for lack of evidence (these became less common as prosecution by police became more the norm)
  • Trial reports consisting of summaries rather than reporting evidence directly

The three are not evenly spread chronologically. Directly reporting the speech of trial participants rather than summarising evidence was a fairly gradual development in the Proceedings from the 1730s onwards, and was not quite complete by the 1780s when the role of the publication shifted to a semi-official, government-subsidised record.

But at the same time there was a lot of anxiety about publishing details of evidence in trials that ended in acquittal (in case criminals used the information to learn more effective defence strategies!). This is why reporting of acquittals was suppressed in the early 1790s. I think that summary-report trials in the 1780s also skewed towards acquittals, but this needs more detailed investigation.

At any rate, until the 1820s, the majority of no-speech trials were acquittals, since (as already noted) guilty pleas were rare. In fact, it’s clear once again that even though there was a trend even before that, the 1830s represented a significant turning point:

Code
obv_trials %>%
  filter(!between(year, 1791,1792)) %>%
  filter(obc_type == "untagged") %>%
  count(verdict_type, decade) %>%
  ggplot(aes(x=factor(decade), y=n, fill=verdict_type) ) +
  geom_col(position="fill") +
  scale_y_continuous(labels=percent_format()) +
  scale_fill_manual(values = verdict_col3) +
  labs(title="OBV: verdicts by decade, no-speech trials only (ex 1791-2)", x="decade", y="% of defendants", fill="verdict") 

Repeating the comparison of jury conviction rates and trial numbers, but with the added variable of whether the defendant speaks or not, gets interesting (filtering out trials without reported speech):

Code
obv_trials %>%
  filter(!between(year, 1791,1792)) %>%
  filter(verdict_type !="pleadedGuilty", speech !="none")  %>%
  add_count(decade) %>% rename(nd=n) %>%
  add_count(decade, speech) %>% rename(ns=n) %>%
  count(vercat, decade, nd, ns, speech) %>% rename(guilty = n) %>%
  mutate(pc = round(guilty/ns,2)) %>% arrange(decade, vercat, speech) %>% 
  filter(vercat=="guilty") %>%
  ggplot(aes(x=nd, y=pc,  label=paste0(decade,"s"), colour=speech )) +
  geom_label(size=3) +
  geom_point(size=0, stroke=0) +
  scale_y_continuous(labels = percent_format(), limits = c(0.45,0.95)) +
  scale_color_manual(values= c("#e08214", "#8073ac") ) +
  guides(colour = guide_legend(override.aes = list(size = 3, shape="square"))) +
  labs(x="number of trials in decade", y="conviction %", colour="defendant\nspeech", title="OBV: conviction rates and trial counts, jury trials with speech only (ex 1791-2)") 

Again, we can see how speaking is associated with higher conviction rates; speaking really wasn’t good for defendants. Interestingly, though, the gap seems to be narrower in the decades with the largest numbers of trials.

Gender

It’s time to bring gender into the picture, and to anyone familiar with historical or contemporary crime statistics it won’t come as much of a surprise that female defendants were very much in the minority.

Code
obv_trials %>% 
  #filter(!between(year, 1791,1792)) %>%
  count(gender) %>%
  mutate(pc = round(n / sum(n)*100,1), pc_txt = paste0(gender, "\n", pc, "%")) %>%
  ggplot(aes(x="", y=n, fill=gender)) +
  geom_col(width=1) +
  geom_text(aes(x=1.2, label=pc_txt), position = position_stack(vjust=0.35), colour="white", size=4.5) +   # x= and vjust adjust label positioning
  coord_polar(theta="y") +
  scale_fill_manual(values = gender_col2)  +
  scale_y_continuous(breaks = NULL) +  # white lines
  guides(fill=FALSE) + # remove legend
  ggtitle("OBV: Gender of defendants") +
  theme(panel.grid.major = element_blank(),  # white lines
        axis.ticks=element_blank(),  # the axis ticks
          axis.title=element_blank(),  # the axis labels
          axis.text.x=element_blank()) # the 0.00... 1.00 labels

What may be more unexpected though is just how much the proportion of female defendants decreased over the course of the century:

Code
#obv_trials %>%  add_count(decade) %>% rename(tot = n) %>%  count(decade, gender, tot) %>%  filter(gender=="female") %>%  mutate(pc = round(n/tot*100, 1) )

obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  count(decade, gender) %>%
  ggplot(aes(x=factor(decade), y=n, fill=gender)) +
  geom_col(position = "fill")  +
  scale_fill_manual(values = gender_col2) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(title="OBV: gender of defendants, by decade", y= "% of defendants", x="decade") 

In fact the percentage of female defendants fell from 28.1% in the 1780s to 12.1% in the 1870s.

Gender and verdicts

Looking at outcomes by gender, overall male defendants were more likely than female to be convicted (79.0% to 74.9%). But again, we’ll see how this is complicated by the different verdict categories.

Code
#obv_trials %>% add_count(vercat) %>% rename(v = n) %>% add_count(gender) %>% rename(g = n) %>% count(vercat, gender, v, g) %>%  mutate(pc = round(n/g*100,2) ) %>% arrange(gender, vercat)

# the other approach: count(gender, vercat) %>% group_by(gender) %>% mutate(pc= n/sum(n)) %>% ungroup
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  add_count(gender) %>% rename(tot = n) %>%
  count(gender, vercat, tot) %>%
  mutate(pc = round(n/tot, 3), pc_lab = paste0(pc*100, "%")  ) %>%
  ggplot(aes(x=gender, y=pc, fill=vercat, label=pc_lab)) +
  geom_col()  +
  geom_text(position=position_stack(vjust=0.5), colour = "white", size=4) +
  scale_fill_manual(values =verdict_col3  ) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(title="OBV: gender of defendants and verdicts", y= "% of defendants", fill="verdict") 

There are also some variations by decade.

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  count(gender, vercat, decade) %>%
  ggplot(aes(x=gender, y=n, fill=vercat)) +
  geom_col(position="fill")  +
  scale_fill_manual(values = verdict_col3  ) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(title="OBV: gender of defendants and verdicts, by decade", y= "% of defendants", fill="verdict")  +
  facet_wrap(~decade, ncol = 5) 

In most decades, men are clearly more likely to be convicted than women. That makes the two decades in which the opposite occurred (the 1800s and 1850s) rather intriguing. While the 1800s may be an isolated anomaly, the three decades from the 1830s to the 1850s look like more of a pattern; what could be happening there?

Gender and speech

Were women defendants as likely to speak as men and are there any gendered differences in outcomes? And does that change over time?

The first notable thing is that women defendants do seem to be more likely to speak than men. But if you look closely, the proportions of men and women who remain silent in trials containing speech are very similar.

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  add_count(gender) %>% rename(tot_sp = n) %>%
  count(speech, gender, tot_sp) %>%
  mutate(pc_sp = round(n/tot_sp,3), pc_lab = paste0(pc_sp*100, "%") ) %>%
  ggplot(aes(x=gender, y=pc_sp, fill=speech, label=pc_lab ) ) +
  geom_col() +
  geom_text(position=position_stack(vjust=0.5), colour = "white", size=4) +
  scale_fill_manual(values = speech_col3) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  + 
  coord_flip() +
  #theme(legend.position = "bottom") +
  guides(fill = guide_legend(reverse = TRUE)) +
  labs(y="% of defendants", title = "OBV: defendant gender and speech")    

Bearing in mind the different types of trials, this can be teased out into two distinct strands:

  1. men’s trials are more likely to be no-speech trials;
  2. female defendants are more likely to speak than males in trials containing speech.

Gender, speech and verdicts

By this point, hopefully you shouldn’t be at all amazed to learn that male defendants pleaded guilty considerably more frequently than women did:

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  add_count(gender) %>% rename(tot_sp = n) %>%
  count(verdict_type, gender, tot_sp) %>%
  mutate(pc_sp = round(n/tot_sp, 3), pc_lab = paste0(pc_sp*100,"%") ) %>%
  ggplot(aes(x=gender, y=pc_sp, fill=verdict_type, label=pc_lab ) ) +
  geom_col() +
  geom_text(position=position_stack(vjust=0.5), colour = "white", size=4) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  scale_fill_manual(values = verdict_col3 ) + 
  coord_flip() +
  #theme(legend.position = "bottom") +
  guides(fill = guide_legend(reverse = TRUE))  +
  labs(title="OBV: gender and verdicts", y="percentage of defendants", fill="verdict") 

I can speculate on a number of possible reasons for this pattern, but answers may need different data which will be considered in future posts. For example:

  • men might be more likely to be caught red-handed (and therefore plead guilty); this or similar questions could be explored using linguistic analysis of the OBV speech data.
  • men were more likely to be repeat offenders and more familiar with (and known to) the criminal justice process, making them more likely to plea bargain; this could be investigated using the Digital Panopticon life archives, or references to previous trials in the reports.

In the graph above, it also appears that men were also somewhat more likely to be found guilty than women; this can be seen more easily by looking only at jury trials.

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  filter(verdict_type !="pleadedGuilty") %>%
  add_count(gender) %>% rename(tot_sp = n) %>%
  count(verdict_type, gender, tot_sp) %>%
  mutate(pc_sp = round(n/tot_sp, 3), pc_lab = paste0(pc_sp*100, "%")) %>%
  ggplot(aes(x=gender, y=pc_sp, fill=verdict_type, label=pc_lab ) ) +
  geom_col() +
  geom_text(position=position_stack(vjust=0.5), colour = "white", size=4) +
  scale_fill_manual(values = verdict_col3 )  +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0))) + 
  coord_flip() +
  #theme(legend.position = "bottom") +
  guides(fill = guide_legend(reverse = TRUE))  +
  labs(title="OBV: gender and verdicts in jury trials only", y="percentage of defendants", fill="verdict") 

The jury verdicts can be broken down further to consider the effects of speech:

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  filter(verdict_type !="pleadedGuilty") %>%
  add_count(gender, verdict_type) %>% rename(tot = n) %>%
  count(gender, verdict_type, speech, tot) %>%
  mutate(pc = round(n/tot, 3), pc_lab = paste0(pc*100, "%")) %>%
  ggplot(aes(x=gender, y=pc, fill=speech, label=pc_lab)) +
  geom_col()  +
  geom_text(position=position_stack(vjust=0.5), colour = "white", size=4) +
  scale_fill_manual(values = speech_col3  ) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(title="OBV: gender, verdicts and speech, jury trials only", y= "% of defendants", fill="speech")  +
  facet_wrap(~verdict_type, ncol = 3) 

It’s already been shown that the percentages of men and women who stayed silent in trials with speech were quite similar (31.4 to 30.5); the bigger gendered differences were found in the two other speech categories (defendant speaks and no-speech). Here, it can also be seen that the proportions of speech categories were very similar for men and women when juries convicted; but in acquittals, men were quite a lot more likely to have kept silent than women.

I’ll finish by looking again at the gender/decade/verdict breakdown for jury trials only:

Code
obv_trials %>%
  #filter(!between(year, 1791,1792)) %>%
  filter(verdict_type!="pleadedGuilty") %>%
  count(gender, vercat, decade) %>%
  ggplot(aes(x=gender, y=n, fill=vercat)) +
  geom_col(position="fill")  +
  scale_fill_manual(values = verdict_col3  ) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(title="OBV: gender and verdicts by decade, jury trials only", y= "% of defendants", fill="verdict")  +
  facet_wrap(~decade, ncol = 5) 

This looks very similar to the previous breakdown (as would be expected of the top row, when there were few guilty pleas, in any case). But the reversal of the usual female-male conviction rate in the 1850s is even more striking. Let’s zoom in:

Code
obv_trials %>%
  filter(between(decade, 1830,1850)) %>%
  count(gender, vercat, decade) %>%
  ggplot(aes(x=gender, y=n, fill=vercat)) +
  geom_col(position="fill")  +
  scale_fill_manual(values = verdict_col3  ) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(subtitle="all verdicts", y= "% of defendants", fill="verdict")  +
  theme(legend.position = "bottom") +
  facet_wrap(~decade, ncol = 5)  +

# urrghh reversing stuff
obv_trials %>%
  filter(between(decade, 1830,1850), verdict_type !="pleadedGuilty") %>%
  count(gender, verdict_type, decade) %>%
  ggplot(aes(x=gender, y=n, fill=verdict_type)) +
  geom_col(position = position_fill(reverse = TRUE)) +
  scale_fill_manual(values = c('#f4a582','#ca0020') , guide=guide_legend(reverse=TRUE) ) +
  scale_y_continuous(labels=percent, expand = expand_scale(mult = c(0, 0)))  +
  labs(subtitle="jury verdicts only", y= "% of defendants", fill="verdict")  +
  theme(legend.position = "bottom") +
  facet_wrap(~decade, ncol = 5)  +
  
plot_layout(ncol=2) + plot_annotation(title="OBV: defendant gender and verdicts by decade, 1830s-50s")

Juries in the 1850s were much harsher on women compared to men than in any other decade, and already slightly more so in the 1840s. Something else for the list of questions!

Concluding thoughts and what’s next

There are some obvious questions to be addressed>

  • Why was speaking bad for defendants?
  • And why was it seemingly even worse for male defendants compared to women?

An obvious topic for further exploration is the nature of the offences men and women committed. The analysis so far already suggests some interesting relationships between the variables of gender, trial/defendant speech and verdicts; types of offence will be an important factor. (Though, even if there are differences, causation is likely to be hard to discern.)

But it’s already quite a complex picture, and adding even more variables makes it very difficult to analyse or measure the significance of gender compared to other factors. So, in the next post I’m going to start experimenting with some statistical methods and visualisations which are specifically designed for complex categorical data, but are less familiar to most historians than the kind of analysis and bar charts I’ve been using so far. Then I’ll take a closer look at offences. In subsequent posts, I’ll (yes, finally!) start to explore and analyse the words spoken in trials.

Resources

Old Bailey Voices demonstration with trial reconstructions and more (work in progress)

Old Bailey Corpus

Old Bailey Voices dataset