The word 'very' carries emotional weight, irony, and nuance that AI often misses. This article explores its impact on NLP, using the documentary 'Brexit: A Very British Civil War' as a case study.
The adverb 'very' appears deceptively simple: it intensifies an adjective. But in natural language processing, this single word exposes deep gaps in machine understanding of emotion, irony, and context. A recent review of the documentary Brexit: A Very British Civil War—described as a 'hoot' and 'irreverent'—highlights why 'very' remains a challenge for AI systems tasked with sentiment analysis, text generation, and translation.
When the documentary's title uses 'very', it does more than amplify 'British'. It layers the phrase with sarcasm and cultural meaning—'very British' evokes stereotypes of politeness and order, clashing with the harsh reality of 'civil war'. Sentiment analysis models, trained on surface-level correlations, frequently underweight such intensifiers. They treat 'very' as a weak modifier, scoring the phrase closer to neutral than the intended dark comedy.
Consider how the review characterizes the film: 'a total panto dame', 'a hilarious nightmare'. These phrases signal emotional excess that 'very' introduces. A typical lexicon-based sentiment tool would assign a modest positive score to 'British' and a negative one to 'civil war', but miss the ironic twist that 'very' provides. The result is a flat, literal interpretation that fails to capture the documentary's tone.
'No documentary about Brexit should be this much of a hoot.' — The review's opening line, which 'very' helps set as wry rather than literal.
Improving this requires models to learn 'very' as a pragmatic intensifier, not just a lexical one. Currently, sentiment datasets often annotate 'very good' as a simple positive, while 'very bad' as negative, but miss the scalar gradations in between—like 'very ironic' or 'very sarcastic'.
Large language models like GPT-4 and BERT are trained on terabytes of text where 'very' appears thousands of times per million words. But its distribution is skewed: it appears far more often in opinion pieces, reviews, and dramatic writing than in news reports or technical documentation. The documentary review—a highly emotional, opinionated text—rides this bias. Models trained on such corpora learn to associate 'very' with heightened emotion, which works fine for that genre but fails in neutral contexts.
For example, a model encountering 'a very normal day' might inflate the emotional weight, misclassifying a neutral statement as slightly positive or negative. Conversely, when 'very' is downsampled during fine-tuning to reduce bias, the model loses the ability to distinguish between mild and strong language. The fine balance between genre-specific frequency and universal semantics remains an unsolved problem.
Tech policy discussions, such as those redefining UK politics with technology regulation, often encounter similar linguistic nuances. The same word can flip meaning based on domain, and regulators must consider how AI interprets such subtleties.
The documentary title Brexit: A Very British Civil War is a masterclass in pragmatic complexity. 'Very British' is not a literal intensification—it is an ironic comment on the contradiction between British reserve and the divisive conflict. The review itself notes the film is 'a hoot' and revels in the 'blockbuster names' and 'gossip', using 'very' to set a flippant tone about a serious topic. AI systems that lack common-sense reasoning or cultural knowledge treat 'very' as a simple boost, missing the rhetorical function entirely.
Consider the review's description: Nigel Farage is seen as a 'panto dame', and the campaign is likened to Game of Thrones. The word 'very' here amplifies the absurdity. A robust NLP system needs world knowledge to parse that 'very British' in this context means 'stereotypically understated yet now chaotic'—a far cry from a neutral description. This pragmatic layer is exactly where current AI falls short.
'Johnson’s position had “nothing to do with the EU,” says George Osborne. “It was Game of Thrones.”' — A quote that, with 'very', the title frames as comic.
Storytelling techniques that convey such irony are often explored in media like how Spike Lee uses technology to tell powerful stories. Similarly, NLP must evolve to handle narrative nuance—'very' is a small tell for a large problem of pragmatic comprehension.