Collocation dictionaries catalog word pairings commonly found in a language. They show common combinations of words with other words that belong to different types, such as which adjectives are usually used to describe a specific noun, or what prepositions or adverbs are usually paired with a specific verb.
For instance, by using an English collocation dictionary, we can look up the word “analyse” and it will present us with a compilation of commonly co-located (placed alongside) adjectives, verbs, prepositions and phrases. These are words that usually accompany analysis and sound most natural in English.
In this case some of the words that the Online Oxford Collocation Dictionary returned are the adjectives “comprehensive”, “thorough”, “systematic”, “brief”, “comparative” and many more. It also listed verbs such as “carry out”, “perform”, “indicate sth”, “make” and the preposition “in + an/the”.
This result highlights some important guidelines related to the use of the word “analysis”. We don’t normally use the preposition “on” to refer to it. Simlarly, we don’t use the phrase “effectuate an analysis”, we say “carry out an analysis”. Regarding adjectives, it appears that the combination “attentive analysis” is not typically used. In practice, we can see how veering too greatly from the dictionary’s directives might produce sentences that seem off: “We have effectuated an attentive analysis on this topic”.
Collocation dictionaries are very useful tools not only for non-native speakers, but also for native speakers seeking to maximise the potential of each word while still adhering to the confines of what is conventionally accepted without much fuss or ambiguity. The combinations that these dictionaries propose offer an array of options for those who reach a certain difficult point in their attempts to express something through writing and struggle to find suitable words to employ along with the ones that they are already using.
In fact, I would say that collocations act as a kind of prompt, a turn of the crank to keep the mind moving forward in its flow of ideas while maintaining language as a friendly tool rather than a necessary but cumbersome, frustrating means of communication. How many times have our linguistic skills failed us and we had to content ourselves to stay silent or say something that was not quite what we meant but was the closest we could cobble together?
Another important function that collocation dictionaries have, which I think is overlooked, is that they are a linguistic record that reinforces conventions in a way that it can reduce unnecessary negative prejudice based upon the other person’s linguistic choices.
This might seem counterintuitive because having a collocation dictionary could imply that the person prone to biased prejudice would consider it a sort of credentialed proof that the other made the wrong word choice, and all that having your linguistic skills can cause to your reputation. But the truth is that these conventions exist regardless of the existence of a collocation dictionary. Its absence would only result in less defined rules.
And in any case, this is just a minor downside if we consider that the collocation dictionary would be easily findable and accessible in the search engine era. If we meet this simple condition, then everyone who wants to opt for a conventional, straightforward word choice, would have access to an authoritative resource that shows them what their options are. They would consult the dictionary and just get it over with.
For these reasons I think that the online space needs officially-sanctioned dictionaries such as Oxford’s Free Collocation Dictionary available in many other languages. This dictionary should serve as a search engine aking to Oxford’s. Nowadays with the advent of Large Language Models AI’s anyone willing to enter the proper prompts would theoretically receive the correct collocations in any of the languages supported, but I find this process not practical enough. It is full of unnecessary friction, whereas an exclusively dedicated search engine would streamline the experience.
Looking at the broader perspective, I believe that the availability of collocation dictionaries in multiple languages would aid them towards a more effective development over time. Moreover, perhaps it would enable the emergence of new linguistic expressions coming from those individuals who willingly and transparently choose to stretch the boundaries of existing conventions. As the adage goes, to break a rule one must first know the rule that one is breaking.
Regarding the first point, where I speculate that collocation dictionaries could contribute to the development of a language, it seems to me that the very existence of such as document could function as a pre-established, ready repository for all emerging word combinations and expressions that organically evolve over time.
I think that English, the undisputed language of the globalised online world, has acquired an outsized advantage as the primary means of discussing modern concepts and the modern world in general. In the international environment, new words and expressions are being coined in English not only by native speakers but also non-native individuals that are fluent in the language. And this is happening to the detriment of their own languages. For example, it is widely acknowledged that the predominant language of academia is English, and those who study academic topics internationally often struggle to articulate learned concepts in their own native languages. In consequence, because there are no widespread, easily accessible, officially-sanctioned conventions dictating what word/expression stands for what English word/expression, they choose to use English words for the sake of fluency in communication.
The issue at hand, much like many problems in the world, is a matter not of good intentions but of a faulty structure. Even if all these students would love to be able to speak in their own language about the characteristics of AI, the Blockchain, datafication, deep fakes, and this is only scratching the surface, the means are just not there. English has taken over driven by a sort of network effect, leading us to choose it, many times in spite of ourselves, because there is no unified network in the, let’s say avant-garde linguistic environment of other languages.
Thus, with the establishment of a repository that would serve as a go-to reference when looking for the ‘right’ and ‘conventional’ expression, there would be a common point for this network of linguistic pioneers to unify. The language would stand a better chance to develop while leveraging its unique properties (grammatical, pre-existing untranslatable words, declensions, etc.) to evolve while keeping pace with continuous changes and developments in our global turbo-charged culture.
The stretching of conventions not only serves to find new combinations that are more accurate than available conventions. It also serves to revitalise a reader/listener’s attentiveness to the content, because it wouldn’t be controversial to suggest that certain combinations, especially between adjectives and nouns, or adverbs and nouns, are formulaic to an extent that their utility seems closer to being a cognitive chunking mechanism than the intentional expression of a thought. Cognitive chunking our minds energy-efficient strategy to process large volumes of information by bundling its different components into bigger, more digestible chunks (a paradigmatic example is the chunking of digits in groups of three to remember phone numbers easier than if we thought about them number by number). When we read a text we might be cognitively tempted to chunk formulaic expressions and mentally process as if they were one unit to go faster. This is something that can deprive us of certain elements of nuance that the chunked words were intended to bring into it.
I sincerely hope that search-engine collocation dictionaries in more languages begin to turn up on the web. Now it’s easier than ever to develop one thanks to Large Language Models. Perhaps the right move would be to create a program that can generate these dictionaries for those interested, so that they can quickly set up a website for it at a much faster pace than previously possible. Only then, once the dictionary is operational and easily findable, accessible and popularly-sanctioned, will it become an adequate repository for all new expressions and combinations that emerge over time.
✵
