British English pronunciation dictionary
British English (RP/Standard Southern British ) pronunciation dictionary:
The main dictionary's words are in upper case, comma-separated from their space-separated pronunciation. For words with multiple pronunciations, a parenthesised number is attached to the end:
RAINBOW, ɹ ˈeɪ n b ˌəʊ
RAINING, ɹ ˈeɪ n ɪ ŋ
RAISE, ɹ ˈeɪ z
RAISED, ɹ ˈeɪ z d
RAISES, ɹ ˈeɪ z ɪ z
RAISING, ɹ ˈeɪ z ɪ ŋ
RAISINS, ɹ ˈeɪ z ɪ n z
RALEIGH(1), ɹ ˈɑː l i
RALEIGH(2), ɹ ˈɔː l i
Stress marks are attached to the stressed vowel/diphthong.
Multi-unit words are separated by the underscore _
, which stands for an actual space
. This is to ease further
processing:
COSTA_RICA, k ˌɒ s t ə ɹ ˈiː k ə
In the expansions dictionary entries are also in upper case, tab-separated from their expansions:
MON MONDAY(1)
MON. MONDAY(1)
MPG MILES PER(1) GALLON
MPH MILES PER(1) HOUR
MR MISTER
MR. MISTER
MRS MISSIS
MRS. MISSIS
strict IPA versus traditional phonetic symbols: the phonetic symbols are strictly as defined by the IPA, as opposed to how they have traditionally been used in many dictionaries and the language learning literature. In particular:
unstressed vowels as /ə/ and /ɪ/: due to the diversity of the sources for phonetic transcription, there's some inconsistency in how weak vowels are transcribed, though in most cases /ɪ/ is used, following the Collins Dictionary.
final i: final unstressed i's are given a short tense "i" phoneme /i/, different from both /iː/ and /ɪ/, to reflect happy-tensing. Most dictionaries show this vowel (https://en.wikipedia.org/wiki/English_phonology) or the short tense /ɪ/. There might be some inconsistency in the transcription as happy-tensing is preserved in inflected variants in spoken English (e.g., studied derives it from study, and it contrasts with studded) yet this might not always be reflected in the dictionary.
secondary stress: secondary stress is not always marked (the primary always is).
stems and inflections: not all inflected open-class words (noun, verbs, adjectives and adverbs) have all their inflected variants, and not all variants show all of the alternative pronunciations. The possessive form -'s of nouns is not included, and neither is the superlative form of most adjectives and adverbs.
acronyms vs initialisms: The expansions dictionary only contains acronyms, i.e., words that are not pronounced by spelling out the individual letters (e.g. NATO). Initialisms, on the other hand, (e.g. BBC, NHS) are excluded. The pronunciation of these can be obtained by looking up the names of the individual letters in the main dictionary, then concatenating them.
The initial source of the phonetic transcriptions is cmudict, plus a number of other sources for British English specifics: Wiktionary, Wikipedia, the Collins Dictionary, the Oxford Dictionary, the Cambridge Dictionary and the MacMillan Dictionary.
The main sources of the word frequency-filtered vocabulary are the top 10K in the British National Corpus, the Google Web Corpus and the New General Service Lists. Not all words in these lists are included since due to sampling bias there are uncommon words like athelstan or phentermine, as well as foreign words. Also excluded are initialisms.
See Changelog
If you'd like to contribute a correction or an addition, or make a request for an addition, you can make a pull request or open an issue.
Copyright (c) 2017 by Jose Llarena
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.