defmodule Cldr.Locale do
@moduledoc """
Functions to parse and normalize locale names into a structure
locale represented by a `Cldr.LanguageTag`.
CLDR represents localisation data organized into locales, with
each locale being identified by a locale name that is formatted
according to [RFC5646](https://tools.ietf.org/html/rfc5646).
In practise, the CLDR data utilizes a simple subset of locale name
formats being:
* a Language code such as `en` or `fr`
* a Language code and Territory code such as `en-GB`
* a Language code and Script such as `zh-Hant`
* and in only two cases a Language code, Territory code and Variant
such as `ca-ES-valencia` and `en-US-posix`.
The RFC defines a language tag as:
> A language tag is composed from a sequence of one or more "subtags",
each of which refines or narrows the range of language identified by
the overall tag. Subtags, in turn, are a sequence of alphanumeric
characters (letters and digits), distinguished and separated from
other subtags in a tag by a hyphen ("-", [Unicode] U+002D)
Therefore `Cldr` uses the hyphen ("-", [Unicode] U+002D) as the subtag
separator. On certain platforms, including POSIX platforms, the
subtag separator is a "_" (underscore) rather than a "-" (hyphen). Where
appropriate, `Cldr` will transliterate any underscore into a hyphen before
parsing or processing.
### Locale name validity
When validating a locale name, `Cldr` will attempt to match the requested
locale name to a configured locale. Therefore `Cldr.Locale.new/2` may
return an `{:ok, language_tag}` tuple even when the locale returned does
not exactly match the requested locale name. For example, the following
attempts to create a locale matching the non-existent "english as spoken
in Spain" local name. Here `Cldr` will match to the nearest configured
locale, which in this case will be "en".
iex> Cldr.Locale.new("en-ES", TestBackend.Cldr)
{:ok, %Cldr.LanguageTag{
backend: TestBackend.Cldr,
canonical_locale_name: "en-ES",
cldr_locale_name: :en,
extensions: %{},
gettext_locale_name: "en",
language: "en",
locale: %{},
private_use: [],
rbnf_locale_name: :en,
requested_locale_name: "en-ES",
script: :Latn,
territory: :ES,
transform: %{},
language_variants: []
}}
### Matching locales to requested locale names
When attempting to match the requested locale name to a configured
locale, `Cldr` attempt to match against a set of reductions in the
following order and will return the first match:
* language, script, territory, [variants]
* language, territory, [variants]
* language, script, [variants]
* language, [variants]
* language, script, territory
* language, territory
* language, script
* language
* requested locale name
* nil
Therefore matching is tolerant of a request for unknown scripts,
territories and variants. Only the requested language is a
requirement to be matched to a configured locale.
### Substitutions for Obsolete and Deprecated locale names
CLDR provides data to help manage the transition from obsolete
or deprecated locale names to current names. For example, the
following requests the locale name "mo" which is the deprecated
code for "Moldovian". The replacement code is "ro" (Romanian).
iex> Cldr.Locale.new("mo", TestBackend.Cldr)
{:ok, %Cldr.LanguageTag{
backend: TestBackend.Cldr,
extensions: %{},
gettext_locale_name: nil,
language: "ro",
language_subtags: [],
language_variants: [],
locale: %{}, private_use: [],
rbnf_locale_name: :ro,
requested_locale_name: "mo",
script: :Latn,
transform: %{},
canonical_locale_name: "ro",
cldr_locale_name: :ro,
territory: :RO
}}
### Likely subtags
CLDR also provides data to indetify the most likely subtags for a
requested locale name. This data is based on the default content data,
the population data, and the suppress-script data in [BCP47]. It is
heuristically derived, and may change over time. For example, when
requesting the locale "en", the following is returned:
iex> Cldr.Locale.new("en", TestBackend.Cldr)
{:ok, %Cldr.LanguageTag{
backend: TestBackend.Cldr,
canonical_locale_name: "en",
cldr_locale_name: :en,
extensions: %{},
gettext_locale_name: "en",
language: "en",
locale: %{},
private_use: [],
rbnf_locale_name: :en,
requested_locale_name: "en",
script: :Latn,
territory: :US,
transform: %{},
language_variants: []
}}
Which shows that a the likely subtag for the script is :Latn and the likely
territory is "US".
Using the example for Substitutions above, we can see the
result of combining substitutions and likely subtags for locale name "mo"
returns the current language code of "ro" as well as the likely
territory code of "MD" (Moldova).
### Unknown territory codes
Whilst `Cldr` is tolerant of invalid territory codes. Therefore validity is
not checked by `Cldr.Locale.new/2` but it is checked by `Cldr.validate_locale/2`
which is the recommended api for forming language tags.
iex> Cldr.Locale.new("en-XX", TestBackend.Cldr)
{:ok, %Cldr.LanguageTag{
backend: TestBackend.Cldr,
canonical_locale_name: "en-XX",
cldr_locale_name: :en,
extensions: %{},
gettext_locale_name: "en",
language: "en",
locale: %{},
private_use: [],
rbnf_locale_name: :en,
requested_locale_name: "en-XX",
script: :Latn,
territory: :XX,
transform: %{},
language_variants: []
}}
### Locale extensions
Unicode defines the [U extension](https://unicode.org/reports/tr35/#Locale_Extension_Key_and_Type_Data)
which support defining the requested treatment of CLDR data formats. For example, a locale name
can configure the requested:
* calendar to be used for dates
* collation
* currency
* currency format
* number system
* first day of the week
* 12-hour or 24-hour time
* time zone
* and many other items
For example, the following locale name will request the use of the timezone `Australia/Sydney`,
and request the use of `accounting` format when formatting currencies:
iex> MyApp.Cldr.validate_locale "en-AU-u-tz-ausyd-cf-account"
{
:ok,
%Cldr.LanguageTag{
backend: MyApp.Cldr,
canonical_locale_name: "en-AU-u-cf-account-tz-ausyd",
cldr_locale_name: :"en-AU",
extensions: %{},
gettext_locale_name: "en",
language: "en",
language_subtags: [],
language_variants: [],
locale: %Cldr.LanguageTag.U{
calendar: nil,
cf: :account,
col_alternate: nil,
col_backwards: nil,
col_case_first: nil,
col_case_level: nil,
col_normalization: nil,
col_numeric: nil,
col_reorder: nil,
col_strength: nil,
collation: nil,
currency: nil,
dx: nil,
em: nil,
fw: nil,
hc: nil,
lb: nil,
lw: nil,
ms: nil,
numbers: nil,
rg: nil,
sd: nil,
ss: nil,
timezone: "Australia/Sydney",
va: nil,
vt: nil
},
private_use: '',
rbnf_locale_name: :en,
requested_locale_name: "en-AU",
script: :Latn,
territory: :AU,
transform: %{}
}
}
"""
alias Cldr.LanguageTag
alias Cldr.LanguageTag.{U, T}
import Cldr.Helpers, only: [empty?: 1]
defguard is_locale_name(locale_name) when is_atom(locale_name)
@typedoc "The name of a locale"
@type locale_name() :: atom()
@typedoc "A reference to a locale"
@type locale_reference :: LanguageTag.t() | locale_name() | String.t()
@typedoc "The name of a language"
@type language :: String.t() | nil
@typedoc "The name of a script"
@type script :: atom() | String.t() | nil
@typedoc "The name of a territory"
@type territory_reference :: atom() | String.t() | nil
@typedoc "A territory code as an ISO3166 Alpha-2 in atom form"
@type territory_code :: atom()
@typedoc "The list of language variants as strings"
@type variants :: [String.t()] | []
@typedoc "The list of language subtags as strings"
@type subtags :: [String.t(), ...] | []
@root_locale Cldr.Config.root_locale_name()
@root_language Atom.to_string(@root_locale)
@root_rbnf_locale_name Cldr.Config.root_locale_name()
defdelegate new(locale_name, backend), to: __MODULE__, as: :canonical_language_tag
defdelegate new!(locale_name, backend), to: __MODULE__, as: :canonical_language_tag!
defdelegate locale_name_to_posix(locale_name), to: Cldr.Config
defdelegate locale_name_from_posix(locale_name), to: Cldr.Config
@doc """
Mapping of language data to known
scripts and territories
"""
@language_data Cldr.Config.language_data()
def language_data do
@language_data
end
@doc """
Returns mappings between a locale
and its parent.
The mappings exist only where normal
inheritance rules are not applied.
"""
@parent_locales Cldr.Config.parent_locales()
def parent_locale_map do
@parent_locales
end
@doc """
Returns a list of all the parent locales
for a given locale.
## Examples
Cldr.Locale.parents "fr-ca"
=> {:ok, [#Cldr.LanguageTag<fr [validated]>, #Cldr.LanguageTag<en [validated]>]}
"""
@spec parents(LanguageTag.t()) ::
{:ok, list(LanguageTag.t())} | {:error, {module(), String.t()}}
def parents(locale, acc \\ [])
def parents(%LanguageTag{} = locale, acc) do
case parent(locale) do
{:error, _} -> {:ok, Enum.reverse(acc)}
{:ok, locale} -> parents(locale, [locale | acc])
end
end
def parents(locale, []) do
with {:ok, locale} <- Cldr.validate_locale(locale, Cldr.default_backend!()) do
parents(locale)
end
end
def parents(locale, backend) when is_atom(backend) do
with {:ok, locale} <- Cldr.validate_locale(locale, backend) do
parents(locale)
end
end
@doc """
Returns the parent for a given locale.
The function implements locale inheritance
in accordance with [CLDR's inheritance rules](https://unicode.org/reports/tr35/#Locale_Inheritance).
Only locales that are configured are returned.
That is, there may be a different parent locale in CLDR
but unless those locales are configured they are not
candidates to be parents in this context. The contract
is to return either a known locale or an error.
### Inheritance
* Inheritance starts by looking for a parent locale via
`Cldr.Config.parent_locales/0`.
* If not found, strip in turn the variant, script and territory
while checking to see if a base locale for the given language
exists.
* If no parent language exists then move to the default
locale and its inheritance chain.
"""
@spec parent(LanguageTag.t()) ::
{:ok, LanguageTag.t()} | {:error, {module(), binary()}}
def parent(%LanguageTag{language: @root_locale}) do
{:error, no_parent_error(@root_locale)}
end
def parent(%LanguageTag{backend: backend} = child) do
if parent = Map.get(parent_locale_map(), child.cldr_locale_name) do
Cldr.validate_locale(parent, backend)
else
child
|> find_parent(backend)
|> return_parent_or_default(child, backend)
|> transfer_extensions(child)
end
end
@spec parent(locale_name(), Cldr.backend()) ::
{:ok, LanguageTag.t()} | {:error, {module(), binary()}}
def parent(locale_name, backend \\ Cldr.default_backend!()) do
with {:ok, locale} <- Cldr.validate_locale(locale_name, backend) do
parent(locale)
end
end
defp find_parent(%LanguageTag{language_variants: [_ | _] = variants} = locale, backend) do
%LanguageTag{language: language, script: script, territory: territory} = locale
first_match(language, script, territory, variants, &known_locale(&1, &2, backend))
end
defp find_parent(%LanguageTag{territory: territory} = locale, backend)
when not is_nil(territory) do
%LanguageTag{language: language, script: script} = locale
first_match(language, script, nil, [], &known_locale(&1, &2, backend))
end
defp find_parent(%LanguageTag{language: language}, backend) do
parent_locale_map()
|> Map.get(language)
|> known_locale(backend)
end
defp known_locale(locale_name, tags \\ [], backend)
defp known_locale(nil, _tags, _backend) do
nil
end
defp known_locale(locale_name, tags, backend) when is_binary(locale_name) do
locale_name = String.to_existing_atom(locale_name)
known_locale(locale_name, tags, backend)
rescue ArgumentError ->
nil
end
defp known_locale(locale_name, _tags, backend) when is_atom(locale_name) do
Enum.find(backend.known_locale_names(), &(locale_name == &1))
end
defp known_rbnf_locale_name(locale_name, _tags, backend) do
locale_name = String.to_existing_atom(locale_name)
Cldr.known_rbnf_locale_name(locale_name, backend)
rescue ArgumentError ->
nil
end
defp return_parent_or_default(parent, %LanguageTag{cldr_locale_name: parent} = child, backend) do
default_locale = Cldr.default_locale(backend)
if child.language == default_locale.language do
{:error, no_parent_error(child.canonical_locale_name)}
else
{:ok, default_locale}
end
end
defp return_parent_or_default(parent, _child, backend) do
Cldr.validate_locale(parent, backend)
end
defp transfer_extensions({:error, _reason} = error, _child) do
error
end
defp transfer_extensions({:ok, parent}, child) do
{:ok, %{parent | locale: child.locale, transform: child.transform}}
end
defp no_parent_error(locale_name) do
{Cldr.NoParentError, "The locale #{inspect(locale_name)} has no parent locale"}
end
@doc """
Returns the list of fallback locales, starting the
the provided locale.
Fallbacks are a list of locate names which can
be used to resolve translation or other localization
data if such localised data does not exist for
this specific locale. After locale-specific fallbacks
are determined, the default locale and its fallbacks
are added to the chain.
## Arguments
* `locale` is any `LanguageTag.t`
## Returns
* `{:ok, list_of_locales}` or
* `{:error, {exception, reason}}`
## Examples
In these examples the default locale is `:"en-001"`.
Cldr.Locale.fallback_locales(Cldr.Locale.new!("fr-CA", MyApp.Cldr))
=> {:ok,
[#Cldr.LanguageTag<fr-CA [validated]>, #Cldr.LanguageTag<fr [validated]>,
#Cldr.LanguageTag<en [validated]>]}
# Fallbacks are typically formed by progressively
# stripping variant, territory and script from the
# given locale name. But not always - there are
# certain fallbacks that take a different path.
Cldr.Locale.fallback_locales(Cldr.Locale.new!("nb", MyApp.Cldr))
=> {:ok,
[#Cldr.LanguageTag<nb [validated]>, #Cldr.LanguageTag<no [validated]>,
#Cldr.LanguageTag<en [validated]>]}
"""
@spec fallback_locales(LanguageTag.t()) ::
{:ok, [LanguageTag.t(), ...]} | {:error, {module(), binary()}}
@doc since: "2.26.0"
def fallback_locales(%LanguageTag{} = locale) do
with {:ok, parents} <- parents(locale) do
{:ok, [locale | parents]}
end
end
@doc """
Returns the list of fallback locales, starting the
the provided locale.
Fallbacks are a list of locate names which can
be used to resolve translation or other localization
data if such localised data does not exist for
this specific locale. After locale-specific fallbacks
are determined, the default locale and its fallbacks
are added to the chain.
## Arguments
* `locale_name` is any locale name returned by
`Cldr.known_locale_names/1`
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module. The default is
`Cldr.default_locale/0`.
## Returns
* `{:ok, list_of_locales}` or
* `{:error, {exception, reason}}`
## Examples
In these examples the default locale is `:"en-001"`.
Cldr.Locale.fallback_locales(:"fr-CA")
=> {:ok,
[#Cldr.LanguageTag<fr-CA [validated]>, #Cldr.LanguageTag<fr [validated]>,
#Cldr.LanguageTag<en [validated]>]}
# Fallbacks are typically formed by progressively
# stripping variant, territory and script from the
# given locale name. But not always - there are
# certain fallbacks that take a different path.
Cldr.Locale.fallback_locales(:nb)
=> {:ok,
[#Cldr.LanguageTag<nb [validated]>, #Cldr.LanguageTag<no [validated]>,
#Cldr.LanguageTag<en [validated]>]}
"""
@spec fallback_locales(locale_reference, Cldr.backend) ::
{:ok, [LanguageTag.t(), ...]} | {:error, {module(), binary()}}
@doc since: "2.26.0"
def fallback_locales(locale_name, backend \\ Cldr.default_backend!()) do
with {:ok, locale} <- Cldr.validate_locale(locale_name, backend) do
fallback_locales(locale)
end
end
@doc """
Returns the list of fallback locale names, starting with
the provided locale.
Fallbacks are a list of locate names which can
be used to resolve translation or other localization
data if such localised data does not exist for
this specific locale. After locale-specific fallbacks
are determined, the default locale and its fallbacks
are added to the chain.
## Arguments
* `locale` is any `LanguageTag.t`
## Returns
* `{:ok, list_of_locale_names}` or
* `{:error, {exception, reason}}`
## Examples
In these examples the default locale is `:"en-001"`.
iex> Cldr.Locale.fallback_locale_names(Cldr.Locale.new!("fr-CA", MyApp.Cldr))
{:ok, [:"fr-CA", :fr, :"en-001", :en]}
# Fallbacks are typically formed by progressively
# stripping variant, territory and script from the
# given locale name. But not always - there are
# certain fallbacks that take a different path.
iex> Cldr.Locale.fallback_locale_names(Cldr.Locale.new!("nb", MyApp.Cldr))
{:ok, [:nb, :no, :"en-001", :en]}
"""
@spec fallback_locale_names(LanguageTag.t()) ::
{:ok, [locale_name, ...]} | {:error, {module(), binary()}}
@doc since: "2.26.0"
def fallback_locale_names(%LanguageTag{} = locale) do
with {:ok, fallbacks} <- fallback_locales(locale) do
locale_names = Enum.map(fallbacks, &Map.get(&1, :cldr_locale_name))
{:ok, locale_names}
end
end
@doc """
Returns the list of fallback locale names, starting with
the provided locale.
Fallbacks are a list of locate names which can
be used to resolve translation or other localization
data if such localised data does not exist for
this specific locale. After locale-specific fallbacks
are determined, the default locale and its fallbacks
are added to the chain.
## Arguments
* `locale` is any `LanguageTag.t`
## Returns
* `list_of_locale_names` or
* raises an exception
## Examples
In these examples the default locale is `:"en-001"`.
iex> Cldr.Locale.fallback_locale_names!(Cldr.Locale.new!("fr-CA", MyApp.Cldr))
[:"fr-CA", :fr, :"en-001", :en]
# Fallbacks are typically formed by progressively
# stripping variant, territory and script from the
# given locale name. But not always - there are
# certain fallbacks that take a different path.
iex> Cldr.Locale.fallback_locale_names!(Cldr.Locale.new!("nb", MyApp.Cldr))
[:nb, :no, :"en-001", :en]
"""
def fallback_locale_names!(%LanguageTag{} = locale) do
case fallback_locale_names(locale) do
{:ok, fallback_chain} -> fallback_chain
{:error, {exception, reason}} -> raise exception, reason
end
end
@doc """
Returns the list of fallback locale names, starting the
the provided locale name.
Fallbacks are a list of locate names which can
be used to resolve translation or other localization
data if such localised data does not exist for
this specific locale. After locale-specific fallbacks
are determined, the default locale and its fallbacks
are added to the chain.
## Arguments
* `locale_name` is any locale name returned by
`Cldr.known_locale_names/1`
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module. The default is
`Cldr.default_locale/0`.
## Returns
* `{:ok, list_of_locale_names}` or
* `{:error, {exception, reason}}`
## Examples
In these examples the default locale is `:"en-001"`.
iex> Cldr.Locale.fallback_locale_names(:"fr-CA")
{:ok, [:"fr-CA", :fr, :"en-001", :en]}
# Fallbacks are typically formed by progressively
# stripping variant, territory and script from the
# given locale name. But not always - there are
# certain fallbacks that take a different path.
iex> Cldr.Locale.fallback_locale_names(:nb)
{:ok, [:nb, :no, :"en-001", :en]}
"""
@spec fallback_locale_names(locale_reference, Cldr.backend()) ::
{:ok, [locale_name, ...]} | {:error, {module(), binary()}}
@doc since: "2.26.0"
def fallback_locale_names(locale_name, backend \\ Cldr.default_backend!()) do
with {:ok, locale} <- Cldr.validate_locale(locale_name, backend) do
fallback_locale_names(locale)
end
end
@doc """
Returns a map of a territory code to its
most-spoken language.
## Example
Cldr.Locale.languages_for_territories()
=> %{
AQ: "und",
PE: "es",
SR: "nl",
NU: "en",
...
}
"""
@language_for_territory Cldr.Config.language_tag_for_territory()
@doc since: "2.26.0"
def languages_for_territories do
@language_for_territory
end
@doc """
Returns the "best fit" locale for a given territory.
Using the population percentage data from CLDR, the
language most commonly spoken in the given territory
is used to form a locale name which is then validated
against the given backend.
First a territory-specific locale is validated and if
that fails, the base language only is validate.
For example, if the territory is `AU` then then the
language most spoken is "en". First, the locale "en-AU"
is validated and if that fails, "en" is validated.
## Arguments
* `territory` is any ISO 3166 Alpha-2 territory
code that can be validated by `Cldr.validate_territory/1`
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module.
## Returns
* `{:ok, language_tag}` or
* `{:error, {exception, reason}}`
## Examples
iex> Cldr.Locale.locale_for_territory(:AU, TestBackend.Cldr)
Cldr.validate_locale(:"en-AU", TestBackend.Cldr)
iex> Cldr.Locale.locale_for_territory(:US, TestBackend.Cldr)
Cldr.validate_locale(:"en-US", TestBackend.Cldr)
iex> Cldr.Locale.locale_for_territory(:ZZ)
{:error, {Cldr.UnknownTerritoryError, "The territory :ZZ is unknown"}}
"""
@doc since: "2.26.0"
@spec locale_for_territory(territory_code(), Cldr.backend()) ::
{:ok, LanguageTag.t()} | {:error, {module(), String.t()}}
def locale_for_territory(territory, backend \\ Cldr.default_backend!()) do
with {:ok, territory} <- Cldr.validate_territory(territory) do
case Map.get(languages_for_territories(), territory) do
nil ->
{:error, no_locale_for_territory_error(territory)}
language ->
validate_locale(language, territory, backend)
end
end
end
# See first if there is a territory specific version of this
# language, otherwise the base language itself
defp validate_locale(language, nil, backend) do
Cldr.validate_locale(language, backend)
end
defp validate_locale(language, territory, backend) do
case Cldr.validate_locale("#{language}-#{to_string(territory)}", backend) do
{:ok, locale} -> {:ok, locale}
{:error, _} -> validate_locale(language, nil, backend)
end
end
@consider_as_tld [
:AD, :AS, :BZ, :CC, :CD, :CO, :DJ, :FM, :IO, :LA, :ME, :MS, :NU, :SC, :SR, :SU, :TV, :TK, :WS
]
@doc """
Returns a list of territory top-level domains that are
considered to be generic top level domains.
See https://developers.google.com/search/docs/advanced/crawling/managing-multi-regional-sites
for an explanation of why some valid territory suffixxes
are considered as TLDs.
## Example
iex> Cldr.Locale.consider_as_tlds
[:AD, :AS, :BZ, :CC, :CD, :CO, :DJ, :FM, :IO, :LA, :ME, :MS, :NU, :SC, :SR, :SU, :TV, :TK, :WS]
"""
def consider_as_tlds do
@consider_as_tld
end
@doc """
Returns a "best fit" locale for a host name.
## Arguments
* `host` is any valid host name
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module.
* `options` is a keyword list of options. The default
is `[tlds: Cldr.Locale.consider_as_tlds()]`.
## Options
* `:tlds` is a list of territory codes as upper-cased
atoms that are to be considered as top-level domains.
The default list is `consider_as_tlds/0`.
## Returns
* `{:ok, langauge_tag}` or
* `{:error, {exception, reason}}`
## Notes
Certain top-level domains have become associated with content
underlated to the territory for who the domain is registered.
Therefore Google (and perhaps others) do not associate these
TLDs as belonging to the territory but rather are considered
generic top-level domain names.
## Examples
iex> Cldr.Locale.locale_from_host "a.b.com.au", TestBackend.Cldr
Cldr.validate_locale(:"en-AU", TestBackend.Cldr)
iex> Cldr.Locale.locale_from_host "a.b.com.tv", TestBackend.Cldr
{:error,
{Cldr.UnknownLocaleError, "No locale was identified for territory \\"tv\\""}}
iex> Cldr.Locale.locale_from_host "a.b.com", TestBackend.Cldr
{:error,
{Cldr.UnknownLocaleError, "No locale was identified for territory \\"com\\""}}
"""
@doc since: "2.26.0"
@spec locale_from_host(String.t(), Cldr.backend(), Keyword.t()) ::
{:ok, LanguageTag.t()} | {:error, {module(), String.t()}}
def locale_from_host(host, backend, options \\ []) do
tld_list = Keyword.get(options, :tlds, consider_as_tlds())
with {:ok, territory} <- territory_from_host(host) do
if territory in tld_list do
{:error, no_locale_for_territory_error(territory)}
else
locale_for_territory(territory, backend)
end
end
end
@doc """
Returns the last segment of a host that might
be a territory.
## Arguments
* `host` is any valid host name
## Returns
* `{:ok, territory}` or
* `{:error, {exception, reason}}`
## Examples
iex> Cldr.Locale.territory_from_host("a.b.com.au")
{:ok, :AU}
iex> Cldr.Locale.territory_from_host("a.b.com")
{:error,
{Cldr.UnknownLocaleError, "No locale was identified for territory \\"com\\""}}
"""
@doc since: "2.26.0"
@spec territory_from_host(String.t()) ::
{:ok, territory_code()} | {:error, {module(), String.t()}}
def territory_from_host(host) do
territory =
host
|> String.split(".")
|> Enum.reverse()
|> hd()
try do
territory = String.upcase(territory) |> String.to_existing_atom()
Cldr.validate_territory(territory)
rescue ArgumentError ->
{:error, no_locale_for_territory_error(territory)}
end
end
@doc """
Returns the effective territory for a locale.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
or any `locale_name` returned by `Cldr.known_locale_names/1`. If
the parameter is a `locale_name` then a default backend must be
configured in `config.exs` or an exception will be raised.
## Returns
* The territory to be used for localization purposes.
## Examples
iex> Cldr.Locale.territory_from_locale "en-US"
:US
iex> Cldr.Locale.territory_from_locale "en-US-u-rg-cazzzz"
:CA
iex> Cldr.Locale.territory_from_locale "en-US-u-rg-xxxxx"
{:error, {Cldr.LanguageTag.ParseError, "The value \\"xxxxx\\" is not valid for the key \\"rg\\""}}
## Notes
A locale can reflect the desired territory to be used
when determining region-specific defaults for items such
as:
* default currency,
* default calendar and week data,
* default time cycle, and
* default measurement system and unit preferences
Territory information is stored in the locale in up to three
different places:
1. The `:territory` extracted from the locale name or
defined by default for a given language. This is the typical
use case when locale names such as `en-US` or `es-AR` are
used.
2. In some cases it might be desirable to override the territory
derived from the locale name. For example, the default
territory for the language "en" is "US" but it may be desired
to apply the defaults for the territory "AU" instead, without
otherwise changing the localization intent. In this case
the [U extension](https://unicode.org/reports/tr35/#u_Extension) is
used to define a
[regional override](https://unicode.org/reports/tr35/#RegionOverride).
3. Similarly, the [regional subdivision identifier]
(https://unicode.org/reports/tr35/#UnicodeSubdivisionIdentifier)
can be used to influence localization decisions. This identifier
is not currently used in `ex_cldr` and dependent libraries
however it is correctly parsed to support future use.
"""
@spec territory_from_locale(LanguageTag.t() | locale_name() | String.t()) :: territory_code()
@doc since: "2.18.2"
def territory_from_locale(%LanguageTag{locale: %{rg: _rg}} = language_tag) do
language_tag.locale.rg || language_tag.territory || Cldr.default_territory()
end
def territory_from_locale(%LanguageTag{} = language_tag) do
language_tag.territory || Cldr.default_territory()
end
def territory_from_locale(locale_name) do
territory_from_locale(locale_name, Cldr.default_backend!())
end
@doc """
Returns the effective territory for a locale.
## Arguments
* `locale_name` is any locale name returned by
`Cldr.known_locale_names/1`.
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module.
## Returns
* The territory to be used for localization purposes or
`{:error, {exception, reason}}`.
## Examples
iex> Cldr.Locale.territory_from_locale "en-US", TestBackend.Cldr
:US
iex> Cldr.Locale.territory_from_locale "en-US-u-rg-cazzzz", TestBackend.Cldr
:CA
iex> Cldr.Locale.territory_from_locale "en-US-u-rg-xxxxx", TestBackend.Cldr
{:error, {Cldr.LanguageTag.ParseError, "The value \\"xxxxx\\" is not valid for the key \\"rg\\""}}
## Notes
A locale can reflect the desired territory to be used
when determining region-specific defaults for items such
as:
* default currency,
* default calendar and week data,
* default time cycle, and
* default measurement system and unit preferences
Territory information is stored in the locale in up to three
different places:
1. The `:territory` extracted from the locale name or
defined by default for a given language. This is the typical
use case when locale names such as `en-US` or `es-AR` are
used.
2. In some cases it might be desirable to override the territory
derived from the locale name. For example, the default
territory for the language "en" is "US" but it may be desired
to apply the defaults for the territory "AU" instead, without
otherwise changing the localization intent. In this case
the [U extension](https://unicode.org/reports/tr35/#u_Extension) is
used to define a
[regional override](https://unicode.org/reports/tr35/#RegionOverride).
3. Similarly, the [regional subdivision identifier]
(https://unicode.org/reports/tr35/#UnicodeSubdivisionIdentifier)
can be used to influence localization decisions. This identifier
is not currently used in `ex_cldr` and dependent libraries
however it is correctly parsed to support future use.
"""
@spec territory_from_locale(locale_reference() | String.t(), Cldr.backend()) ::
territory_code() | {:error, {module(), String.t()}}
@doc since: "2.18.2"
def territory_from_locale(locale, backend) do
with {:ok, locale} <- Cldr.validate_locale(locale, backend) do
territory_from_locale(locale)
end
end
@doc """
Returns the script for a locale.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
or any `locale_name` returned by `Cldr.known_locale_names/1`. If
the parameter is a `locale_name` then a default backend must be
configured in `config.exs` or an exception will be raised.
## Returns
* The script to be used for localization purposes.
## Examples
iex> Cldr.Locale.script_from_locale "en-US"
:Latn
iex> Cldr.Locale.script_from_locale "th"
:Thai
"""
@spec script_from_locale(LanguageTag.t() | locale_name()) :: script()
@doc since: "2.31.0"
def script_from_locale(%LanguageTag{} = language_tag) do
language_tag.script || Cldr.default_script()
end
def script_from_locale(locale_name) do
script_from_locale(locale_name, Cldr.default_backend!())
end
@doc """
Returns the script for a locale.
## Arguments
* `locale_name` is any locale name returned by
`Cldr.known_locale_names/1`.
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module.
## Returns
* The script to be used for localization purposes.
## Examples
iex> Cldr.Locale.script_from_locale "en-US", TestBackend.Cldr
:Latn
iex> Cldr.Locale.script_from_locale "th", TestBackend.Cldr
:Thai
"""
@spec script_from_locale(locale_reference(), Cldr.backend()) ::
script() | {:error, {module(), String.t()}}
@doc since: "2.31.0"
def script_from_locale(locale, backend) do
with {:ok, locale} <- Cldr.validate_locale(locale, backend) do
script_from_locale(locale)
end
end
@doc """
Returns the effective time zone for a locale.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
or any `locale_name` returned by `Cldr.known_locale_names/1`. If
the parameter is a `locale_name` then a default backend must be
configured in `config.exs` or an exception will be raised.
## Returns
* The time zone ID as a `String.t` or `{:error, {exception, reason}}`
## Examples
iex> Cldr.Locale.timezone_from_locale "en-US-u-tz-ausyd"
"Australia/Sydney"
iex> Cldr.Locale.timezone_from_locale "en-AU"
{:error,
{Cldr.AmbiguousTimezoneError,
"Cannot determine the timezone since the territory :AU has 24 timezone IDs"}}
"""
@spec timezone_from_locale(LanguageTag.t() | locale_name() | String.t()) ::
String.t() | {:error, {module(), String.t()}}
@doc since: "2.19.0"
def timezone_from_locale(%LanguageTag{locale: %{timezone: timezone}})
when not is_nil(timezone) do
timezone
end
def timezone_from_locale(%LanguageTag{} = language_tag) do
territory = territory_from_locale(language_tag)
with {:ok, [zone]} <- Cldr.Timezone.timezones_for_territory(territory) do
zone
else
{:ok, zones} -> ambiguous_timezone_error(territory, zones)
_ -> Cldr.unknown_territory_error(territory)
end
end
def timezone_from_locale(locale_name) do
timezone_from_locale(locale_name, Cldr.default_backend!())
end
@doc """
Returns the effective time zone for a locale.
## Arguments
* `locale_name` is any name returned by `Cldr.known_locale_names/1`
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module
## Returns
* The time zone ID as a `String.t` or `{:error, {exception, reason}}`
## Examples
iex> Cldr.Locale.timezone_from_locale "en-US-u-tz-ausyd", TestBackend.Cldr
"Australia/Sydney"
iex> Cldr.Locale.timezone_from_locale :"en-AU", TestBackend.Cldr
{:error,
{Cldr.AmbiguousTimezoneError,
"Cannot determine the timezone since the territory :AU has 24 timezone IDs"}}
"""
@spec timezone_from_locale(locale_name() | String.t(), Cldr.backend()) ::
String.t() | {:error, {module(), String.t()}}
@doc since: "2.19.0"
def timezone_from_locale(locale, backend) do
with {:ok, locale} <- Cldr.validate_locale(locale, backend) do
timezone_from_locale(locale)
end
end
defp ambiguous_timezone_error(territory, zones) do
zone_count = length(zones)
{:error,
{Cldr.AmbiguousTimezoneError,
"Cannot determine the timezone since the territory #{inspect(territory)} " <>
"has #{zone_count} timezone IDs"}}
end
@doc """
Parses a locale name and returns a `Cldr.LanguageTag` struct
that represents a locale.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
or any `locale_name` returned by `Cldr.known_locale_names/1`
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module
* `options` is a keyword list of options
## Options
* `:add_likely_subtags` is a `boolean` thatdetermines
if subtags that are likely to be applicable to this
language tag are added to the language tag. The default
is `true`.
## Returns
* `{:ok, language_tag}` or
* `{:error, reason}`
## Method
1. The language tag is parsed in accordance with [RFC5646](https://tools.ietf.org/html/rfc5646)
2. Any language, script or region aliases are replaced. This
will replace any obsolete elements with current versions.
3. If a territory, script or language variant is not specified,
then a default is provided using the CLDR information returned by
`Cldr.Locale.likely_subtags/1` if the option `:add_likely_subtags`
is `true` (the default).
4. A `Cldr` locale name is selected that is the nearest fit to the
requested locale.
## Example
iex> Cldr.Locale.canonical_language_tag("en", TestBackend.Cldr)
{
:ok,
%Cldr.LanguageTag{
backend: TestBackend.Cldr,
canonical_locale_name: "en",
cldr_locale_name: :en,
extensions: %{},
gettext_locale_name: "en",
language: "en",
locale: %{},
private_use: [],
rbnf_locale_name: :en,
requested_locale_name: "en",
script: :Latn,
territory: :US,
transform: %{},
language_variants: []
}
}
"""
@spec canonical_language_tag(locale_name | Cldr.LanguageTag.t() | String.t(), Cldr.backend(), Keyword.t()) ::
{:ok, Cldr.LanguageTag.t()} | {:error, {module(), String.t()}}
def canonical_language_tag(locale_name, backend, options \\ [])
def canonical_language_tag(locale_name, backend, options) when is_binary(locale_name) do
case LanguageTag.parse(locale_name) do
{:ok, language_tag} ->
canonical_language_tag(language_tag, backend, options)
{:error, reason} ->
{:error, reason}
end
end
def canonical_language_tag(locale_name, backend, options) when is_atom(locale_name) do
language_tag = Map.get(Cldr.Config.all_language_tags(), locale_name)
if Keyword.get(options, :add_likely_subtags, true) && language_tag do
canonical_language_tag(language_tag, backend, options)
else
canonical_language_tag(to_string(locale_name), backend, options)
end
end
unvalidated_match = quote do
%LanguageTag{cldr_locale_name: var!(locale_name), canonical_locale_name: var!(canonical_name)}
end
def canonical_language_tag(unquote(unvalidated_match) = language_tag, backend, _options)
when not is_nil(locale_name) and not is_nil(canonical_name) do
language_tag =
language_tag
|> put_backend(backend)
|> put_gettext_locale_name()
{:ok, language_tag}
end
def canonical_language_tag(%LanguageTag{} = language_tag, backend, options) do
supress_requested_locale_substitution? = !language_tag.language
likely_subtags? = Keyword.get(options, :add_likely_subtags, true)
language_tag =
language_tag
|> transform_language(backend)
|> put_requested_locale_name(supress_requested_locale_substitution?)
|> substitute_aliases()
with {:ok, language_tag} <- validate_subtags(language_tag),
{:ok, language_tag} <- U.canonicalize_locale_keys(language_tag),
{:ok, language_tag} <- T.canonicalize_transform_keys(language_tag) do
language_tag
|> put_canonical_locale_name()
|> remove_unknown(:script)
|> remove_unknown(:territory)
|> maybe_put_likely_subtags(likely_subtags?)
|> put_backend(backend)
|> put_cldr_locale_name()
|> put_rbnf_locale_name()
|> put_gettext_locale_name()
|> wrap(:ok)
end
end
defp transform_language(%{transform: %{"language" => nil}} = language_tag, _backend) do
language_tag
end
defp transform_language(%{transform: %{"language" => language}} = language_tag, backend) do
canonical_language = canonical_language_tag(language, backend, add_likely_subtags: false)
transform = Map.put(language_tag.transform, "language", canonical_language)
Map.put(language_tag, :transform, transform)
end
defp transform_language(language_tag, _backend) do
language_tag
end
defp maybe_put_likely_subtags(language_tag, true), do: put_likely_subtags(language_tag)
defp maybe_put_likely_subtags(language_tag, _), do: language_tag
@doc false
# def canonical_language_tag(%LanguageTag{backend: nil} = language_tag) do
# canonical_language_tag(language_tag, Cldr.default_backend!(), add_likely_subtags: false)
# end
#
# def canonical_language_tag(%LanguageTag{backend: backend} = language_tag) do
# canonical_language_tag(language_tag, backend, add_likely_subtags: false)
# end
defp wrap(term, tag) do
{tag, term}
end
@doc """
Parses a locale name and returns a `Cldr.LanguageTag` struct
that represents a locale or raises on error.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
or any `locale_name` returned by `Cldr.known_locale_names/1`
* `backend` is any module that includes `use Cldr` and therefore
is a `Cldr` backend module
See `Cldr.Locale.canonical_language_tag/3` for more information.
"""
@spec canonical_language_tag!(locale_name | Cldr.LanguageTag.t(), Cldr.backend(), Keyword.t()) ::
Cldr.LanguageTag.t() | none()
def canonical_language_tag!(language_tag, backend, options \\ []) do
case canonical_language_tag(language_tag, backend, options) do
{:ok, canonical_tag} -> canonical_tag
{:error, {exception, reason}} -> raise exception, reason
end
end
@doc false
def canonical_locale_name(locale_name) when is_binary(locale_name) do
with {:ok, tag} <- Cldr.LanguageTag.Parser.parse(locale_name) do
{:ok, Cldr.LanguageTag.to_string(tag)}
end
end
@doc false
def canonical_locale_name!(locale_name) when is_binary(locale_name) do
case canonical_locale_name(locale_name) do
{:ok, canonical_name} -> canonical_name
{:error, {exception, reason}} -> raise exception, reason
end
end
@doc """
Substitute deprecated subtags with a `Cldr.LanguageTag` with their
non-deprecated alternatives.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
## Method
* Replace any deprecated subtags with their canonical values using the alias
data. Use the first value in the replacement list, if
it exists. Language tag replacements may have multiple parts, such as
`sh` ➞ `sr_Latn` or `mo` ➞ `ro_MD`. In such a case, the original script and/or
region/territory are retained if there is one. Thus `sh_Arab_AQ` ➞ `sr_Arab_AQ`, not
`sr_Latn_AQ`.
* Remove the script code 'Zzzz' and the territory code 'ZZ' if they occur.
* Get the components of the cleaned-up source tag (languages, scripts, and
regions/territories), plus any variants and extensions.
## Example
iex> Cldr.Locale.substitute_aliases Cldr.LanguageTag.Parser.parse!("mo")
%Cldr.LanguageTag{
backend: nil,
canonical_locale_name: nil,
cldr_locale_name: nil,
extensions: %{},
gettext_locale_name: nil,
language: "ro",
language_subtags: [],
language_variants: [],
locale: %{},
private_use: [],
rbnf_locale_name: nil,
requested_locale_name: "mo",
script: nil,
territory: nil,
transform: %{}
}
"""
def substitute_aliases(%LanguageTag{} = language_tag) do
updated_tag =
language_tag
|> replace_root_with_und()
|> substitute(:requested_name)
|> substitute(:language)
|> substitute(:variant)
|> substitute(:script)
|> substitute(:territory)
if updated_tag == language_tag do
updated_tag
else
substitute_aliases(updated_tag)
end
end
defp substitute(%LanguageTag{canonical_locale_name: nil} = language_tag, :requested_name) do
locale_name = locale_name_from(language_tag.language, nil, language_tag.territory, [])
if replacement_tag = aliases(locale_name, :language) do
type_tag = Cldr.LanguageTag.Parser.parse!(locale_name)
replacement_tag =
Map.put(replacement_tag, :language_variants, language_tag.language_variants)
merge_language_tags(replacement_tag, language_tag, type_tag)
else
language_tag
end
end
defp substitute(%LanguageTag{} = language_tag, :requested_name) do
language_tag
end
# No variants so we just check the language for an alias
defp substitute(%LanguageTag{language_variants: []} = language_tag, :language) do
if replacement_tag = aliases(language_tag.language, :language) do
type_tag = Cldr.LanguageTag.Parser.parse!(language_tag.language)
merge_language_tags(replacement_tag, language_tag, type_tag)
else
language_tag
end
end
# One or more language variants which, when combined with the language,
# may have an alias
defp substitute(%LanguageTag{language_variants: variants} = language_tag, :language) do
variants = variant_selectors(variants)
language = language_tag.language
{type_tag, replacement_tag} = find_language_alias(language, variants, :language)
if replacement_tag do
merge_language_tags(replacement_tag, language_tag, type_tag)
else
language_tag
end
end
defp substitute(%LanguageTag{language_variants: []} = language_tag, :variant) do
language_tag
end
defp substitute(%LanguageTag{language_variants: [variant]} = language_tag, :variant) do
{type_tag, replacement_tag} =
find_alias([[variant]], :variant) || find_language_alias("und", [variant], :language)
merge_variants(replacement_tag, language_tag, type_tag)
end
defp substitute(%LanguageTag{language_variants: variants} = language_tag, :variant) do
variants = variant_selectors(variants)
{type_tag, replacement_tag} = find_alias(variants, :variant)
if replacement_tag do
merge_variants(replacement_tag, language_tag, type_tag)
else
language_tag
end
end
defp substitute(%LanguageTag{script: script} = language_tag, :script) do
%{language_tag | script: aliases(script, :script) || script}
end
defp substitute(%LanguageTag{territory: territory} = language_tag, :territory) do
territory =
case aliases(territory, :region) || territory do
territories when is_list(territories) -> hd(territories)
territory when is_atom(territory) -> territory
other -> other
end
%{language_tag | territory: territory}
rescue
ArgumentError ->
language_tag
end
defp replace_root_with_und(%LanguageTag{language: "root"} = language_tag) do
%{language_tag | language: "und"}
end
defp replace_root_with_und(%LanguageTag{} = language_tag) do
language_tag
end
defp remove_unknown(%LanguageTag{script: "Zzzz"} = language_tag, :script) do
%{language_tag | script: nil}
end
defp remove_unknown(%LanguageTag{} = language_tag, :script), do: language_tag
defp remove_unknown(%LanguageTag{territory: :ZZ} = language_tag, :territory) do
%{language_tag | territory: nil}
end
defp remove_unknown(%LanguageTag{} = language_tag, :territory), do: language_tag
defp put_canonical_locale_name(language_tag) do
language_tag
|> Map.put(:canonical_locale_name, Cldr.LanguageTag.to_string(language_tag))
end
defp put_backend(language_tag, backend) do
language_tag
|> Map.put(:backend, backend)
end
@spec put_requested_locale_name(Cldr.LanguageTag.t(), boolean()) :: Cldr.LanguageTag.t()
defp put_requested_locale_name(language_tag, true) do
language_tag
end
defp put_requested_locale_name(language_tag, false) do
language_tag
|> Map.put(:requested_locale_name, locale_name_from(language_tag, false))
end
@spec put_cldr_locale_name(Cldr.LanguageTag.t()) :: Cldr.LanguageTag.t()
defp put_cldr_locale_name(%LanguageTag{} = language_tag) do
cldr_locale_name = cldr_locale_name(language_tag)
%{language_tag | cldr_locale_name: cldr_locale_name}
end
@spec put_rbnf_locale_name(Cldr.LanguageTag.t()) :: Cldr.LanguageTag.t()
defp put_rbnf_locale_name(%LanguageTag{} = language_tag) do
rbnf_locale_name = rbnf_locale_name(language_tag)
%{language_tag | rbnf_locale_name: rbnf_locale_name}
end
@spec put_gettext_locale_name(Cldr.LanguageTag.t()) :: Cldr.LanguageTag.t()
def put_gettext_locale_name(%LanguageTag{} = language_tag) do
gettext_locale_name = gettext_locale_name(language_tag)
%{language_tag | gettext_locale_name: gettext_locale_name}
end
@spec put_gettext_locale_name(Cldr.LanguageTag.t(), Cldr.Config.t()) :: Cldr.LanguageTag.t()
def put_gettext_locale_name(%LanguageTag{} = language_tag, config) do
gettext_locale_name = gettext_locale_name(language_tag, config)
%{language_tag | gettext_locale_name: gettext_locale_name}
end
@spec cldr_locale_name(Cldr.LanguageTag.t()) :: locale_name() | nil
defp cldr_locale_name(%LanguageTag{} = language_tag) do
first_match(language_tag, &known_locale(&1, &2, language_tag.backend)) ||
Cldr.known_locale_name(language_tag.requested_locale_name, language_tag.backend)
end
@spec rbnf_locale_name(Cldr.LanguageTag.t()) :: locale_name | nil
defp rbnf_locale_name(%LanguageTag{language: @root_language}) do
@root_rbnf_locale_name
end
# Get the rbnf locale name for this locale. If not found, see
# if a parent has RBNF> Note parent in this case means direct parent,
# not the fallback chain.
defp rbnf_locale_name(%LanguageTag{} = language_tag) do
cond do
rbnf_locale = first_match(language_tag, &known_rbnf_locale_name(&1, &2, language_tag.backend)) ->
rbnf_locale
parent = Map.get(parent_locale_map(), language_tag.cldr_locale_name) ->
case Cldr.validate_locale(parent, language_tag.backend) do
{:ok, parent} -> rbnf_locale_name(parent)
{:error, _} -> nil
end
true ->
nil
end
end
@spec gettext_locale_name(Cldr.LanguageTag.t()) :: locale_name | nil
defp gettext_locale_name(%LanguageTag{} = language_tag) do
language_tag
|> first_match(&known_gettext_locale_name(&1, &2, language_tag.backend))
|> locale_name_to_posix
end
# Used at compile time only
defp gettext_locale_name(%LanguageTag{} = language_tag, config) do
language_tag
|> first_match(&known_gettext_locale_name(&1, &2, config))
|> locale_name_to_posix
end
@spec known_gettext_locale_name(locale_name(), Cldr.backend() | Cldr.Config.t()) ::
locale_name() | false
def known_gettext_locale_name(locale_name, tags \\ [], backend)
def known_gettext_locale_name(locale_name, _tags, backend) when is_atom(backend) do
gettext_locales = backend.known_gettext_locale_names()
Enum.find(gettext_locales, &(&1 == locale_name)) || false
end
# This clause is only called at compile time when we're
# building a backend. In normal use is should not be used.
@doc false
def known_gettext_locale_name(locale_name, _tags, config) when is_map(config) do
gettext_locales = Cldr.Config.known_gettext_locale_names(config)
Enum.find(gettext_locales, &(&1 == locale_name)) || false
end
# Describes the ways in which a locale match can
# be constructed and the tags that are consumed
# in order to create a successful match.
potential_matches = quote do
[
{[language, script, territory, [], omit?], [:language, :script, :territory]},
{[language, nil, territory, [], omit?], [:language, :territory]},
{[language, script, nil, [], omit?], [:language, :script]},
{[language, nil, nil, [], omit?], [:language]}
]
end
potential_variant_matches = quote do
[
{[language, script, territory, variants, omit?],
[:language, :script, :territory, :language_variants]},
{[language, nil, territory, variants, omit?],
[:language, :territory, :language_variants]},
{[language, script, nil, variants, omit?],
[:language, :script, :language_variants]},
{[language, nil, nil, variants, omit?],
[:language, :language_variants]}
]
end
# Generate the expressions that check for
# the first match
defmacrop matches(matches) do
for {params, tags} <- matches do
params = Enum.map(params, fn
[] -> []
nil -> nil
var -> {:var!, [], [var]}
end)
quote do
var!(fun).(locale_name_from(unquote_splicing(params)), unquote(tags))
end
end
|> Enum.reverse
|> Enum.reduce(fn exp, acc -> {:||, [], [exp, acc]} end)
end
@doc """
Execute a function for a locale returning
the first match on language, script, territory,
and variant combination.
A match is determined when the `fun/1` returns
a `truthy` value.
## Arguments
* `language_tag` is any language tag returned by
`Cldr.Locale.new/2`.
* `fun/2` is 2-arity function that takes a string
locale name and subtags. The locale name is a built from the language,
script, territory and variant combinations of `language_tag`.
## Returns
* The first `truthy` value returned by `fun/2` or `nil` if no
match is made.
"""
def first_match(%LanguageTag{} = language_tag, fun, omit_singular_script? \\ false)
when is_function(fun, 2) do
%LanguageTag{language: language, script: script, territory: territory} = language_tag
%LanguageTag{language_variants: variants} = language_tag
first_match(language, script, territory, variants, fun, omit_singular_script?)
end
defp first_match(language, script, territory, variants, fun, omit? \\ false)
defp first_match(language, script, territory, [], fun, omit?) do
matches(unquote(potential_matches)) || nil
end
defp first_match(language, script, territory, variants, fun, omit?) do
matches(unquote(potential_variant_matches)) || matches(unquote(potential_matches)) || nil
end
@doc """
Return a locale name from a `Cldr.LanguageTag`
## Options
* `locale_name` is any `Cldr.LanguageTag` struct returned by
`Cldr.Locale.new!/2`
## Example
iex> Cldr.Locale.locale_name_from Cldr.Locale.new!("en", TestBackend.Cldr)
"en"
"""
@spec locale_name_from(Cldr.LanguageTag.t()) :: locale_name()
def locale_name_from(language_tag, omit_singular_script? \\ true)
def locale_name_from(%LanguageTag{canonical_locale_name: nil} = tag, omit_singular_script?) do
%LanguageTag{language: language, script: script, territory: territory} = tag
%LanguageTag{language_variants: language_variants} = tag
locale_name_from(language, script, territory, language_variants, omit_singular_script?)
end
def locale_name_from(%LanguageTag{canonical_locale_name: locale_name}, _omit_singular_script?) do
locale_name
end
def locale_name_from([language, script, territory, variants], omit_singular_script?) do
locale_name_from(language, script, territory, variants, omit_singular_script?)
end
@doc """
Return a locale name by combining language, script, territory and variant
parameters
## Arguments
* `language` is a string representing
a valid language code
* `script` is an atom that is a valid
script code.
* `territory` is an atom that is a valid
territory code.
* `variants` is a list of language variants as lower
case string or `[]`
## Returns
* The atom locale name constructed from the non-nil arguments joined
by a "-"
## Example
iex> Cldr.Locale.locale_name_from("en", :Latn, "001", [])
"en-001"
iex> Cldr.Locale.locale_name_from("en", :Latn, :"001", [])
"en-001"
"""
@spec locale_name_from(language(), script(), territory_reference(), variants(), boolean) ::
String.t()
def locale_name_from(language, script, territory, variants, omit_singular_script? \\ true) do
[language, script, territory, variants]
|> omit_script_if_only_one(omit_singular_script?)
|> Enum.reject(&is_nil/1)
|> Enum.reject(&(&1 == []))
|> Enum.join("-")
end
@doc false
def join_variants([]), do: nil
def join_variants(variants), do: variants |> Enum.sort() |> Enum.join("-")
# If the language has only one script for a given territory then
# we omit it in the canonical form
defp omit_script_if_only_one([_language, nil, _territory, _variants] = tag, _) do
tag
end
# If the language has only one script for a given territory then
# we omit it in the canonical form
defp omit_script_if_only_one([language, script, territory, variants], true) do
language_map = Map.get(language_data(), language, %{})
script = maybe_nil_script(Map.get(language_map, :primary), script, territory)
[language, script, territory, variants]
end
defp omit_script_if_only_one([language, script, territory, variants], false) do
[language, script, territory, variants]
end
# No :secondary
defp maybe_nil_script(nil, _script, _territory) do
nil
end
# There is only one script for this territory and its the requested one
# so its not required for the canonical form
defp maybe_nil_script(%{scripts: [script], territories: _territories}, script, _territory) do
nil
end
# In all other cases we keep the script
defp maybe_nil_script(%{scripts: _scripts, territories: _territories}, script, _territory) do
script
end
@doc """
Replace empty subtags within a `t:Cldr.LanguageTag.t/0` with the most likely
subtag.
## Arguments
* `language_tag` is any language tag returned by `Cldr.Locale.new/2`
* `options` is a keyword list of options
## Options
* `:add_likely` is a boolean indicating whether to add
likely subtags. The default is `true`.
## Notes
A subtag is called empty if it has a missing script or territory subtag, or it is
a base language subtag with the value `und`. In the description below,
a subscript on a subtag x indicates which tag it is from: x<sub>s</sub> is in the
source, x<sub>m</sub> is in a match, and x<sub>r</sub> is in the final result.
## Lookup
Lookup each of the following in order, and stops on the first match:
* language<sub>s</sub>-script<sub>s</sub>-region<sub>s</sub>
* language<sub>s</sub>-region<sub>s</sub>
* language<sub>s</sub>-script<sub>s</sub>
* language<sub>s</sub>
* und-script<sub>s</sub>
## Returns
* If there is no match,either return
* an error value, or
* the match for `und`
* Otherwise there is a match = language<sub>m</sub>-script<sub>m</sub>-region<sub>m</sub>
* Let x<sub>r</sub> = x<sub>s</sub> if x<sub>s</sub> is not empty, and x<sub>m</sub> otherwise.
* Return the language tag composed of language<sub>r</sub>-script<sub>r</sub>-region<sub>r</sub> + variants + extensions .
## Example
iex> Cldr.Locale.put_likely_subtags Cldr.LanguageTag.parse!("zh-SG")
%Cldr.LanguageTag{
backend: nil,
canonical_locale_name: nil,
cldr_locale_name: nil,
language_subtags: [],
extensions: %{},
gettext_locale_name: nil,
language: "zh",
locale: %{},
private_use: [],
rbnf_locale_name: nil,
requested_locale_name: "zh-SG",
script: :Hans,
territory: "SG",
transform: %{},
language_variants: []
}
"""
def put_likely_subtags(%LanguageTag{} = language_tag) do
%LanguageTag{language: language, script: script, territory: territory} = language_tag
subtags =
likely_subtags(locale_name_from(language, script, territory, [])) ||
likely_subtags(locale_name_from(language, nil, territory, [])) ||
likely_subtags(locale_name_from(language, script, nil, [])) ||
likely_subtags(locale_name_from(language, nil, nil, [])) ||
likely_subtags(locale_name_from("und", script, nil, [])) ||
likely_subtags(locale_name_from("und", nil, nil, []))
Map.merge(subtags, language_tag, fn _k, v1, v2 -> if empty?(v2), do: v1, else: v2 end)
end
# The process of applying alias substitutions is a map merge.
# In merging we ignore "und" fields, merge fields of interes
# and preserve all the other fields unchanged.
@merge_fields [:language, :script, :territory, :language_variants]
defp merge_language_tags(replacement_tag, source_tag, type_tag) do
Map.merge(replacement_tag, source_tag, fn
_k, "und", source_field ->
source_field
k, replacement_field, source_field when k in @merge_fields ->
type_field = type_field_from(type_tag, k)
replacement_field = field_from(replacement_field)
source_field = field_from(source_field, k)
# IO.inspect {source_field, replacement_field, type_field}, label: inspect(k)
replace(k, source_field, replacement_field, type_field)
_k, _replacement_field, source_field ->
source_field
end)
end
defp merge_variants(replacement, source_tag, type_tag) do
type_field = type_field_from(type_tag, :language_variants)
replacement_field = type_field_from(replacement, :language_variants)
source_field = type_field_from(source_tag, :language_variants)
replaced = replace(:language_variants, source_field, replacement_field, type_field)
%{source_tag | language_variants: replaced}
end
# Merge a single map field according to TR35
# https://unicode-org.github.io/cldr/ldml/tr35.html#replacement
#
# if type.field ≠ {}
# source.field = (source.field - type.field) ∪ replacement.field
# else if source.field = {} and replacement.field ≠ {}
# source.field = replacement.field
#
# The `Kernel.--` and `Kernel.++` operators appear to preserve order
# and since the data is ordered on arrival it appears to remain
# ordered after replacement.
defp replace(:language_variants, source_field, replacement_field, [_ | _] = type_field) do
do_replace(source_field, replacement_field, type_field)
end
defp replace(_field, source_field, replacement_field, [_ | _] = type_field) do
case do_replace(source_field, replacement_field, type_field) do
[] -> nil
other -> hd(other)
end
end
defp replace(:language_variants, [], [_ | _] = replacement_field, _type_field) do
replacement_field
end
defp replace(_field, [], [replacement_field], _type_field) do
replacement_field
end
defp replace(:language_variants, source_field, _replacement_field, _type_field) do
source_field
end
defp replace(_field, [], _replacement_field, _type_field) do
nil
end
defp replace(_field, [element], _replacement_field, _type_field) do
element
end
defp do_replace(source_field, replacement_field, type_field) do
(source_field -- type_field) ++ replacement_field
end
# In order to support replace/3, all arguments need
# to be lists. This function converts field from
# a language tag into the most relevant list representation.
defp type_field_from(nil, _), do: []
defp type_field_from(tag, :language_variants = key) do
Map.fetch!(tag, key)
end
defp type_field_from(tag, key) do
case Map.fetch!(tag, key) do
nil -> []
other -> [other]
end
end
# Convert a simple term into its most
# relevant list representation in order
# to support replace/4 which uses
# list operations.
defp field_from(nil), do: []
defp field_from(field) when is_list(field), do: field
defp field_from(field), do: [field]
defp field_from(field, :territory) when is_binary(field) do
case Integer.parse(field) do
{int, ""} when int in 0..999 -> [field]
_other -> [field]
end
end
defp field_from(field, _), do: field_from(field)
# When looking for alias substitutions we need to check
# a number of possible combinations of language and
# variants. For performance reasons we pre-calculate
# the combinations.
# This will crash if there are more than 4 variants
# which is possible but highly unlikely
defp variant_selectors([a]),
do: [[a]]
defp variant_selectors([a, b]),
do: [[a, b], [a], [b]]
defp variant_selectors([a, b, c]),
do: [[a, b, c], [a, b], [a, c], [b, c], [a], [b], [c]]
defp variant_selectors([a, b, c, d]),
do: [
[a, b, c, d],
[a, b, c],
[a, c, d],
[b, c, d],
[a, b],
[a, c],
[a, d],
[b, c],
[b, d],
[c, d],
[a],
[b],
[c],
[d]
]
# When we sort the candidate variants its a bi-level sort
# First on the length of the variants (ignoring "und") and
# then lexically
defp sort_variants(language, variants) do
Enum.flat_map(variants, &[[language | &1], ["und" | &1]])
|> Enum.sort(fn
["und" | rest1], ["und" | rest2] ->
if length(rest1) == length(rest2),
do: rest1 < rest2,
else: length(rest1) > length(rest2)
["und" | rest1], rest2 ->
if length(rest1) == length(rest2),
do: rest1 < rest2,
else: length(rest1) > length(rest2)
rest1, ["und" | rest2] ->
if length(rest1) == length(rest2),
do: rest1 < rest2,
else: length(rest1) > length(rest2)
rest1, rest2 ->
if length(rest1) == length(rest2),
do: rest1 < rest2,
else: length(rest1) > length(rest2)
end)
|> List.insert_at(0, [language])
end
# Finding a language alias requires recursing
# over the list of possible variants that are in
# a known and stable order. Since the merging of
# substitutions works on language tags, a successful
# match parses and returns the variant combination
# that led to the match.
defp find_language_alias(language, variants, type) do
variants = sort_variants(language, variants)
Enum.reduce_while(variants, {nil, nil}, fn variant, acc ->
alias_key = Enum.join(variant, "-")
if alias_tag = aliases(alias_key, type) do
type_field = Cldr.LanguageTag.Parser.parse!(alias_key)
{:halt, {type_field, alias_tag}}
else
{:cont, acc}
end
end)
end
# Similarly, finding a variant match recurses over
# the possible combinations and returns a
# language tag representing the variant combination
# that matched.
defp find_alias(variants, type) do
Enum.reduce_while(variants, {nil, nil}, fn variant, acc ->
alias_key = Enum.join(variant, "-")
if alias_tag = aliases(alias_key, type) do
type_tag = Cldr.LanguageTag.Parser.parse!("und-" <> alias_key)
replacement_tag = Cldr.LanguageTag.Parser.parse!("und-" <> alias_tag)
{:halt, {type_tag, replacement_tag}}
else
{:cont, acc}
end
end)
end
@doc """
Returns the map of likely subtags.
Note that not all locales are guaranteed
to have likely subtags.
## Example
Cldr.Locale.likely_subtags
=> %{
bez; %Cldr.LanguageTag{
backend: TestBackend.Cldr,
canonical_locale_name: nil,
cldr_locale_name: nil,
extensions: %{},
language: "bez",
locale: %{},
private_use: [],
rbnf_locale_name: nil,
requested_locale_name: nil,
script: :Latn,
territory: :TZ,
transform: %{},
language_variants: []
},
fuf: %Cldr.LanguageTag{
canonical_locale_name: nil,
cldr_locale_name: nil,
extensions: %{},
language: "fuf",
locale: %{},
private_use: [],
rbnf_locale_name: nil,
requested_locale_name: nil,
script: :Latn,
territory: :GN,
transform: %{},
language_variants: []
},
...
"""
@likely_subtags Cldr.Config.likely_subtags()
def likely_subtags do
@likely_subtags
end
@doc """
Returns the likely substags, as a `Cldr.LanguageTag`,
for a given locale name.
## Options
* `locale` is any valid locale name returned by `Cldr.known_locale_names/1`
or a `Cldr.LanguageTag` struct
## Examples
iex> Cldr.Locale.likely_subtags :en
%Cldr.LanguageTag{
backend: nil,
canonical_locale_name: nil,
cldr_locale_name: nil,
extensions: %{},
gettext_locale_name: nil,
language: "en",
locale: %{},
private_use: [],
rbnf_locale_name: nil,
requested_locale_name: "en-Latn-US",
script: :Latn,
territory: :US,
transform: %{},
language_variants: []
}
"""
@spec likely_subtags(locale_name | String.t()) :: LanguageTag.t() | nil
def likely_subtags(locale_name) when is_atom(locale_name) do
Map.get(likely_subtags(), locale_name)
end
def likely_subtags(%LanguageTag{requested_locale_name: requested_locale_name}) do
likely_subtags(requested_locale_name)
end
def likely_subtags(locale_name) when is_binary(locale_name) do
locale_name
|> String.to_existing_atom()
|> likely_subtags()
rescue ArgumentError ->
nil
end
@doc """
Return a map of the known aliases for Language, Script and Territory
"""
@aliases Cldr.Config.aliases()
@spec aliases :: map()
def aliases do
@aliases
end
@doc """
Return a map of the aliases for a given alias key and type
## Options
* `type` is one of `[:language, :region, :script, :variant, :zone]`
* `key` is the substitution key (a language, region, script, variant or zone)
"""
@alias_keys Map.keys(@aliases)
@spec aliases(locale_name() | String.t(), atom()) ::
String.t() | list(String.t()) | LanguageTag.t() | nil
def aliases(key, :region = type) when is_atom(key) do
aliases()
|> Map.get(type)
|> Map.get(to_string(key))
end
def aliases(key, type) when type in @alias_keys do
aliases()
|> Map.get(type)
|> Map.get(key)
end
def aliases(key, type) when is_binary(key) do
key
|> String.to_existing_atom()
|> aliases(type)
rescue ArgumentError ->
nil
end
defp validate_subtags(language_tag) do
with {:ok, language_tag} <- validate(language_tag, :language),
{:ok, language_tag} <- validate(language_tag, :script),
{:ok, language_tag} <- validate(language_tag, :territory),
{:ok, language_tag} <- validate(language_tag, :variants) do
{:ok, language_tag}
end
end
defp validate(language_tag, :language) do
case Cldr.Validity.Language.validate(language_tag.language) do
{:ok, language, _} -> {:ok, %{language_tag | language: language}}
{:error, _} -> {:error, invalid_language_error(language_tag.language)}
end
end
defp validate(language_tag, :script) do
case Cldr.Validity.Script.validate(language_tag.script) do
{:ok, script, _} -> {:ok, %{language_tag | script: script}}
{:error, _} -> {:error, invalid_script_error(language_tag.script)}
end
end
defp validate(language_tag, :territory) do
case Cldr.Validity.Territory.validate(language_tag.territory) do
{:ok, territory, _} -> {:ok, %{language_tag | territory: territory}}
{:error, _} -> {:error, invalid_territory_error(language_tag.territory)}
end
end
defp validate(language_tag, :variants) do
case Cldr.Validity.Variant.validate(language_tag.language_variants) do
{:ok, variants, _} -> {:ok, %{language_tag | language_variants: variants}}
{:error, variant} -> {:error, invalid_variant_error(variant)}
end
end
@doc false
def locale_error(%LanguageTag{requested_locale_name: requested_locale_name}) do
locale_error(requested_locale_name)
end
def locale_error(locale_name) do
{Cldr.UnknownLocaleError, "The locale #{inspect(locale_name)} is not known."}
end
@doc false
def gettext_locale_error(%LanguageTag{gettext_locale_name: gettext_locale_name}) do
gettext_locale_error(gettext_locale_name)
end
def gettext_locale_error(locale_name) do
{Cldr.UnknownLocaleError, "The gettext locale #{inspect(locale_name)} is not known."}
end
@doc false
def invalid_language_error(language) do
{Cldr.InvalidLanguageError, "The language #{inspect(language)} is invalid"}
end
@doc false
def invalid_script_error(script) do
{Cldr.InvalidScriptError, "The script #{inspect(script)} is invalid"}
end
@doc false
def invalid_territory_error(territory) do
{Cldr.InvalidTerritoryError, "The territory #{inspect(territory)} is invalid"}
end
@doc false
def invalid_variant_error(variant) do
{Cldr.InvalidVariantError, "The variant #{inspect(variant)} is invalid"}
end
@doc false
def alias_error(locale_name, alias_name) when is_binary(locale_name) do
{
Cldr.UnknownLocaleError,
"The locale #{inspect(locale_name)} and its " <>
"alias #{inspect(alias_name)} are not known."
}
end
@doc false
def alias_error(%LanguageTag{requested_locale_name: requested_locale_name}, alias_name) do
alias_error(requested_locale_name, alias_name)
end
@doc false
def no_locale_for_territory_error(territory) when is_binary(territory) do
{Cldr.UnknownLocaleError, "No locale was identified for territory #{inspect territory}"}
end
def no_locale_for_territory_error(territory) when is_atom(territory) do
territory
|> Atom.to_string()
|> String.downcase()
|> no_locale_for_territory_error()
end
end