feat(locale): Add ku_ckb locale #3441

arentalb · 2025-03-17T20:49:10Z

This PR adds Kurdish support for the lorem module in Faker.js. This is the first step towards adding full Kurdish locale support, with more modules to come in future updates.

Ran pnpm run preflight with no issues
Generated locales with pnpm run generate:locales
Formatted code (pnpm run format) and passed linting (pnpm run lint)
Verified tests pass

Let me know if any changes are needed!

netlify · 2025-03-17T20:49:28Z

✅ Deploy Preview for fakerjs ready!

Built without sensitive environment variables

Name	Link
🔨 Latest commit	`31ea85d`
🔍 Latest deploy log	https://app.netlify.com/projects/fakerjs/deploys/68e0b2437eeccc00080a789e
😎 Deploy Preview	https://deploy-preview-3441.fakerjs.dev
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

codecov · 2025-03-17T20:53:29Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.97%. Comparing base (dce234e) to head (31ea85d).
⚠️ Report is 1 commits behind head on next.

Additional details and impacted files

@@           Coverage Diff           @@
##             next    #3441   +/-   ##
=======================================
  Coverage   99.97%   99.97%           
=======================================
  Files        2894     2899    +5     
  Lines      222390   222476   +86     
  Branches      932      932           
=======================================
+ Hits       222337   222423   +86     
  Misses         53       53

Files with missing lines	Coverage Δ
src/locale/index.ts	`100.00% <100.00%> (ø)`
src/locale/ku_ckb.ts	`100.00% <100.00%> (ø)`
src/locales/index.ts	`100.00% <100.00%> (ø)`
src/locales/ku_ckb/index.ts	`100.00% <100.00%> (ø)`
src/locales/ku_ckb/lorem/index.ts	`100.00% <100.00%> (ø)`
src/locales/ku_ckb/lorem/word.ts	`100.00% <100.00%> (ø)`
src/locales/ku_ckb/metadata.ts	`100.00% <100.00%> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

matthewmayer · 2025-03-18T00:11:51Z

According to Wikipedia https://en.wikipedia.org/wiki/Kurdish_language

The main varieties of Kurdish are Kurmanji, Sorani, and Southern Kurdish (Xwarîn). The majority of the Kurds speak Kurmanji,[15] and most Kurdish texts are written in Kurmanji and Sorani. Kurmanji is written in the Hawar alphabet, a derivation of the Latin script, and Sorani is written in the Sorani alphabet, a derivation of the Arabic script.

So I wonder if we should give this a script suffix similar to we did for Uzbek and Serbian https://fakerjs.dev/guide/localization.html#available-locales to distinguish Kurdish written in Latin characters and Arabic characters?

arentalb · 2025-03-18T19:47:02Z

Kurdish has multiple dialects, but the main ones are Kurmanji and Sorani. Kurmanji can be written in both Latin and Arabic scripts. I plan to add these three variations

ku_KMR_latin → Kurmanji (Latin)
ku_KMR_arab → Kurmanji (Arabic)
ku_CKB_arab → Sorani (Arabic)

Would this be the correct approach for Faker.js? I want to confirm before proceeding. @matthewmayer

matthewmayer · 2025-03-19T10:11:16Z

Hmm I don't think we yet have any other locales where it's necessary to disambiguate by ISO 639-3 code for different dialects. I think it might be confusing to put the language code where we would otherwise put the country code so it probably makes sense to put it as part of the variant suffix. Let's also check how other localisation of open source software handles this.

arentalb · 2025-03-19T22:34:44Z

I searched for a solution but couldn’t find anything that exactly fits our needs.

However, I believe we can handle this using the following approach:

ku_IQ_Arab: Sorani (Arabic) IQ represents Iraq, where most Sorani speakers live.
ku_TR_Latn: Kurmanji (Latin) TR represents Turkey, where most Kurmanji speakers use the Latin script.
ku_SY_Arab: Kurmanji (Arabic) SY represents Syria, where many Kurmanji speakers use the Arabic script.

This approach seems like our best fit.

However, I don’t like using IQ, TR, and SY because Kurdish people are divided among four countries, including Iran. I wish I could use KU instead, but unfortunately, it is not a standard code.

matthewmayer · 2025-03-20T01:45:23Z

I agree the country codes aren't ideal. It partly depends if you expect to add data for eg phone module which has country codes , location module which has cities etc. if those are all from one country then a country code would be appropriate. If not then we should try to find a generic solution.

matthewmayer · 2025-03-20T03:47:59Z

Wikipedia uses ku for Kurmanji (with a toggle between ku-latn and ku-arab) and ckb for Sorani

matthewmayer · 2025-03-20T03:51:21Z

Mozilla uses
https://pontoon.mozilla.org/sdh/
https://pontoon.mozilla.org/kmr/
https://pontoon.mozilla.org/ckb/

matthewmayer · 2025-03-20T03:58:56Z

I think the cleanest would probably be:

ku_kmr_latin → Kurdish (Kurmanji, Latin) - variant kmr_latin in metadata, fakerKU_kmr_latin when precompiled
ku_kmr_arab → Kurdish (Kurmanji, Arabic) - variant kmr_arab in metadata, fakerKU_kmr_arab when precompiled
ku_ckb → Kurdish (Sorani) - variant ckb in metadata, fakerKU_ckb when precompiled

arentalb · 2025-03-20T10:19:35Z

i though we should use only contry code like IQ after the ku_ , but if it is fine to use kmr and ckb like you said that would be the best way , thank you , i will start with ku_ckb

arentalb · 2025-03-20T11:30:58Z

@matthewmayer I’ve updated the changes and pushed them. Do I need to do anything else besides adding more modules?

matthewmayer · 2025-03-20T13:06:19Z

For ease for review please don't add any more modules for now. We prefer to get a small PR reviewed first, once it is approved you can follow up with additional PRs for other dialects and modules.

Nothing to do for now, I will let the other maintainers take a look. Please bear with us it can take a few days or weeks to get initial PR approved.

Shinigami92 · 2025-03-22T11:16:10Z

i though we should use only contry code like IQ after the ku_ , but if it is fine to use kmr and ckb like you said that would be the best way , thank you , i will start with ku_ckb

Hello 🙂

Thanks to @matthewmayer to drive this PR in review process for us 🚀
I think we (the @faker-js/maintainers) should have a meeting about how we handle this, because at first look my brain immediately told me also that this differs from our pattern we are used to (except of en_AU_ocker and en_BORK, but even there it is /^[a-z]{2}_[A-Z]{2}.*/) 🤔

Otherwise I would give already an approve ✅

matthewmayer · 2025-03-22T11:39:06Z

i though we should use only contry code like IQ after the ku_ , but if it is fine to use kmr and ckb like you said that would be the best way , thank you , i will start with ku_ckb

Hello 🙂

Thanks to @matthewmayer to drive this PR in review process for us 🚀 I think we (the @faker-js/maintainers) should have a meeting about how we handle this, because at first look my brain immediately told me also that this differs from our pattern we are used to (except of en_AU_ocker and en_BORK, but even there it is /^[a-z]{2}_[A-Z]{2}.*/) 🤔

Otherwise I would give already an approve ✅

i think en_BORK is the only other example of a locale with a language and variant but no country at the moment. (Admittedly a silly example, and "BORK" should probably be lowercase but i dont want to introduce a breaking change for a silly Easter egg locale).

But in this case when people speaking Kurdish are spread over several countries, it seems more appropriate.

Shinigami92 · 2025-03-22T11:41:48Z

i think en_BORK is the only other example of a locale with a language and variant but no country at the moment. (Admittedly a silly example, and "BORK" should probably be lowercase but i dont want to introduce a breaking change for a silly Easter egg locale).

But in this case when people speaking Kurdish are spread over several countries, it seems more appropriate.

These are exactly also my background thoughts 👍

matthewmayer · 2025-03-22T11:47:21Z

I think we don't have a really good definition for what goes in "variant" at the moment, but i think its basically

"the minimum needed to disambiguate versions of a macro-language", which might include a part 3 code from https://en.wikipedia.org/wiki/List_of_ISO_639_language_codes (e.g. ckb) and/or a script (e.g. latin, arab) and/or another descriptor if there's no standard code (en_GB_cockney_rhyming_slang)?

xDivisionByZerox · 2025-04-03T16:42:07Z

@arentalb I'll be honest with you here. We discussed this PR during the last two weekly meetings and still haven't come to a conclusion. The case you present is quite the hard one.

For starters, lets have a look into our current definition for locale names from our website: https://fakerjs.dev/guide/localization.html#locale-codes.
We noticed that the second paragraph was not as clearly written as we'd like it to be.

The same language may be spoken in different countries, with different patterns for addresses, phone numbers etc. Optionally a two-letter uppercase country code can be added after an underscore, following the ISO 3166-1 alpha-2 standard, for example en_US represents English (United States) and en_AU represents English (Australia).

One could argue that the "Optionally" indicates that the country code is optionally entirely, while another one could say that it is only optionally when considering the base locale code (from the first paragraph). By making the country code required, we could at least somewhat restrict the maximum amount of possible faker locales. Furthermore, a lot of modules (example internet, location) are only usful if they are paired with a specific country.

Having to maintain an indefinite amount of locales is sadly a fact we need to consider as maintainers.
Since we already have quite a lot of locales, making this decision right now, is not as easy as it might seem. Furthermore, we need to consider the implementation for future locales. Making the country code optionally could potentially lead to a drastic increase of "easter egg" locales like en_BORK as they would then be defined a valid locale.

We totally see the reasoning of your disliking regarding the implementation by definition. But by the current definition, this might be the best option.
To get this matter resolved more easily, could you imagine to create a generic ku locale that all other ku_* will fall back to? Or is this not possible due to the nature of the language itself being heavily defined into dialects?

arentalb · 2025-04-03T18:17:14Z

Thanks for reviewing the PR — much appreciated!

Regarding your suggestion, yes — I can definitely work on a generic ku locale.

I believe the best candidate for a base locale would be Kurdish Sorani (Central Kurdish) in Arabic script. It's my native dialect and also the most widely understood among Kurdish speakers.

do you agree with using Sorani Arabic script for the generic ku, or would you recommend a different approach?

arentalb · 2025-04-04T16:26:11Z

english-sorani-kurmanji.pdf

Sorry for the late reply

I’ve translated a set of base words to compare Sorani and Kurmanji - as shown in the file, there are noticeable differences between the two dialects.

While a few words are shared, the overlap doesn’t seem strong enough to build a reliable base fallback, in my opinion. So I don’t think a generic Kurdish locale is the right approach.

That leaves us with two possible options:

1- Fallback to one dialect - I’d suggest Sorani. In my experience, Kurmanji speakers usually understand Sorani, but the reverse isn’t true - Sorani speakers typically understand only about 20–30% of Kurmanji.

2- Treat them as separate locales - This might be the better option overall, especially since Kurmanji has two scripts (Latin and Arabic).

xDivisionByZerox

I'll approve this with the decision of allowing the "Kurdish"-locale in the following variants:

title	variant	script	locale name	faker instance name
`'Kurdish (Sorani)'`	`'ckb'`	`'Arab'`	ku_ckb	fakerKU_ckb
`'Kurdish (Kurmanji, Latin)'`	`'kmr_latin'`	`'Latn'`	ku_kmr_latin	fakerKU_kmr_latin
`'Kurdish (Kurmanji, Arabic)'`	`'kmr_arab'`	`'Arab'`	ku_kmr_arab	fakerKU_kmr_arab

Thank you for your patience and endurance @arentalb. May you be able to resolve the current conflicts by updating the snapshots for a final time?

arentalb · 2025-10-03T15:21:09Z

Thank you for the approval and the detailed instructions. I will resolve the conflicts and update the snapshots as requested

arentalb · 2025-10-04T05:17:07Z

I checked the workflows and noticed that the test failure comes from the snapshot file in the "ko" section. I didn’t modify that part when creating the pull request , it was part of the conflict resolution, and I accepted the existing changes. Now the workflow fails because of that section.

@xDivisionByZerox

matthewmayer · 2025-10-04T05:30:33Z

I checked the workflows and noticed that the test failure comes from the snapshot file in the "ko" section. I didn’t modify that part when creating the pull request , it was part of the conflict resolution, and I accepted the existing changes. Now the workflow fails because of that section.

@xDivisionByZerox

it looks like Git accidentally pulled in some changes to ko because its adjacent to ku in the snapshot test. i have reverted that.

arentalb · 2025-10-04T09:44:58Z

@matthewmayer thank you , but it still have a failed test

matthewmayer · 2025-10-04T12:32:13Z

Yes noted. That is unrelated to your changes.

xDivisionByZerox · 2025-10-04T14:30:09Z

Thank you for your contribution.

As concluded - through out the discussion in this PR - we will accept additional PRs for the locales ku_kmr_latin and ku_kmr_arab, if you are still interested in providing them.

arentalb · 2025-10-04T14:34:41Z

Thank you, @xDivisionByZerox , and thank you, @matthewmayer , for your work. I will add the other module for ku_ckb and also work on ku_kmr_latin and ku_kmr_arab.

matthewmayer · 2025-10-04T16:05:49Z

Note to avoid duplicating effort, we do have a pending PR for adding Kurmanji at #3615

xDivisionByZerox · 2025-10-04T19:40:53Z

@arentalb just FYI:
There has already been 2 PRs that add the base of one of the missing locales: ku_kmr_latin See #3629.

feat(locale): add ku_kmr_latin locale #3629

arentalb · 2025-10-04T20:14:30Z

Thank you. We realized we are friends and decided that I will work on "ckb" and he will work on both "kmr" , since he knows both "kmr" better .

…

On Sat, 4 Oct 2025 at 10:41 PM DivisionByZero ***@***.***> wrote: *xDivisionByZerox* left a comment (faker-js/faker#3441) <#3441 (comment)> @arentalb <https://github.com/arentalb> just FYI: There has already been 2 PRs that add the base of one of the missing locales: ku_kmr_latin See #3629 <#3629>. - #3629 <#3629> — Reply to this email directly, view it on GitHub <#3441 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AQ3FH5R2NXNXM43WXFS42UT3WAPFXAVCNFSM6AAAAABZGPWI6SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTGNRYGQ4TQMZYGA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

feat(locale): Add ku locale

e214844

arentalb requested a review from a team as a code owner March 17, 2025 20:49

xDivisionByZerox assigned arentalb Mar 17, 2025

xDivisionByZerox added c: feature Request for new feature p: 1-normal Nothing urgent c: locale Permutes locale definitions labels Mar 17, 2025

xDivisionByZerox requested a review from a team March 17, 2025 20:54

xDivisionByZerox added this to the vAnytime milestone Mar 17, 2025

fix(locale): Fix ku_ckb locale structure and metadata and folder name

7c3f11c

matthewmayer previously approved these changes Mar 22, 2025

View reviewed changes

xDivisionByZerox added the s: needs decision Needs team/maintainer decision label Mar 22, 2025

matthewmayer mentioned this pull request Sep 23, 2025

Add Kurdish Kurmanji (Latin and Arabic) locale support #3615

Closed

xDivisionByZerox previously approved these changes Oct 1, 2025

View reviewed changes

xDivisionByZerox removed the s: needs decision Needs team/maintainer decision label Oct 1, 2025

xDivisionByZerox modified the milestones: vAnytime, v10.x Oct 1, 2025

Merge branch 'next' into next

a06cca8

arentalb dismissed stale reviews from xDivisionByZerox and matthewmayer via a06cca8 October 3, 2025 18:22

revert accidental changes to ko during conflict resolution

70379a7

fix ordering in snapshot test

31ea85d

matthewmayer changed the title ~~feat(locale): Add ku locale~~ feat(locale): Add ku_ckb locale Oct 4, 2025

xDivisionByZerox approved these changes Oct 4, 2025

View reviewed changes

xDivisionByZerox requested a review from a team October 4, 2025 14:12

matthewmayer approved these changes Oct 4, 2025

View reviewed changes

xDivisionByZerox added this pull request to the merge queue Oct 4, 2025

xDivisionByZerox added the m: lorem Something is referring to the lorem module label Oct 4, 2025

Merged via the queue into faker-js:next with commit 9de894a Oct 4, 2025
51 of 53 checks passed

chrisbbreuer mentioned this pull request Oct 15, 2025

chore(deps): update all non-major dependencies stacksjs/ts-mocker#15

Merged

1 task

Uh oh!

feat(locale): Add ku_ckb locale #3441

feat(locale): Add ku_ckb locale #3441

Uh oh!

Conversation

arentalb commented Mar 17, 2025

Uh oh!

netlify bot commented Mar 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for fakerjs ready!

Uh oh!

codecov bot commented Mar 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

matthewmayer commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arentalb commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

matthewmayer commented Mar 19, 2025

Uh oh!

arentalb commented Mar 19, 2025

Uh oh!

matthewmayer commented Mar 20, 2025

Uh oh!

matthewmayer commented Mar 20, 2025

Uh oh!

matthewmayer commented Mar 20, 2025

Uh oh!

matthewmayer commented Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arentalb commented Mar 20, 2025

Uh oh!

arentalb commented Mar 20, 2025

Uh oh!

matthewmayer commented Mar 20, 2025

Uh oh!

Shinigami92 commented Mar 22, 2025

Uh oh!

matthewmayer commented Mar 22, 2025

Uh oh!

Shinigami92 commented Mar 22, 2025

Uh oh!

matthewmayer commented Mar 22, 2025

Uh oh!

xDivisionByZerox commented Apr 3, 2025

Uh oh!

arentalb commented Apr 3, 2025

Uh oh!

arentalb commented Apr 4, 2025

Uh oh!

xDivisionByZerox left a comment

Choose a reason for hiding this comment

Uh oh!

arentalb commented Oct 3, 2025

Uh oh!

arentalb commented Oct 4, 2025

Uh oh!

matthewmayer commented Oct 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arentalb commented Oct 4, 2025

Uh oh!

matthewmayer commented Oct 4, 2025

Uh oh!

xDivisionByZerox commented Oct 4, 2025

Uh oh!

Uh oh!

arentalb commented Oct 4, 2025

Uh oh!

matthewmayer commented Oct 4, 2025

Uh oh!

xDivisionByZerox commented Oct 4, 2025

Uh oh!

arentalb commented Oct 4, 2025 via email

Uh oh!

Reviewers

netlify bot commented Mar 17, 2025 •

edited

Loading

codecov bot commented Mar 17, 2025 •

edited

Loading

matthewmayer commented Mar 18, 2025 •

edited

Loading

arentalb commented Mar 18, 2025 •

edited

Loading

matthewmayer commented Mar 20, 2025 •

edited

Loading

matthewmayer commented Oct 4, 2025 •

edited

Loading