mars-unicode-tables

Unicode data and generated table sources for MARS-NWE/libnwcore.

This repository intentionally keeps upstream Unicode source data separate from MARS-NWE-generated output.

Layout

  • UCD/
    Unicode Character Database input files, currently imported from Unicode 17.0.0.

  • MAPPINGS/
    Unicode mapping files from https://www.unicode.org/Public/MAPPINGS/, preserving the upstream VENDORS/... hierarchy.

  • scripts/
    MARS-NWE helper scripts/generators.

  • TAB/
    Generated C table output consumed by MARS-NWE/libnwcore.

  • LICENSES/
    License notes for Unicode data and MARS-NWE-authored helper code.

Policy

Do not copy Novell NSS shared/sdk/unitables/*.tab files into this repository. They may be used only as compatibility/reference material outside the committed source data.

Unicode case/codepage tables should be generated from Unicode.org data files.

Codepage table generation

MAPPINGS/ contains the Unicode.org vendor mapping files. The codepage helper generator emits compact byte/code-to-Unicode descriptors under TAB/:

./scripts/gen_codepage_tables.py

TAB/codepageTables.c and TAB/codepageTables.h are generated from direct single-BMP-code-point mappings only. Composite mappings, directional pseudo mappings, historical DatedVersions/, and WindowsBestFit/ reverse/fallback files remain in the source tree but are not emitted into byte-to-Unicode tables.

MARS-NWE links these generated tables into libnwcore; they are not loaded as runtime .tab files.

Description
No description provided
Readme 13 MiB
Languages
C 78.6%
HTML 21.1%
Python 0.3%