consider making 'ascii' the default/recommended casefolding #1718

slingamn · 2021-06-28T05:40:19Z

I'm not at all certain about this and would appreciate input from all stakeholders.

Right now, the default/recommended casefolding is 'precis', i.e. RFC 8264, allowing the use of non-ASCII characters in nicknames and channel names. However, it seems like we've given up hope of the wider IRCv3 community adopting internationalized identifiers at the protocol level; the community seems to be going in the direction of "display names" instead. (Although: I haven't seen a proposal for assigning display names to channels.)

This creates a problem for client developers, who won't have a reliable algorithm for determining whether Ergo considers two identifiers to be equivalent under case normalization. The workaround we've been pushing is #1083, i.e., always publishing the canonical form of the identifier. This approach is vulnerable to bugs and edge cases.

The other problem with PRECIS is confusable characters, which are only imperfectly addressed by the skeleton algorithm.

So the proposal here is to change the default/recommended value of casefolding from 'precis' to 'ascii'. ('precis' would remain fully supported in the codebase, especially because operators can't safely switch between casefoldings, at the risk of making account or channel registrations unusable.)

Mikaela · 2021-06-28T09:42:04Z

Does this have implications for #1441 ?

DanielOaks · 2021-06-28T10:22:09Z

I'd probably do this once we actually have a way to support display names (i.e. once Metadata is implemented which I... should be doing eventually). That also takes care of the display names for channel things, since you'd just attach the key to the channel as well.

I like the idea of core protocol identifiers being restricted to be as simple as possible (that's why I pulled the unicode identifiers spec out of v3, after all) but it is something that differentiates us from other servers in a fairly big way, so keeping that around for now until there's a recommended method we can switch to probably makes sense.

See discussion on ergochat#1718

slingamn mentioned this issue Jul 27, 2021

Implement SCRAM-SHA-256 #175

Closed

slingamn added a commit to slingamn/ergo that referenced this issue Dec 12, 2022

change default casefolding to ascii

05e5e88

See discussion on ergochat#1718

slingamn mentioned this issue Dec 12, 2022

change default casefolding to ascii #2015

Merged

slingamn added this to the v2.11 milestone Dec 12, 2022

slingamn closed this as completed Dec 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consider making 'ascii' the default/recommended casefolding #1718

consider making 'ascii' the default/recommended casefolding #1718

slingamn commented Jun 28, 2021

Mikaela commented Jun 28, 2021

DanielOaks commented Jun 28, 2021

consider making 'ascii' the default/recommended casefolding #1718

consider making 'ascii' the default/recommended casefolding #1718

Comments

slingamn commented Jun 28, 2021

Mikaela commented Jun 28, 2021

DanielOaks commented Jun 28, 2021