template talk:Unichar#Option to only show HTML mnemonic

{{User:MiszaBot/config

|archiveheader = {{talkarchivenav}}

|maxarchivesize = 100K

|counter = 1

|minthreadsleft = 3

|minthreadstoarchive = 1

|algo = old(730d)

|archive = Template talk:Unichar/Archive %(counter)d

}}

{{WikiProject banner shell|

}}

Proposal: use [[Template:Char]]

Would it be good to place the character itself in {{tl|char}}? jlwoodwa (talk) 06:43, 9 July 2023 (UTC)

:Although generally keen on char, I'd need to be convinced in this case. Char is used to "isolate" a glyph under discussion from the associated running text. In the output of unichar, that is usually clear.

:The only argument in favour that I can see is that, at present, unichar identifies the glyph by increasing its size and maybe the faint box used by char would be better? But conversely magnification makes it easier to "read".

:Did you have a particular case that provoked the proposal? 𝕁𝕄𝔽 (talk) 07:51, 9 July 2023 (UTC)

::It's clear to anyone who's familiar with the format, but I'm not sure it's as clear to a general reader, especially one who doesn't know what the "U+ stuff" means. I haven't noticed any specific problems that this would solve, I just think it's good to have a consistent format for "inline character literals" on Wikipedia. jlwoodwa (talk) 08:19, 9 July 2023 (UTC)

:::So how would we handle this example: {{unichar|20E0|Combining Enclosing Circle Backslash}} (which is already not handled terribly well). Likewise, Asiatic scripts present issues that don't occur to those of us only familiar with alphabetic scripts. A lot of development work has gone into this template to deal with these issues so changing it would not be trivial, given the need to verify many many test cases and rewrite to resolve anomalies. Annoyingly, one of the recent main developers, user:DePiep, is no longer available to advise. --𝕁𝕄𝔽 (talk) 10:20, 9 July 2023 (UTC)

::::{{char|⃠}} seems to work just fine. I understand the difficulty of modifying such a convoluted and widely-used template, though. Since it sounds like it's not {{em|obviously}} a bad idea, I'll try the "obvious implementation" in the sandbox, and give an update here when it's working. jlwoodwa (talk) 10:35, 9 July 2023 (UTC)

:::::on Chrome, the symbol overruns the box (or the box underruns)... 𝕁𝕄𝔽 (talk) 13:43, 9 July 2023 (UTC)

::::::... but then again it overruns the last digit of the codepoint right now. --𝕁𝕄𝔽 (talk) 13:45, 9 July 2023 (UTC)

:I'm 2 years late, but what about allowing {{para|use|char}} so people can opt into this instead of changing the default behaviour? I don't think U+20E0 should be a blocker for this since it's already very broken in current unichar and in my opinion at least {{unichar/sandbox|20E0|use=char}} -> {{unichar/sandbox|20E0|use=char}} looks better than {{unichar|20E0}} even if it is still broken. Warudo (talk) 21:30, 19 June 2025 (UTC)

::(I forgot to mention that I added the feature to the sandbox for this discussion) Warudo (talk) 21:32, 19 June 2025 (UTC)

Combining diacritics are displaying as tofu on Android - fault may be in cwith= handling?

I don't know if this is new? The argument {{code|1= cwith=◌}} or {{code|1= cwith=◌}} is used heavily to display combining diacritics. I'm editing in Android right now and the symbol displays correctly. But in articles like diacritic, it is has more tofu than a Japanese restaurant. Is there a {{code|style serif}} somewhere that is blocking the last resort substitution? --𝕁𝕄𝔽 (talk) 13:22, 21 September 2023 (UTC)

:No, it is not unique to Unichar, that just happens to be where it first saw it. Diacritic doesn't even use unichar, it just uses a dotted circle and combining diacritic directly, thus {{angbr|◌́}}. As it is a general problem, I will take it to Wikipedia:Village pump (technical). --𝕁𝕄𝔽 (talk) 13:37, 21 September 2023 (UTC)

::~~No solution suggested,~~ it is an implementation defect in Android. ~~So unless someone has a back-channel to Google, we just have to grin and bear it.~~ --𝕁𝕄𝔽 (talk) 16:33, 22 September 2023 (UTC)

:::Further discussion has revealed that the problem is due to deficiency in the system default sans-serif font. The workaround is to use serif and I have started to do that with success on "freestanding" cases. But {{tl|unichar}} is heavily used so we really need a fix to it, please? --𝕁𝕄𝔽 (talk) 16:32, 23 September 2023 (UTC)

=Template enhancement needed, please=

Requirement: when {{code|1= cwith=◌}} is invoked, wrap the output in {{1}}, where {{1}} is the sequence {{code|dotted circle + combining diacritic}}. Is there a doctor in the house? --𝕁𝕄𝔽 (talk) 16:32, 23 September 2023 (UTC)

:Will this work {{unichar |0301 |combining acute accent |cwith=◌|use=script|use2=serif}} {{unichar |0301 |combining acute accent |cwith=◌|use=script|use2=serif}}, I just extended the template to support "serif" as a use2 param if you set use as "script." This might work also. {{unichar |0301 |combining acute accent |cwith=◌|use=script|use2=noto}} {{unichar |0301 |combining acute accent |cwith=◌|use=script|use2=noto}} Andre🚐 20:03, 23 September 2023 (UTC)

::Yes, that would work. I hate to be ungrateful but to employ that solution would create a lot of work, many many articles would to be updated to use it{{snd}} and, when Google discards Roboto as default sans font, would all have to be undone again. AFIK, this is the only use-case for {{code|1=cwith=◌}} so it would not have any deleterious effect elsewhere (and would be easy to back out). [BTW, we couldn't have {{code|1=use2=noto}} because it would break Bing and Safari.] --𝕁𝕄𝔽 (talk) 22:11, 23 September 2023 (UTC)

:::Ok, I made the change to Template:Unichar/glyph, but let me know if it doesn't look right and I'll revert it. Andre🚐 23:51, 23 September 2023 (UTC)

Misaligned diacritics

Can anyone explain (better still fix) this phenomenon:

{{unichar|0360|Combining double tilde|cwith=◌◌}} , a tilde diacritic that spans a pair of adjacent characters: {{char|◌͠◌}} no markup: ◌͠◌

Just using the characters directly puts the diacritic in the right place but unichar fails (placement is offset). (At least when using Chrome on Chromebook).

{{unichar|0301|Combining acute accent |cwith=◌}} is ok. ◌́

𝕁𝕄𝔽 (talk) 16:42, 22 September 2023 (UTC)

:|cwith=◌◌ puts the dotted circles before the diacritic, but the diacritic is supposed to be between them. I don't know how it should be fixed though. — Eru·tuon 19:21, 22 September 2023 (UTC)

::Ah, of course. Obvious really. There are very few of these two-character diacritics so I don't really see it being worth anyone's while hacking the template to fix it. I'll just add a note to the documentation to say it doesn't work, handcrafting is required. --𝕁𝕄𝔽 (talk) 19:37, 22 September 2023 (UTC)

:::I have added this text. It is not quite right, the display of the U+0360 is not exactly as produced by the template but does it matter?

:::{{tpq|1=** Note that {{code|1=cwith=◌◌}} does not provide the desired result if the intention is to display a diacritic that spans two characters (such as those in the range U+035C to U+0362): the diacritic will be offset. In such cases, editors must emulate the template output by hand, because the correct HTML sequence is "first-character + combining-diacritic + second-character". Thus, for example, to show the combining double tilde U+0360, write {{code|1= U+0360 ◌͠◌}} then (in {{tl|small}}), COMBINING DOUBLE TILDE. This produces U+0360 ◌͠◌ {{small|COMBINING DOUBLE TILDE}}. }}

:::Comments (better still, direct edits to improve) welcome. --𝕁𝕄𝔽 (talk) 20:24, 22 September 2023 (UTC)

::::Really this needs a "print this instead" for the character. All this size/font/cwith stuff could be put into that instead of trying to fool the automatic text generator into producing the desired result. Spitzak (talk) 21:50, 23 September 2023 (UTC)

:::::Sorry, I don't follow. Rather than spend time explaining, would you write the alternative text please? Here or in the doc. --𝕁𝕄𝔽 (talk) 22:14, 23 September 2023 (UTC)

::::::I meant that there could be a parameter, perhaps show, so that if invoked with show=foobar then instead of showing the character it shows "foobar". This could then contain any wiki or html markup desired and any trick needed to get the character to be correctly visible. In this example it would contain the two circles and the combining diacritic. Spitzak (talk) 00:08, 28 December 2023 (UTC)

:::::::I think it does have a param does something similar, or it did 3 months ago. Andre🚐 00:15, 28 December 2023 (UTC)

::::Hmm, a double parameter could be introduced to change the order of the output. Andre🚐 19:50, 24 September 2023 (UTC)

Question on Error on off-Wiki

I've copied all the relating templates and modules to our wiki, and I've checked them a few times over, but it keeps giving me the following error:

:I wrote:

:└> "The character {{unichar|a9|COPYRIGHT SIGN}} is about intellectual property."

:It should write:

:└> "The character {{unichar|a9|COPYRIGHT SIGN}} is about intellectual property."

:but gives me:

:└> "The character {{red|1=Error using {{}}unichar{{red|1=}}: Input "a9" is not a Hexadecimal value.}} is about intellectual property."

I don't understand why it does this. Not sure if I should ask this here or somewhere else, but thought to try it here first. Kind regards, Rodejong 💬 ✉️ 23:15, 18 December 2023 (UTC)

:That is a charset encoding issue probably. Or something to do with your wiki's installation of php. {{unichar|a9|COPYRIGHT SIGN}} works fine here, as you can see. Andre🚐 00:16, 28 December 2023 (UTC)

::Thanks for answering. I'll ask the hosting guys to look in to that then. Kind regards, Rodejong 💬 ✉️ 00:53, 28 December 2023 (UTC)

Enhancement request: sanity check or lazy invocation

At Copyright sign, a vandal changed {{unichar|25|Percent sign|html=}} to {{unichar|26|Percent sign|html=}}. No error was generated, though inspection shows that the name doesn't match the new, wrong, glyph. The template really should do a sanity check that the name actually matches the code-point and display an error status if not. For familiar glyphs like % and &, it is obvious but not if it is a j

Better still, don't ask for any text, indeed ignore any provided. A simple {{unichar|25}} should fetch the official name and not expect editors to do make-work.

Is there a template doctor in the house? 𝕁𝕄𝔽 (talk) 19:53, 2 April 2024 (UTC)

:It seems this has fallen through the cracks. I'm going to see if I can wrangle a modification to this template that will simply allow one to print the canonical Unicode name for a given code point. I would prefer it being the default or {{em|only}} behavior, but I am curious is this would be a problem for anyone. Remsense诉 12:58, 5 April 2024 (UTC)

::To my mind, anything but the canonical name is at best finger trouble. The family {{code|1= nlink=}} is there when the WP:common name and the canonical name don't match. As in {{unichar|005E|circumflex accent|nlink=carat}} ({{unichar|005E|circumflex accent|nlink=carat}} 𝕁𝕄𝔽 (talk) 18:00, 5 April 2024 (UTC)

:::The issue being, it seems we need a data module of 150k entries that the module has to be searched every time—if we want to prevent vandalism, anyway—and that's about three orders of magnitude more entries than I've seen a module on here work with, so I am worried by the potential server load. Remsense诉 18:16, 5 April 2024 (UTC)

::::Maybe WP:village pump/technical could advise? But it is not really a search when you already have the index and just want to fetch the record that matches that index. 𝕁𝕄𝔽 (talk) 18:24, 5 April 2024 (UTC)

:::::Doy, you're completely right on the latter point. Had the current flowing the wrong way in my brain there. I'll poke the pump. Remsense诉 18:27, 5 April 2024 (UTC)

:Well, that was easy!!!!!!!!!!!!!! {{tlx|Unichar/sandbox}} seems to work perfectly well. Thank you so much @Cryptic for lending some lost, cold, and confused lexicographers a helping {{unichar/sandbox|2F3F}} Remsense诉 21:03, 5 April 2024 (UTC)

The sooner we can put this live, the better. There's a lot of it about! (Kudos to {{u|Nickps}} for spotting [https://en.wikipedia.org/w/index.php?title=Hyphen-minus&curid=2734201&diff=1217645792&oldid=1217645552 this one] in such a high-profile article but such basic stuff should't depend on eagle eyes to keep clean.) --𝕁𝕄𝔽 (talk) 10:33, 7 April 2024 (UTC)

:I am not sure of a particular reason why it can't, I just didn't want to be rash about doing so. It's not like it was a particularly technical change, if you'd like to do the honors? Remsense诉 10:38, 7 April 2024 (UTC)

::I'm happy to be the one to do it but you'll have to tell me how. 𝕁𝕄𝔽 (talk) 12:42, 7 April 2024 (UTC)

:::Oh! Apologies for assuming everyone else is the one I should be asking how to do things. I've done it. Remsense诉 12:54, 7 April 2024 (UTC)

:::The template should certainly ignore the text given but maybe we should start with a green warning to say that the template has done so. One like the error message you get if you accidently type {{code|1= firdt=John}} in a CS1/2 citation. We could do it silently and let those who have been taking advantage of the failure to check come and read the (to be revised) documentation which will tell them that the free text field is no more. 𝕁𝕄𝔽 (talk) 12:55, 7 April 2024 (UTC)

::::Yes I can do that also, great idea. Remsense诉 12:57, 7 April 2024 (UTC)

:::::Revising the doc, I noticed that calling the template with no text generated just omitted it. I can't see why anyone would want to do that but we had best add a {{code |1=name=none}} option? 𝕁𝕄𝔽 (talk) 13:10, 7 April 2024 (UTC)

::::::I think it's nice to have just because I often am too lazy to tab to a template's documentation so I try all the things ({{code|{{=}}none}}? could it be {{code|{{=}}false}}? how about {{code|{{=}}no}}? Surely it will no longer confound me if I try {{code|{{=}}""}}—there we go!) Remsense诉 13:13, 7 April 2024 (UTC)

:::::::Well we could just cheat and regard any input to {{code |1= name=}} as an instruction to omit. Who is ever going to use if to mean yes. --𝕁𝕄𝔽 (talk) 13:38, 7 April 2024 (UTC)

::::::::This is usually the pragmatist's move with a binary parameter. I swear there's a thing that lets you check all the ways a user wants to say no or yes to something. Remsense诉 14:09, 7 April 2024 (UTC)

:I probably don't deserve praise for that one considering I'm the one who made the mistake in the first place [https://en.wikipedia.org/w/index.php?title=Hyphen-minus&diff=prev&oldid=1217645552&diffonly=1] but thanks, I guess. Nickps (talk) 11:06, 7 April 2024 (UTC)

::Of course you do! It's never too late to make things right. Remsense诉 11:08, 7 April 2024 (UTC)

=Override option needed=

See

{{blockquote|In Unicode, the majuscule Ƣ is encoded in the Latin Extended-B block at U+01A2 and the minuscule ƣ is encoded at U+01A3.{{cite web|url=https://www.unicode.org/charts/PDF/U0180.pdf|title=Unicode chart}} The assigned names, "LATIN CAPITAL LETTER OI" and "LATIN SMALL LETTER OI" respectively, are acknowledged by the Unicode Consortium to be mistakes, as gha is unrelated to the letters O and I.{{cite web|url=http://unicode.org/notes/tn27/|title=Unicode Technical Note #27: Known Anomalies in Unicode Character Names}} The Unicode Consortium therefore has provided the character name aliases "LATIN CAPITAL LETTER GHA" and "LATIN SMALL LETTER GHA".}}

Right now, we have

{{unichar|01A2}}

We need a {{code |1=alias= }} as in {{code |1=alias=LATIN CAPITAL LETTER GHA}} , as suggested by {{u|Chatul}} at the Village Pump. There are a very few such cases where an error was made in the original standard that will never be changed. --𝕁𝕄𝔽 (talk) 13:49, 7 April 2024 (UTC)

:Will start this right now alongside the other thing. Remsense诉 14:10, 7 April 2024 (UTC)

::I think it would be ok for arg 1 to continue to work. Instead find all the invocations of this template and remove arg 1 unless it is actually necessary.Spitzak (talk) 19:07, 8 April 2024 (UTC)

:::In principle, you are absolutely right{{snd}} but in practice that would be a huge task, wildly out of proportion to the tiny number of cases where the Unicode Consortium admits it made an error. This is the most practicable solution to this specific problem. Meanwhile, ignoring the supplied 2= in favour of the canonical text resolves immediately the rather more cases of spelling errors and vandalism. --𝕁𝕄𝔽 (talk) 20:25, 8 April 2024 (UTC)

=Temporary reversion needed=

{{Ping|Remsense}} we forgot the many instances of uses like this: {{unichar|2120|Service mark|nlink=} which now fail {{unichar|2120|Service mark|nlink=}} because there is no such article as SERVICE MARK. Do'oh! --𝕁𝕄𝔽 (talk) 21:23, 8 April 2024 (UTC)

:Revert done: I'm working on the aliases as we speak also Remsense诉 21:26, 8 April 2024 (UTC)

::Which is now working:

::{{tlx|Unichar/sandbox|1A2}} → {{Unichar/sandbox|1A2}}

::{{tlx|Unichar/sandbox|1A2|alias{{=}}yesgivemethealias}} → {{Unichar/sandbox|1A2|alias=yesgivemethealias}}

::What should we do about this? It does say such use of {{para|nlink}} is deprecated. Should we clean it all up somehow? Remsense诉 21:38, 8 April 2024 (UTC)

:::I have seen a lot of {{code |1=nlink=}}, indeed I confess to have been a major perpetrator{{snd}} "monkey see monkey do". It works (worked) and there was (is?) no error message to say {{red|1=No data supplied with nlink=, ignored}}. So we need ...

:::first: a list of articles that use nlink= with no data, so that someone (aka me, since I know many of them are my fault) can go round and correct them. [I believe that the template already has such an exceptions report, though whether anyone has been checking since {{u|DePiep}} got canned must be doubtful.) Then we can reinstate the change.

:::second, add some code to say (for all the optional parameters), {{red|1=No data supplied with =, ignored}}

:::PS sorry to have dropped the bombshell and not been around until now to help with the cleanup; officially I was otherwise engaged and shouldn't have been in a position to spot the error. --𝕁𝕄𝔽 (talk) 23:01, 8 April 2024 (UTC)

::::My "first" wouldn't be needed if the current interception of {{code|1=nlink=}} were changed so that it linked to the U+XXXX or the target character rather than some name? Which adds support to the question of "do we even need nlink= ?". --𝕁𝕄𝔽 (talk) 23:58, 8 April 2024 (UTC)

:::::Don't apologize at all! Nothing about this is particularly burdensome. I am leaning towards linking to the character itself, are there cases where this is going to break? Remsense诉 00:03, 9 April 2024 (UTC)

::::So, do you think directly linking to the character itself is the best move? That's where I am presently unless there are edge cases (e.g. I can think of high-range code points and non-printable ones, and maybe we can define those manually). Remsense诉 02:26, 9 April 2024 (UTC)

:::::yes, see below. 𝕁𝕄𝔽 (talk) 08:30, 9 April 2024 (UTC)

:The {{para|nlink}} default is now also working:

:{{tlx|Unichar/sandbox|1A2|alias{{=}}yes|nlink{{=}}}} → {{Unichar/sandbox|1A2|alias=yes|nlink=}} Remsense诉 13:47, 9 April 2024 (UTC)

Do we even need nlink=

:::Say: we have a lot of technical redirects, why can't we just add U+XXXX as redirect format to a given page? Remsense诉 21:43, 8 April 2024 (UTC)

::::As in, U+2120 now redirects to Service mark symbol, as already did ℠. This seems like a pre-solved problem. Remsense诉 21:49, 8 April 2024 (UTC)

:::::It looks to be a neat solution. The only catch that I can see is that these U+XXXX aren't well watched and may be subject to vandalism. It is not an obvious vector for a "bad actor" so I guess it is a reasonable risk. The problem is that the attack won't be obvious and someone following a link to a Gardiner's sign list entity will have no idea how it happened. --𝕁𝕄𝔽 (talk) 23:01, 8 April 2024 (UTC)

::::::Are there any cases of nlink=target-name#section-name? I can't think why there would but if it is possible (as it is), someone somewhere will have done it. --𝕁𝕄𝔽 (talk) 23:58, 8 April 2024 (UTC)

:::::::I would say if necessary, the redirect page itself can link to a given section, if I'm understanding properly? Remsense诉 00:04, 9 April 2024 (UTC)

::::::::Yes, that makes sense. I can't see any other reasonable possibility. 𝕁𝕄𝔽 (talk) 07:43, 9 April 2024 (UTC)

:::::::::Though there are cases where the nlink goes to a broad concept article (such as Gardiner's sign list) when there is no specific article. So {{code|1=nlink=}} is certainly valid and useful.

:::::::::So to solve the current problem, we just need to change the behaviour of {{code |1=nlink=}} so that it links to the target character article rather than its Unicode name. As you proposed already, I think? But we can't dispense with nlink= completely and just link everything willy-nilly since many codepoints (e.g., Chinese characters) don't have their own articles. --𝕁𝕄𝔽 (talk) 08:11, 9 April 2024 (UTC)

Testcases

:::::As a template editor, I find it helpful, when people point out exceptions and cases like this, to put them in the testcases page so that future editors do not have to remember them. – Jonesey95 (talk) 21:52, 8 April 2024 (UTC)

::::::Which testcases? I'm planning on ensuring there's an adequate library of them there once I'm done with this round of updates. Remsense诉 21:54, 8 April 2024 (UTC)

:Per above...is there actually a purpose to being able to set a custom link rather than create easter eggs? I say we just have it link in most cases to Ƣ i.e. the page for the character itself most of the time. Remsense诉 21:57, 8 April 2024 (UTC)

=Almost there=

Great to see it working again, thank you. Just one left on the to-do list, I think?

{{code|1= name=none}} so that {{unichar|0123|name=none}} produces just plain {{tq|U+0123 ģ}}

I need to document {{code|1= alias=yes}}: I will copy Unicode#Alias. --𝕁𝕄𝔽 (talk) 14:48, 9 April 2024 (UTC)

:And there you are: {{tlx|Unichar|1A2|alias{{=}}yes|name{{=}}none}} → {{Unichar|1A2|alias=yes|name=none}} Remsense诉 15:15, 9 April 2024 (UTC)

:It looks a lot like the use of the alias can be automatic, by just checking the alias database and using it instead of the real one if there is an entry. Is there a reason you did not do this? Spitzak (talk) 09:44, 10 April 2024 (UTC)

=Anomalies=

Problems as I discover them

{{unichar|002E|Full stop|nlink=}} ({{unichar|002E|Full stop|nlink=}}) misbehaving. OTOH, {{unichar|002E|nlink=Full stop}} behaves as it should. --𝕁𝕄𝔽 (talk) 19:42, 9 April 2024 (UTC)

:Knew I should've just looked at the page that definitely exists where they tell me what characters can't be used as article titles. Remsense诉 19:44, 9 April 2024 (UTC)

::Some you win, some you lose. I just came back to say it must be something to do with that character because these work:

::{{unichar|002A|Asterisk| nlink= }}, {{unichar|0023|Number sign |nlink= }} --𝕁𝕄𝔽 (talk) 20:01, 9 April 2024 (UTC)

=Refs=

Cwith= and non-latin script

The Nepalese rupee sign, {{char|रू}} uses the combining diacritic technique of

{{unichar|0930}} + {{unichar|0942}}.

Unfortunately, {{unichar|0930|cwith=ू}} produces

{{unichar|0930|cwith=ू|note=A dog's breakfast}}.

Can anyone fix? 𝕁𝕄𝔽 (talk) 16:18, 21 April 2024 (UTC)

:I see that it is also a problem with latin script. In the example of "q with circumflex" below, the template fails to align the circumflex correctly over the q. --𝕁𝕄𝔽 (talk) 18:52, 21 April 2024 (UTC)

::The cwith character is printed first. Also you should not try to use this to show a character that is not a single code point. Spitzak (talk) 08:03, 22 April 2024 (UTC)

:::Ah yes, of course. The general solution is your response to the next question. 𝕁𝕄𝔽 (talk) 08:24, 22 April 2024 (UTC)

cwith handling generally

Suppose that somewhere there exist a letter q with circumflex, q̂. Before we enhanced the template to assert the canonical name (and only the canonical name), it was possible to write {{unichar|0071|cwith=̂|Latin small letter q with circumflex}} and get {{red|U+0071 q̂ {{sc|Latin small letter q with circumflex}}}}. Which of course was false: U+0071 is a common or garden q. The new arrangement is questionably better, producing {{red|{{unichar|0071|cwith=̂|Latin small letter q with circumflex}}}}, which is a different kind of lie: the grapheme shown is not U+0071 and it is not (just) a Latin small letter q.

So I would like to propose that, when {{code|1= cwith=}}, we expose that fact in the description.

Thus, for example, {{unichar|0071|cwith=̂}} should produce {{green|{{unichar|0071}} with {{unichar|0302}} : q̂}}

Comments? 𝕁𝕄𝔽 (talk) 18:50, 21 April 2024 (UTC)

:Cwith should be limited to only the dotted circle.

:I do think the should be a simple "print this instead" argument to replace all the size, font, IMG, and cwith stuff. Spitzak (talk) 08:07, 22 April 2024 (UTC)

::Yes, I agree that the dotted circle should be the only valid option. Perhaps way back in the early developments, it also supported a coloured block to show the various forms of space character? These are now hardcoded but I guess there are too many combining diacritics to do the same here too.

::I will revise the documentation accordingly.

::As for all the other bells and whistles, it would take a full search of existing usage to determine where and why they are used. That is not a trivial task. 𝕁𝕄𝔽 (talk) 08:34, 22 April 2024 (UTC)

:::I have revised the documentation to formally restrict the base character to ◌ and to deprecate any other usage. Please review.

:::When someone has time to revise the template, can this restriction be enforced, please? --𝕁𝕄𝔽 (talk) 10:27, 22 April 2024 (UTC)

:{{unichar|0302|cwith=q}} produces {{unichar|0302|cwith=q}}. Spitzak (talk) 10:28, 22 April 2024 (UTC)

::True, but should it? As per your earlier comment (with which I agree), the template should only produce real code points. --𝕁𝕄𝔽 (talk) 16:27, 23 April 2024 (UTC)

=More detailed request for development =

The only legitimate character to use to display a combining diacritic is the dotted circle. So I propose that

{{code |1=cwith=}} is redefined to mean "circle with".
The preferred syntax is {{code |1=cwith=yes}}
{{code |1=cwith=◌}} and {{code |1=cwith=◌}} are accepted alternatives.
Any other argument is flagged as an error.

Is that reasonable? --𝕁𝕄𝔽 (talk) 16:27, 23 April 2024 (UTC)

:Is it possible to determine it is combining from the unicode info database? If so maybe just ignore the field entirely and use that. Spitzak (talk) 07:15, 25 April 2024 (UTC)

::Do we know how/whether that would work with non-Western scripts? Interestingly (at least on ChromeOS), this Devangari combiner comes with dotted circle out of the box: {{nobr|{{unichar|0942}}}}. I don't know how typical that is. --𝕁𝕄𝔽 (talk) 17:10, 26 April 2024 (UTC)

Fixing nlink= for [[WP:FORBIDDEN]] characters

The docs say that {{para|nlink}} with no argument is deprecated but in my opinion it is a useful feature that we should try to support. The problematic characters are easy to fix simply by linking to the names instead of the characters. I have already written how this can be done in the sandbox ({{compare pages|Template:Unichar/name|Template:Unichar/name/sandbox|the diff}}). The only problem with the way its currently done is that I have to special case the underscore because low line is a disambiguation page. I don't like hardcoding things like that, but I don't think anyone plans to move underscore any time soon so it should be fine. Nickps (talk) 14:26, 14 June 2024 (UTC)

:It was only deprecated because it is a bit of a bear trap. Not every Unicode canonical name has a matching article, I think? And just because an article of that name exists, does it necessarily relate to the character.

:{{ping|Remsense}}, can you remember what the complications were? 𝕁𝕄𝔽 (talk) 19:00, 14 June 2024 (UTC)

::{{para|nlink}} does not link to the canonical name by default. It links to the character itself. See {{unichar/sandbox|32|nlink=}}->{{unichar/sandbox|32|nlink=}} for an example (digit two does not exist, 2 obviously does). ~~My proposal is that the name should be linked if and only if the character is not allowed in a title.~~ Nickps (talk) 20:05, 14 June 2024 (UTC)

:::To actually explain what my change is, if the character is any of # < > [ ] { } | : _ which are the characters not allowed in titles, then I link to the name (except low line which is disambiguated to underscore), otherwise, nothing changes. Nickps (talk) 20:29, 14 June 2024 (UTC)

::::Rereading the discussions about the last big change, it does seem to be the case that it was just these forbidden characters that caused the barf (specific example was full stop). Your proposed revision resolves that problem and seems lightweight enough not to cause any problems.

::::As this is such a high profile template, best we give it a week for any other editor to raise any red flag issues. --𝕁𝕄𝔽 (talk) 07:56, 15 June 2024 (UTC)

:::::Ok, that makes sense, you never know how these things can break. I also need to write testcases anyway, so there's no rush to merge. Nickps (talk) 09:01, 15 June 2024 (UTC)

::::::This makes perfect sense. I cannot figure out why a huge change to make it not use a user-defined name was somehow accompanied by a change that forced a user defined name for the link. I would implement this ASAP as somebody is busy adding text to the nlink in every instance, which is backwards. Spitzak (talk) 14:19, 15 June 2024 (UTC)

:::::::@Spitzak I'd suggest you ask them to stop their edits and comment here. I want to undeprecate the empty nlink parameter but apparently this editor disagrees and should be given a chance to explain their reasons. Nickps (talk) 14:50, 15 June 2024 (UTC)

:::::::Did you mean me? After the big change (when we discovered the anomaly that Nickps is now fixing), I certainly went round clearing nlink=nothing because of not knowing the full extent of the problem. That was a month ago. Has someone else resumed? 𝕁𝕄𝔽 (talk) 18:06, 15 June 2024 (UTC)

What's with [[TM:Unichar/hexformat/sandbox/doc]]?

Now, to be clear, that page used to be at TM:Unichar/sandbox/doc but since it was only used by {{Tl|Unichar/hexformat/sandbox}}, I moved it to its current title. Still, I can't understand the purpose of that page. To me it looks more like a bunch of notes for personal use rather than a documentation page. Does anyone have any idea what it's supposed to say or should it just go to TfD? Nickps (talk) 01:07, 25 June 2024 (UTC)

:{{rto|Nickps}} It looks like a bunch of test cases created by {{u|DePiep}} for regression testing. Since no-one has spoken up it its defence by now, off with its head. 𝕁𝕄𝔽 (talk) 16:22, 16 August 2024 (UTC)

Make |cwith=| a valid option, to save us having to dig out a dotted circle every time?

Since, as documented, the only valid parameter for {{code|1= cwith=}} is the dotted circle, can anyone see a reason to demand the parameter in for first place? Surely we can just have |cwith=| (a null parameter) as a valid option, with the dotted circle being supplied automatically. 𝕁𝕄𝔽 (talk) 16:26, 16 August 2024 (UTC)

:I think it should also be possible to automatically add the dotted circle if the unicode attributes indicates the character is combining, so no cwith is needed at all.

:If it is wrong, I really recommend an attribute be added that is the "print this instead" attribute. It can contain any markup wanted, and would replace all the stuff to set the font and size and cwith, and the image option, and so on. Spitzak (talk) 17:33, 16 August 2024 (UTC)

::Yes, first para makes sense, I agree.

::Sorry, I don't understand your second paragraph, could you expand? 𝕁𝕄𝔽 (talk) 22:12, 16 August 2024 (UTC)

:::I think most of the current parameters could be replaced with a single optional parameter. If that parameter is given, it's value is used to show the character. This would get rid of the need for the image and a lot of other controls for messing with the font. Popular substitutions could eventually be put in the template itself. Spitzak (talk) 03:02, 17 August 2024 (UTC)

::::But the only character we ever want to show is the canonical glyph and canonical name? (with the sole exception of combining diacritics which need the support of a dotted circle for clarity) [Caution: many Devangari diacritics come with the dotted circle 'as standard'.] I'm still not following you.

::::Or do you mean a option to use serif rather than the default sans, since some glyphs are difficult to "read" without the hinting supplied by serif.

::::Or am I still missing your point? (Though if is that there is surfeit of bells and whistles that are never used and should go, I agree "subject to survey". 𝕁𝕄𝔽 (talk) 15:43, 17 August 2024 (UTC)

:::::I want an option that if set to "BLAH" will make it print "BLAH" instead of attempting to print the character. Spitzak (talk) 17:52, 4 September 2024 (UTC)

::::::I think you really need to give an example. I assume you don't mean anything horrible like getting {{unichar|005E}} to display U+005E ^ {{sc|caret sign}}? --𝕁𝕄𝔽 (talk) 18:32, 4 September 2024 (UTC)

:::::::Assuming the new field is called "as", I propose that {{unichar|0040|as="FooBar"}} display as {{tt|U+0040}} FooBar {{sc|COMMERCIAL AT}} instead of {{unichar|0040}} Spitzak (talk) 18:39, 4 September 2024 (UTC)

Width bug

Recently this has been adding a lot of whitespace at the end of the small-caps name. Most obvious if the link is enabled as the underscore is also extended under this whitespace. Spitzak (talk) 17:53, 4 September 2024 (UTC)

:This may be Safari-only. Seems to work on Chrome on Linux Spitzak (talk) 22:57, 4 September 2024 (UTC)

More flexibility in parameter 1

Occasionally, I'd like to use the unicode character itself as the parameter. For instance, for 🎴, I'd like {{unichar|🎴}} to produce {{unichar|1F3B4}}. This would occasionally save me a short but slightly tedious round trip looking up the character code of a character I already have but I don't have the code of, and having the computer do this mapping for me seems quite doable using software (I don't know much about Wikipedia templates, though). Single characters 0-F/f can be exempt from this, of course, if their capacity to represent single-digit hexadecimal numbers from 0 to 15 is still important (although maybe it isn't, since most people write those like 000F or 0F anyway?).

While looking into this, I was reminded that the unichar template doesn't let you add the U+ prefix to the code in parameter 1. So, for instance, U+1F3B4 is an error. Apparently this is a common error for people to make, so maybe it should be detected and the U+ prefix should simply be stripped internally? Dingolover6969 (talk) 07:02, 19 October 2024 (UTC)

:I'm not sure how a reverse lookup like that could be easily accomplished in a Wikipedia template. It seems like something that ought to be possible since the computer obviously has this information, but I don't think you have access to the table that you'd need to do that. The best idea that comes to mind would be to generate a magic template list with a script or bot of some kind that hardcodes the table and then look it up from that. Andre🚐 07:08, 19 October 2024 (UTC)

::This is really easy to do with Lua modules. {{ml|ustring|codepoint|\🎴}} -> {{#invoke:ustring|codepoint|\🎴}} converts a unicode character to its corresponding code point. The problem is that adding support for this introduces ambiguity. Consider {{tlp|unichar|7}}. Should it return {{unichar|7}} or {{unichar|37}}? For this reason I oppose adding support for this feature. Instead, we should make a {{tl|unichar2}} that accepts only unicode characters as parameters. Nickps (talk) 13:26, 21 February 2025 (UTC)

:::I missed that Dingo already addressed this in the opening comment. I think that having 7 and 07 behave differently is unnecessarily confusing, so I've gone ahead and made {{tl|unichar2}}. {{tlp|unichar2|🎴}} -> {{unichar2|🎴}} works as specified. Nickps (talk) 15:19, 21 February 2025 (UTC)

::::Oh, nice work! Didn't know about that Lua module. Lua is awesome. There are definitely some templates that we did the old way that could be improved. Andre🚐 19:51, 21 February 2025 (UTC)

:::::Thanks! Yes, Lua is pretty useful, especially for stuff like this. From what I've seen, the two main reasons we don't use it more is because 1) not many people know Lua (I don't either) and 2) for some reason people really don't like calling modules from mainspace. Everything has to be a template.

:::::@Dingolover6969 Does this work for you? I'm pretty sure {{tl|unichar2}} solves the problem you're having. Nickps (talk) 22:42, 21 February 2025 (UTC)

::::::Wow, very cool, thank you Nickps! I reckon I will find that template quite useful. Dingolover6969 (talk) 15:22, 22 February 2025 (UTC)

:::::::You're welcome. I'm glad I could help. Nickps (talk) 22:07, 22 February 2025 (UTC)

Can anyone see why combining characters have strange redirects?

{{unichar|0302|nlink=}}
{{unichar|0303|nlink=}}

There are redirect articles (created just now) for Combining circumflex accent and Combining tilde. Clicking on the nlinked names above will not take you to either of those redirects. 0302 takes you to the top of Circumflex, 0303 takes you to Nasal vowel, which is not the only use of the diacritic. In each case, the top of the article shows "Redirected from ": normally I would choose this to find the redirecting article and correct it, but that doesn't seem to be possible. (There is actually a redirect article at {{char|̃}} (that's an orphaned combining tilde in there).

So two questions:

Why is the content of the {{code |1= nlink=}} being ignored in favour of an article named for the character itself? (Compare with {{unichar|0023|nlink=pound sign}} still gets you {{unichar|0023|nlink=pound sign}}, no ifs not buts. [So this behaviour may be a relic of earlier error handling?])
How can the erroneous redirects be corrected? (because {{tl|unichar}} is not the only way to reach them.

Any ideas? (I assume that these two cases are not unique.) 𝕁𝕄𝔽 (talk) 23:01, 17 February 2025 (UTC)

: What did you put as the nlink content? As I edit this wikipedia page, it's telling me that the source code is * {{unichar|0302|nlink=}} * {{unichar|0303|nlink=}}, which produces {{unichar|0302|nlink=}} {{unichar|0303|nlink=}}, which link to what you describe (as expected based on my reading of the documentation). So, that's weird, if you put something else in (as I think your comment implies). I'll try including {{unichar|0302|nlink=Combining circumflex accent}} {{unichar|0303|nlink=Combining tilde}} in this comment to see if they work or if the same bug(?) affects them. {{unichar|0302|nlink=Combining circumflex accent}} {{unichar|0303|nlink=Combining tilde}}.

: With regards to the erroneous redirects, I'm not sure what everything should redirect to, but https://en.wikipedia.org/w/index.php?title=%CC%83&redirect=no and https://en.wikipedia.org/w/index.php?title=%CC%84&redirect=no get me to the relevant redirect articles, which seem like they can be then edited normally. I got to these by going to another redirect page and then pasting the unicode character, which I copied from a website that puts it into one's clipboard, into the url — the "Redirected from " note that appears on the other pages is also unclickable for me.

:I'm on firefox. The html for the "Redirected from " note looks normal to me when I inspect-element it (̃ vs ~ for a regular tilde (ok, it doesn't look normal, but the html seems to be correct)), so I imagine this is just about the browser choosing not to render links clickable when they only occur on combining diacritics. In fact, I've just verified this is the way it works on both firefox and chrome using the following html: k̃ which isn't clickable. It isn't even blue in Chrome! So if we want this to change (which seems like a reasonable change to desire), I think we would have to file bug reports with the browsers themselves. Or do some hack to work around it in Wikipedia's software.

: Dingolover6969 (talk) 10:00, 18 February 2025 (UTC)

::OK, these filled-nlink ones I tried seem to work for me. Dingolover6969 (talk) 10:01, 18 February 2025 (UTC)

:::Thank you, that allowed me to fix the immediate problem with those two (and revealed many more, which I will work through). I didn't know about the {{code |1= title=%XY%99}} hack.

:::Coming back to the more general point of {{code |1= nlink=}}: "if no parameter is specified, then the redirect is to an article with the same name as the Unicode canonical name"{{snd}} or at least that was I believed should happen: I was wrong. I see now that what actually happens is the link goes to the an article whose name is just one character long, the character itself. For 'ordinary' characters, that is not a problem because they are accessible but these combining ones are not.

:::(Normally, we only need to use the nlink if we want a specific section or if the Wikipedia article name and the Unicode canonical name differ.)

:::Do we need to add anything to the documentation? 𝕁𝕄𝔽 (talk) 11:04, 18 February 2025 (UTC)

::::Maybe if there is a cwith= then it should use the name of the character as the link.

::::It would also be really nice if Unicode character properties were used to cause cwith= to happen for any combining character rather than caller having to do it! Spitzak (talk) 18:56, 18 February 2025 (UTC)

::::Glad to hear it! :)

::::The nlink documentation is certainly a little confusing, what with its mentions of "using its canonical name", by which it means that the canonical name is linked, to the unicode character. I was able to figure out the state of affairs by close examination of a quadruply-nested bullet point caveat; but it could certainly be made more evident.

::::Dingolover6969 (talk) 06:46, 20 February 2025 (UTC)

Template styles

Can we convert this to use TemplateStyles?

While code point names are an exception from MOS:SMALLCAPS, this template forces small-caps in a way that isn't overridable by user styles, which means users who [http://w3c.github.io/low-vision-a11y-tf/requirements.html#capitalization struggle with all-caps text] (hi! 👋🏼) can't use our user CSS to change the presentation.

If we convert to template styles, users can override for their login, where necessary. — OwenBlacker (he/him; Talk) 10:06, 21 February 2025 (UTC)

:Isn't the text in the database all-caps? It seems like this won't really help anybody who can't read all-caps text. Spitzak (talk) 17:01, 21 February 2025 (UTC)

Jingtian (井)

Our example documentation for note= is {{unichar|4E95|note=Jingtian}}. However, I don't see why that should get a note. Based on my research (googling stuff), 井 (jǐng) is not a name for 井田制度 (jǐngtián zhìdù) (although 井田 (jǐngtián) is, apparently), even though 井田制度 is named after 井. I have added jǐngtián to the page for 井, so {{unichar|4E95|nlink=}} should be fine... or maybe {{unichar|4E95|note=jǐng}}. Paging User:JMF, who added this, and so may wish to weigh in. This was also discussed on Talk:井#Wrong_target?, back when that page used to redirect to jingtian, I assume. Dingolover6969 (talk) 09:00, 20 April 2025 (UTC)

:@Dingolover6969, would you move this over to the talk page of the article where you saw it, please? because I don't remember it and it doesn't look like a "feature" of this template. 𝕁𝕄𝔽 (talk) 09:13, 20 April 2025 (UTC)

:and the reason I don't remember it is because I've been framed. I assume you mean [https://en.m.wikipedia.org/w/index.php?title=Number_sign&diff=1249513017&oldid=1247955741 this diff] at number sign?

:: Ah, I understand. I was thinking of [https://en.wikipedia.org/w/index.php?title=Template:Unichar/doc&diff=next&oldid=1249514522 this Template:Unichar/doc diff], but presumably the number sign diff is the progenitor of the example "in the wild".

:: In that case, there's I don't think there's any remaining difficulty to discuss; it's just a single other Wikipedian getting confused and not knowing about the consensus that was reached about 井. I'll just remove that note from the pages on which it occurs.

:: Out of curiosity, I also chased down the same verbiage on the Sharp_(music) page, and found [https://en.wikipedia.org/w/index.php?title=Sharp_%28music%29&diff=prev&oldid=1218307262 this diff] which ultimately is a correction of [https://en.wikipedia.org/w/index.php?title=Sharp_%28music%29&diff=prev&oldid=453298269 this diff]; so, it's just some cruft that's been circulating Wikipedia since the early days.

::Anyway, sorry to bother you/thanks for your time 🙂 Dingolover6969 (talk) 02:27, 21 April 2025 (UTC)

Cyrillic example of use/use2, which doesn't seem to work?

I've just added an example from Japanese that demonstrates the value of {{code|1= use=lang |use2=ja}}.

: {{unichar|3099|cwith=◌|use=lang|use2=ja}} → {{unichar|3099|cwith=◌|use=lang|use2=ja}} (If use+use2 are not used, this is the (undesirable) effect: {{unichar|3099|cwith=◌}}: the the Japanese diacritic dakuten is not shown properly.)

But I can't see what the existing example is intended to demonstrate?

: {{unichar|0485|cwith=◌|use=script|use2=Cyrs}} → {{unichar|0485|cwith=|use=script|use2=Cyrs}}

since it doesn't actually render the {{midsize|CYRILLIC DASIA PNEUMATA}} diacritic with the place-holder character ({{char|◌}}). [btw, {{unichar|0485|cwith=◌}} produces {{unichar|0485|cwith=}}, so that doesn't work either.]

What should happen with the Cyrillic example? 𝕁𝕄𝔽 (talk) 16:56, 21 April 2025 (UTC)

:I think again this shows there should be an argument that is "arbitrary wiki markup to print instead of the character". This would replace the lang, image, size, cwith, and a ton of other argument bloat, and also allow access to stuff that is currently impossible, such as the "cwith for two characters", or "remove the emoji formatting". Spitzak (talk) 17:52, 21 April 2025 (UTC)

text format

can someone add the ability to force display a character as text? — kwami (talk) 03:21, 11 May 2025 (UTC)

:{{rto|Kwamikagami}} Can you explain what you mean?

:* Taking your initial as an example, {{unichar|004B}} displays the character {{angbr|K}} as text.

:* Or do you mean an emoji? like {{unichar|01F604}} displays a text description. I assume you don't mean :-D

:* the parameter {{code|1=image=}} does the opposite of what you ask, so I assume that this is not what you mean. (We should document the circumstances where this is justified. The only legitimate reason that I can think of is when there is a new codepoint but the glyph is not yet widely implemented in computer fonts. )

:More info please. --𝕁𝕄𝔽 (talk) 15:26, 11 May 2025 (UTC)

::sure,

::at 42355 Typhon, we said about a planetary symbol,

:::A hurricane symbol (16px), which might be identified with {{unichar|1F300}}, has been used.

::the emoji variant of the character isn't appropriate here; we would want to specify it as the text variant, but i don't see how to apply u+FE0E to force it to display as text

::thanks — kwami (talk) 17:39, 11 May 2025 (UTC)

:::Although the Unicode spec ([https://www.unicode.org/charts/PDF/U1F300.pdf Miscellaneous Symbols and Pictographs Range: 1F300–1F5FF]) shows a two-tailed glyph such as you want, how U+1F300 is rendered is a type designer's choice. Google has chosen to represent it as whirlpool in their default computer font for Chrome. Other vendors may have taken a different approach.

:::IFF you can find a computer font that renders it the way you want, you can wrap the {{tl|unichar}} call in {{code|1= span style="font-family }} etc. Compare the use of {{serif|{{unichar|0067}}}} with {{sans-serif|{{unichar|0067}}}} to chose a open-tail or closed-tail {{angbr|g}} (done using {{serif|{{unichar|0067}}}} and {{sans-serif|{{unichar|0067}}}}). BUT you can't assume that readers have that font{{snd}}indeed you should assume that they don't have that font. I'm afraid you are stuck with using {{code|file:Typhon symbol (fixed width).svg}} if you are to be sure that what you see is what they get. "Unicode specifies the code point, not the glyph": the standard provides a general semantic meaning for a given code point; the images in the chart are suggestions or illustrations and no more.

:::Unicode does have a mechanism called 'Variation Sequences' (accessed using special Variation Selector code points) to specify alternative glyphs for certain characters. These are often used for things like different styles of CJK characters or mathematical symbols. However, there isn't a variation sequence defined for U+1F300 to specifically request the line drawing or "text" style. The effect of U+FE0E (and U+FE0F) relies entirely on whether the font in use has defined a specific rendering for the base character when followed by these variation selectors.𝕁𝕄𝔽 (talk) 22:51, 11 May 2025 (UTC) extended 22:55, 11 May 2025 (UTC)

::::ah, my bad. i thought 1F300 was one of the characters that unicode defined as having both text and emoji glyphs. still, there are other characters that are so defined. that's deprecated practice now, but is permanently enshrined for the characters where it was implemented. shouldn't our unichar template support Unicode-defined glyphs? — kwami (talk) 00:00, 12 May 2025 (UTC)

:::::You had best do a new section to formally request that enhancement (though I don't know who is going to do it). But here's a thought: could {{code|1= cwith=}} be used to prepend the U+FE0E? Do you know of any convenient test cases? 𝕁𝕄𝔽 (talk) 09:46, 12 May 2025 (UTC)

::::::Being able to specify a variation selector is very much needed, mostly to turn off emoji variants. However I still recommend we do a really simple field which is "print this instead". This would cover the font selection, size selection, replacement images, multiple combining characters, variation selector, and all the other stuff that is piling up here as many confusing options and requests for options. Spitzak (talk) 12:41, 12 May 2025 (UTC)

::::::JMF, because 'cwith' is prepended, you have to put the desired character there, and then it displays the text character correctly but labels it U+FE0E 🌀︎ VARIATION SELECTOR-15 — kwami (talk) 19:34, 12 May 2025 (UTC)

:::::::Then that's a bug (if I can be unfair to call it a bug, given that this is a new use for cwith). If I use {{code|1= cwith=◌}} (together with combining tilde, for example), I don't get {{tq|0303 dotted circle}}, I get {{unichar|0303|cwith=◌}}. Also, as already covered, 1F300 doesn't have a variant. Let me try a real example and see what happens. Real soon now... 𝕁𝕄𝔽 (talk) 22:29, 12 May 2025 (UTC)

::::::::i think the problem is that the vs has to come after the character. — kwami (talk) 22:39, 12 May 2025 (UTC)

::::::{{tq|could cwith{{=}} be used to prepend the U+FE0E?}}. Yes, it could but it doesn't matter because the variation selector must be placed after the character it modifies. Unichar does not provide a way to do this. Warudo (talk) 22:44, 12 May 2025 (UTC)

:::::::D'oh!!!! 𝕁𝕄𝔽 (talk) 22:47, 12 May 2025 (UTC)

:::::::So, I think we should just allow editors to put whatever character they want after the character they display with a parameter that works just like cwith. I added this in the sandbox and called it {{para|suffix}}. {{unichar/sandbox|01F604|suffix=︎}} results in the desired {{unichar/sandbox|01F604|suffix=︎}}. The reason I think this approach is better than adding a dedicated parameter for VS15 alone like @Kwamikagami {{diff2|1289827942|did here}} is that there are uses for other variation selectors on Wikipedia (see slashed zero where VS1 is used) which means that in the future we might need to add even more parameters if we go down this path. Warudo (talk) 23:39, 12 May 2025 (UTC)

::::::::sure, that's straightforward. we could use vs15 as an example in the documentation as to what the new param might be used for. — kwami (talk) 01:59, 13 May 2025 (UTC)

::::::::I would still prefer, rather than bloating this up with yet more arguments, to add a single "print exactly this for the character" argument. Possibly it could substitute the actual unicode code point for 'X' or something in this string if you are concerned that this makes it too easy to print something else. The font/image/size/prefix/suffix/etc is getting too extreme, and you still have not fixed the ability to show a double-width diacritic over two dotted circles. Spitzak (talk) 08:52, 13 May 2025 (UTC)

:::::::::if you put the dotted ring in both 'cwith' and 'suffix', that should work — kwami (talk) 09:07, 13 May 2025 (UTC)

::::::::::The suffix parameter allows: {{unichar/sandbox|0360|cwith=◌|suffix=◌}} -> {{unichar/sandbox|0360|cwith=◌|suffix=◌}}. So it actually does give us {{tq|the ability to show a double-width diacritic over two dotted circles.}}

::::::::::On the other hand, I really don't like the idea of a "print exactly this for the character" parameter. It could easily lead to misleading output because of a mistake. Warudo (talk) 13:22, 13 May 2025 (UTC)

:::::::::::sandbox version works for article 42355 Typhon — kwami (talk) 19:06, 13 May 2025 (UTC)

::::::::::::Since using the sandbox in the encyclopedia is very risky as any editor might come along and break it, I ported the change to the actual template and replaced the call in 42355 Typhon. Since this change solves two problems with the template (the VS and the double-width diacritic) I feel justified in doing it. WP:BRD still applies if Spitzak or anyone else feels strongly about this. Warudo (talk) 19:48, 13 May 2025 (UTC)

:::::::::::::Suffix looks like it works. For some reason in Safari it sometimes (but not always) draws different-sized circles. Spitzak (talk) 21:56, 13 May 2025 (UTC)

::::::::::::::could that be your display font? circle+diacritic might force use of a different font than the default for the circle itself — kwami (talk) 23:08, 13 May 2025 (UTC)

:::::::::::::::Sorry to spoil the party, but on my system [Chrome on ChromeOS], {{unichar|01F604|suffix=︎}} and {{unichar|01F604}} both give me exactly the same thing, an emoji smiley. So I guess Google's Roboto doesn't have the text-format grapheme? Can anyone suggest a font that does? 𝕁𝕄𝔽 (talk) 23:24, 13 May 2025 (UTC)

::::::::::::::::i think the noto fonts do — kwami (talk) 23:53, 13 May 2025 (UTC)

:::::::::::I agree with Warudo: "display whatever you like" seems like a good idea at first but it is too likely to be misused. Same logic that led us to stop requiring editors to supply a name and instead to fetch the canonical name from Wikidata. 𝕁𝕄𝔽 (talk) 23:13, 13 May 2025 (UTC)

::::::::::::I might even have gone too far with suffix. Technically since a variation selector changes the appearance of the character, we should be telling people that it is there. I {{diff2|1290305285|added a note}} to 42355 Typhon that the VS is there although that is probably moot since the notion that {{unichar|1F300}} can be used for Typhon, with or without VS15, appears to be WP:OR. I posted a message on the article talk page about that. Warudo (talk) 00:18, 14 May 2025 (UTC)

:::::::::::::I added a note in the docs so hopefully any future usage of variation selectors is not hidden. Warudo (talk) 00:24, 14 May 2025 (UTC)

Peculiar combining behaviour

Does anybody understand U+20E0 {{small|COMBINING ENCLOSING CIRCLE BACKSLASH}}? Because it is making unichar have a nervous breakdown,

With no cwith= this is the result: {{unichar|20E0}}.
On my system, that displays an oversized "NO ..." sign that overlaps the concluding zero of 20E0, the space between, and the C of COMBINING.
With cwith=◌, this is the result: {{unichar|20E0|cwith=◌}}
On my system, that displays as between 20E0 and COMBINING.
With cwith=P, this is the result: {{unichar|20E0|cwith=P}}
On my system, that displays as between 20E0 and COMBINING.

Does not compute, Captain! 𝕁𝕄𝔽 (talk) 23:08, 13 May 2025 (UTC)

:with the last, i see 'no parking' -- a capital 'p' with an overstruck circle and slash

:both 2 and 3 look ok [if a bit crowded]; only 1 looks bad

:i'm using firefox on linux — kwami (talk) 20:23, 14 May 2025 (UTC)

::Ok, looks like another Roboto artefact. I'll stop worrying about it. Thanks. 𝕁𝕄𝔽 (talk) 21:37, 14 May 2025 (UTC)

Some character names are not found by the template

Currently in {{slink|CJK Unified Ideographs Extension I|Background}} one finds {{unichar|2ED9D}} and {{unichar|2EDE0}}. The reason the names of the characters are not shown is a bug in {{ml|Unicode data|lookup}}. I have already made an edit request to have the bug fixed, but I'm also leaving a message here since this talk page is more watched than the module's talk page. Warudo (talk) 13:45, 15 June 2025 (UTC)