Type | Bug | Status | reviewed | Date | 12-Dec-2010 03:11 |
---|---|---|---|---|---|
Version | alpha 110 | Category | Native | Submitted by | Kaj |
Platform | Linux x86 libc6 | Severity | minor | Priority | high |
Summary | deline/lines misinterprets UTF-8 character as newline |
---|---|
Description |
In the Syllable websites, we have a Slovenian version. One character that is also in the native name of the language: SlovenÅ¡čina the č is interpreted as a newline by deline/lines and functions that use it, such as read/lines, so that the text is corrupted and broken up. č is UTF-8 C48D in hex. This is a common character in Slovenian, so this is a major problem in Slovenian text. I haven't tested further, so there may also be other characters with the same problem. I've also had other problems with DELINE, but haven't analysed those yet. |
Example code |
deline/lines "SlovenÅ¡čina" |
Assigned to | n/a | Fixed in | - | Last Update | 26-Jan-2015 06:00 |
---|
Comments | |
---|---|
(0002970)
BrianH 12-Dec-2010 06:49 |
Note: This problem does not occur on Windows (at least on XP with a Unicode console font).
This should be easy to fix, so I lowered the severity. It is important to fix so I raised the priority. There are other DELINE bugs that also need fixing. |
(0003000)
JerryTsai 7-Jan-2011 09:34 |
read/lines has this issue too, when the text file contains certain Chinese characters, and is in utf-8. |
(0004553)
abolka 26-Jan-2015 06:00 |
In the core-tests suite. |
Date | User | Field | Action | Change |
---|---|---|---|---|
26-Jan-2015 06:00 | abolka | Comment : 0004553 | Added | - |
30-Jan-2011 18:55 | carl | Status | Modified | submitted => reviewed |
7-Jan-2011 09:35 | JerryTsai | Comment : 0003000 | Modified | - |
7-Jan-2011 09:34 | JerryTsai | Comment : 0003000 | Added | - |
12-Dec-2010 06:51 | BrianH | Comment : 0002970 | Modified | - |
12-Dec-2010 06:50 | BrianH | Severity | Modified | major => minor |
12-Dec-2010 06:50 | BrianH | Priority | Modified | normal => high |
12-Dec-2010 06:49 | BrianH | Comment : 0002970 | Modified | - |
12-Dec-2010 06:49 | BrianH | Comment : 0002970 | Added | - |
12-Dec-2010 06:48 | BrianH | Description | Modified | - |
12-Dec-2010 03:11 | Kaj | Ticket | Added | - |