无法读取的符号,而不是ISO-8859-1文件中的俄语文本



我有编码问题。

我下载了很多相册,正如我所看到的,整个档案都是在Windows环境中制作的,因为.cue文件是ISO-8859-1编码的,其中的西里尔文(俄语(文本是不可读的。示例:

REM GENRE Rock
REM DATE 1987
REM DISCID B407700D
REM COMMENT "ExactAudioCopy v0.99pb4"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
TITLE "Èãðà â áèñåð ïåðåä ñâèíüÿìè [ÕÎÐ]"
FILE "Ãðàæäàíñêàÿ Îáîðîíà - Èãðà â áèñåð ïåðåä ñâèíüÿìè [ÕÎÐ].flac" WAVE
TRACK 01 AUDIO
TITLE "Èãðà â áèñåð"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 01 00:00:00
TRACK 02 AUDIO
TITLE "Íà íàøèõ ãëàçàõ"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 02:22:06
INDEX 01 02:24:59
TRACK 03 AUDIO
TITLE "×óæåðîäíûì ýëåìåíòîì (÷àñòèöåé ëæè)"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 04:31:69
INDEX 01 04:34:27
TRACK 04 AUDIO
TITLE "ß èëëþçîðåí"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 07:30:40
INDEX 01 07:32:61
TRACK 05 AUDIO
TITLE "Äåòñêèé Ìèð"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 10:58:50
INDEX 01 11:00:12
TRACK 06 AUDIO
TITLE "Çîîïàðê"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 12:54:46
INDEX 01 12:58:08
TRACK 07 AUDIO
TITLE "ÊÁÃ-Ðîê (Ðîê-ÊÁÃ)"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 15:41:48
INDEX 01 15:43:39
TRACK 08 AUDIO
TITLE "Ñêîðî íàñòàíåò ñîâñåì"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 17:57:23
INDEX 01 17:59:51
TRACK 09 AUDIO
TITLE "Íåíàâèæó êðàñíûé öâåò"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 20:59:09
INDEX 01 21:01:20
TRACK 10 AUDIO
TITLE "Îí óâèäåë ñîëíöå"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 23:07:31
INDEX 01 23:09:45
TRACK 11 AUDIO
TITLE "Îïòèìèçì"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 25:48:46
INDEX 01 25:51:13
TRACK 12 AUDIO
TITLE "Ìàìà, ìàìà..."
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 27:53:58
INDEX 01 27:56:03
TRACK 13 AUDIO
TITLE "Óáèéöà"
PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"
INDEX 00 30:34:19
INDEX 01 30:36:12

这是我已经从ISO-8859-1转换为UTF-8的文件,但之前的变体有带问号(?(的正方形。那么,我该如何使这个咒语可读呢?

您的文件不是iso-8859-1编码的。我猜是cp1251。您面临一个mojibake案例(Python中的示例,因为其通用可理解性(:

'PERFORMER "Ãðàæäàíñêàÿ Îáîðîíà"'.encode('iso-8859-1').decode('cp1251')
'PERFORMER "Гражданская Оборона"'

最新更新