Unknown symbols in subdivNames #14

Closed
opened 2025-12-28 18:12:06 +00:00 by sami · 0 comments
Owner

Originally created by @AliceInHunterland on GitHub (Nov 10, 2023).

Originally assigned to: @End-rey on GitHub.

While preprocessing SubdivisionCodes.csv it leaves some "?", "u0092", "u0091".

Current Behavior

currently, those records are corrupted:

Country, Location, Original subName, Result
AZ,KUR,K�rd?mir,Kürd?mir
AZ,MI,Ming?�evir,Ming?çevir
ER,DK,Debubawi K�eyyi? Ba?ri,Debubawi K’eyyi? Ba?ri
ER,SK,Semienawi K�eyyi? Ba?ri,Semienawi K’eyyi? Ba?ri
LK,5,N�?genahira pa?ata,Næ?genahira pa?ata
LK,7,Uturum�?da pa?ata,Uturumæ?da pa?ata
VN,24,Qu?ng B�nh,Qu?ng Bình
VN,26,Th?a Thi�n-Hu?,Th?a Thiên-Hu?
VN,29,Qu?ng Ng�i,Qu?ng Ngãi
VN,31,B�nh �?nh,Bình Ð?nh
VN,33,�?k L?k,Ð?k L?k
VN,35,L�m �?ng,Lâm Ð?ng
VN,39,�?ng Nai,Ð?ng Nai
VN,40,B�nh Thu?n,Bình Thu?n
VN,43,B� R?a - Vung T�u,Bà R?a - Vung Tàu
VN,45,�?ng Th�p,Ð?ng Tháp
VN,55,B?c Li�u,B?c Liêu
VN,58,B�nh Phu?c,Bình Phu?c
VN,67,Nam �?nh,Nam Ð?nh
VN,68,Ph� Th?,Phú Th?
VN,71,�i?n Bi�n,Ði?n Biên
VN,72,�?k N�ng,Ð?k Nông
VN,HN,H� N?i,Hà N?i
VN,HP,H?i Ph�ng,H?i Phòng
VN,SG,H? Ch� Minh,H? Chí Minh
YE,BA,Al Bay?a�,Al Bay?a’
YE,DA,"	A? ?ali�","	A? ?ali‘"

Possible Solution

find another way of decoding or another source for data.

Originally created by @AliceInHunterland on GitHub (Nov 10, 2023). Originally assigned to: @End-rey on GitHub. <!-- Provide a general summary of the issue in the Title above --> While preprocessing SubdivisionCodes.csv it leaves some "?", "u0092", "u0091". ## Current Behavior <!-- Tell us what happens instead of the expected behavior --> currently, those records are corrupted: ``` Country, Location, Original subName, Result AZ,KUR,K�rd?mir,Kürd?mir AZ,MI,Ming?�evir,Ming?çevir ER,DK,Debubawi K�eyyi? Ba?ri,Debubawi K’eyyi? Ba?ri ER,SK,Semienawi K�eyyi? Ba?ri,Semienawi K’eyyi? Ba?ri LK,5,N�?genahira pa?ata,Næ?genahira pa?ata LK,7,Uturum�?da pa?ata,Uturumæ?da pa?ata VN,24,Qu?ng B�nh,Qu?ng Bình VN,26,Th?a Thi�n-Hu?,Th?a Thiên-Hu? VN,29,Qu?ng Ng�i,Qu?ng Ngãi VN,31,B�nh �?nh,Bình Ð?nh VN,33,�?k L?k,Ð?k L?k VN,35,L�m �?ng,Lâm Ð?ng VN,39,�?ng Nai,Ð?ng Nai VN,40,B�nh Thu?n,Bình Thu?n VN,43,B� R?a - Vung T�u,Bà R?a - Vung Tàu VN,45,�?ng Th�p,Ð?ng Tháp VN,55,B?c Li�u,B?c Liêu VN,58,B�nh Phu?c,Bình Phu?c VN,67,Nam �?nh,Nam Ð?nh VN,68,Ph� Th?,Phú Th? VN,71,�i?n Bi�n,Ði?n Biên VN,72,�?k N�ng,Ð?k Nông VN,HN,H� N?i,Hà N?i VN,HP,H?i Ph�ng,H?i Phòng VN,SG,H? Ch� Minh,H? Chí Minh YE,BA,Al Bay?a�,Al Bay?a’ YE,DA," A? ?ali�"," A? ?ali‘" ``` ## Possible Solution find another way of decoding or another source for data.
sami 2025-12-28 18:12:06 +00:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
nspcc-dev/locode-db#14
No description provided.