Parse a non-ascii (unicode) number-string as integer in .NET

前端 未结 3 1616
臣服心动
臣服心动 2021-02-20 06:49

I have a string containing a number in a non-ascii format e.g. unicode BENGALI DIGIT ONE (U+09E7) : \"১\"

How do I parse this as an integer in .NET?

3条回答
  •  慢半拍i
    慢半拍i (楼主)
    2021-02-20 07:38

    You could create a new string that is the same as the old string except the native digits are replaced with Latin decimal digits. This could be done reliably by looping through the characters and checking the value of char.IsDigit(char). If this function returns true, then convert it with char.GetNumericValue(char).ToString().

    Like this:

    static class DigitHelper
    {
        public static string ConvertNativeDigits(this string text)
        {
            if (text == null)
                return null;
            if (text.Length == 0)
                return string.Empty;
            StringBuilder sb = new StringBuilder();
            foreach (char character in text)
            {
                if (char.IsDigit(character))
                    sb.Append(char.GetNumericValue(character));
                else
                    sb.Append(character);
            }
            return sb.ToString();
        }
    }
    
    
    int value = int.Parse(bengaliNumber.ConvertNativeDigits());
    

提交回复
热议问题