Removing nonnumerical data out of a number + SQL

前端 未结 5 1844
深忆病人
深忆病人 2021-01-21 06:06

I\'m trying find the best way to remove nonnumerical data from a varchar in SQL e.g.

\'(082) 000-0000\' to \'0820000000\' or
\'+2782 000 0000\' to \'0820000000\'         


        
相关标签:
5条回答
  • 2021-01-21 06:30

    Have a look at this post (it's the 8th post down - the first LONG one) which details how to use regular expressions in SQL Server. It's not the fastest (that would be do it before you get to SQL) but it provides a decent way to do it.

    0 讨论(0)
  • 2021-01-21 06:34

    The most effective and flexible that I have found is using the numbers/tally table method as shown in mwigdahl's answer on 10 Mar 2009

    i.e. an allow list is much safer than the do-not-allow list that you have put in the bottom of your question above.

    What you have not stated is how you handle non-integers ... what do you do with decimal points?

    0 讨论(0)
  • 2021-01-21 06:40

    What flavour of SQL backend are you using? If there's a regexp_replace kind of function, you could use that to replace [^0-9] with nothing.

    0 讨论(0)
  • 2021-01-21 06:52

    If you're using SQL Server 2005 or newer then your best option is to create a user-defined CLR function and use a regular expression to remove all non-numeric characters.

    If you don't want to use a CLR function then you could create a standard user-defined function. This will do the job although it won't be as efficient:

    CREATE FUNCTION dbo.RemoveNonNumerics(@in VARCHAR(255))
    RETURNS VARCHAR(255)
    AS
    BEGIN
        DECLARE @out VARCHAR(255)
    
        IF (@in IS NOT NULL)
        BEGIN
            SET @out = ''
    
            WHILE (@in <> '')
            BEGIN
                IF (@in LIKE '[0-9]%')
                    SET @out = @out + SUBSTRING(@in, 1, 1)
    
                SET @in = SUBSTRING(@in, 2, LEN(@in) - 1)
            END
        END
    
        RETURN(@out)
    END
    

    And then select from your table like so:

    SELECT dbo.RemoveNonNumerics(your_column) AS your_tidy_column
    FROM your_table
    
    0 讨论(0)
  • 2021-01-21 06:55

    It is much easier to handle string parsing in your business layer. However, baring that use the T-SQL REPLACE() function (assuming MS SQL).

    You could do a loop with that function on the parameter that was passed in to strip all non-numeric letters out of it.

    0 讨论(0)
提交回复
热议问题