Case insensitive dictionary

后端 未结 10 2293
栀梦
栀梦 2020-11-27 14:47

I\'d like my dictionary to be case insensitive.

I have this example code:

text = \"practice changing the color\"

words = {\'color\': \'colour\',
            


        
相关标签:
10条回答
  • 2020-11-27 15:13

    The currently approved answer doesn't work for a lot of cases, so it cannot be used as a drop-in dict replacement. Some tricky points in getting a proper dict replacement:

    • overloading all of the methods that involve keys
    • properly handling non-string keys
    • properly handling the constructor of the class

    The following should work much better:

    class CaseInsensitiveDict(dict):
        @classmethod
        def _k(cls, key):
            return key.lower() if isinstance(key, basestring) else key
    
        def __init__(self, *args, **kwargs):
            super(CaseInsensitiveDict, self).__init__(*args, **kwargs)
            self._convert_keys()
        def __getitem__(self, key):
            return super(CaseInsensitiveDict, self).__getitem__(self.__class__._k(key))
        def __setitem__(self, key, value):
            super(CaseInsensitiveDict, self).__setitem__(self.__class__._k(key), value)
        def __delitem__(self, key):
            return super(CaseInsensitiveDict, self).__delitem__(self.__class__._k(key))
        def __contains__(self, key):
            return super(CaseInsensitiveDict, self).__contains__(self.__class__._k(key))
        def has_key(self, key):
            return super(CaseInsensitiveDict, self).has_key(self.__class__._k(key))
        def pop(self, key, *args, **kwargs):
            return super(CaseInsensitiveDict, self).pop(self.__class__._k(key), *args, **kwargs)
        def get(self, key, *args, **kwargs):
            return super(CaseInsensitiveDict, self).get(self.__class__._k(key), *args, **kwargs)
        def setdefault(self, key, *args, **kwargs):
            return super(CaseInsensitiveDict, self).setdefault(self.__class__._k(key), *args, **kwargs)
        def update(self, E={}, **F):
            super(CaseInsensitiveDict, self).update(self.__class__(E))
            super(CaseInsensitiveDict, self).update(self.__class__(**F))
        def _convert_keys(self):
            for k in list(self.keys()):
                v = super(CaseInsensitiveDict, self).pop(k)
                self.__setitem__(k, v)
    
    0 讨论(0)
  • 2020-11-27 15:20

    While a case insensitive dictionary is a solution, and there are answers to how to achieve that, there is a possibly easier way in this case. A case insensitive search is sufficient:

    import re
    
    text = "Practice changing the Color"
    words = {'color': 'colour', 'practice': 'practise'}
    
    def replace(words,text):
            keys = words.keys()
            for i in keys:
                    exp = re.compile(i, re.I)
                    text = re.sub(exp, words[i], text)
            return text
    
    text = replace(words,text)
    print text
    
    0 讨论(0)
  • 2020-11-27 15:21

    In my particular instance, I needed a case insensitive lookup, however, I did not want to modify the original case of the key. For example:

    >>> d = {}
    >>> d['MyConfig'] = 'value'
    >>> d['myconfig'] = 'new_value'
    >>> d
    {'MyConfig': 'new_value'}
    

    You can see that the dictionary still has the original key, however it is accessible case-insensitively. Here's a simple solution:

    class CaseInsensitiveKey(object):
        def __init__(self, key):
            self.key = key
        def __hash__(self):
            return hash(self.key.lower())
        def __eq__(self, other):
            return self.key.lower() == other.key.lower()
        def __str__(self):
            return self.key
    

    The __hash__ and __eq__ overrides are required for both getting and setting entries in the dictionary. This is creating keys that hash to the same position in the dictionary if they are case-insensitively equal.

    Now either create a custom dictionary that initializes a CaseInsensitiveKey using the provided key:

    class CaseInsensitiveDict(dict):
        def __setitem__(self, key, value):
            key = CaseInsensitiveKey(key)
            super(CaseInsensitiveDict, self).__setitem__(key, value)
        def __getitem__(self, key):
            key = CaseInsensitiveKey(key)
            return super(CaseInsensitiveDict, self).__getitem__(key)
    

    or simply make sure to always pass an instance of CaseInsensitiveKey as the key when using the dictionary.

    0 讨论(0)
  • 2020-11-27 15:22

    If I understand you correctly and you want a way to key dictionaries in a non case-sensitive fashion, one way would be to subclass dict and overload the setter / getter:

    class CaseInsensitiveDict(dict):
        def __setitem__(self, key, value):
            super(CaseInsensitiveDict, self).__setitem__(key.lower(), value)
    
        def __getitem__(self, key):
            return super(CaseInsensitiveDict, self).__getitem__(key.lower())
    
    0 讨论(0)
  • 2020-11-27 15:22

    I've modified the simple yet good solution by pleasemorebacon (thanks!) making it slightly more compact, self-contained and with minor updates to allow construction from {'a':1, 'B':2} and support __contains__ protocol. Finally, since the CaseInsensitiveDict.Key is expected to be string (what else can be case-sensitive or not), it is a good idea to derive Key class from the str, then it is possible, for instance, to dump CaseInsensitiveDict with json.dumps out of the box.

    # caseinsensitivedict.py
    class CaseInsensitiveDict(dict):
    
        class Key(str):
            def __init__(self, key):
                str.__init__(key)
            def __hash__(self):
                return hash(self.lower())
            def __eq__(self, other):
                return self.lower() == other.lower()
    
        def __init__(self, data=None):
            super(CaseInsensitiveDict, self).__init__()
            if data is None:
                data = {}
            for key, val in data.items():
                self[key] = val
        def __contains__(self, key):
            key = self.Key(key)
            return super(CaseInsensitiveDict, self).__contains__(key)
        def __setitem__(self, key, value):
            key = self.Key(key)
            super(CaseInsensitiveDict, self).__setitem__(key, value)
        def __getitem__(self, key):
            key = self.Key(key)
            return super(CaseInsensitiveDict, self).__getitem__(key)
    

    Here is a basic test script for those who like to check things in action:

    # test_CaseInsensitiveDict.py
    import json
    import unittest
    from caseinsensitivedict import *
    
    class Key(unittest.TestCase):
        def setUp(self):
            self.Key = CaseInsensitiveDict.Key
            self.lower = self.Key('a')
            self.upper = self.Key('A')
    
        def test_eq(self):
            self.assertEqual(self.lower, self.upper)
    
        def test_hash(self):
            self.assertEqual(hash(self.lower), hash(self.upper))
    
        def test_str(self):
            self.assertEqual(str(self.lower), 'a')
            self.assertEqual(str(self.upper), 'A')
    
    class Dict(unittest.TestCase):
        def setUp(self):
            self.Dict = CaseInsensitiveDict
            self.d1 = self.Dict()
            self.d2 = self.Dict()
            self.d1['a'] = 1
            self.d1['B'] = 2
            self.d2['A'] = 1
            self.d2['b'] = 2
    
        def test_contains(self):
            self.assertIn('B', self.d1)
            d = self.Dict({'a':1, 'B':2})
            self.assertIn('b', d)
    
        def test_init(self):
            d = self.Dict()
            self.assertFalse(d)
            d = self.Dict({'a':1, 'B':2})
            self.assertTrue(d)
    
        def test_items(self):
            self.assertDictEqual(self.d1, self.d2)
            self.assertEqual(
                [v for v in self.d1.items()],
                [v for v in self.d2.items()])
    
        def test_json_dumps(self):
            s = json.dumps(self.d1)
            self.assertIn('a', s)
            self.assertIn('B', s)
    
        def test_keys(self):
            self.assertEqual(self.d1.keys(), self.d2.keys())
    
        def test_values(self):
            self.assertEqual(
                [v for v in self.d1.values()],
                [v for v in self.d2.values()])
    
    0 讨论(0)
  • 2020-11-27 15:23

    Just for the record. I found an awesome impementation on Requests:

    https://github.com/kennethreitz/requests/blob/v1.2.3/requests/structures.py#L37

    0 讨论(0)
提交回复
热议问题