how to have Accent-insensitive filter in django with postgres?

前端 未结 5 1786
挽巷
挽巷 2021-02-04 12:32

Hi I find that on postgres database, we can\'t configure default accent sensivity (on old mail exchanges).

Is there a way to have a _icontains also insensitive to specia

相关标签:
5条回答
  • 2021-02-04 12:35

    I don't believe you'll be able to use the standard Django field-lookups for this unless you store a non-accented version of your text in another column and do the lookup there. You could add a duplicate column with editable=False and override the model's save() method to update that field from the original accented text.

    Python: Remove accents from unicode

    PostgreSQL Wiki: Strip accents from strings, and output in lowercase

    0 讨论(0)
  • 2021-02-04 12:44

    EDIT: Django 1.8 makes accent unsensitive lookup for postgresql builtin. https://docs.djangoproject.com/en/dev/ref/contrib/postgres/lookups/#std:fieldlookup-unaccent

    In fact in postgres contrib (8.4+) there is an unaccent function to search easily:

    for postgres 9/8.5:

    • https://github.com/adunstan/postgresql-dev/commits/master/contrib/unaccent
    • http://www.sai.msu.su/~megera/wiki/unaccent

    for postgres 8.4:

    • https://launchpad.net/postgresql-unaccent

    here an example of usage from django:

    vals = MyObject.objects.raw(
            "SELECT * \
             FROM myapp_myobject \
             WHERE unaccent(name) LIKE \'%"+search_text+"%'")
    

    You may apply apply unaccent on text-search before comparison.

    Option I made is:

    #!/usr/bin/env python
    # -*- coding: utf-8 -*-
    # parts of credits comes to clarisys.fr
    from django.db.backends.postgresql_psycopg2.base import *
    
    class DatabaseOperations(DatabaseOperations):
        def lookup_cast(self, lookup_type):
            if lookup_type in('icontains', 'istartswith'):
                return "UPPER(unaccent(%s::text))"
            else:
                return super(DatabaseOperations, self).lookup_cast(lookup_type)
    
    class DatabaseWrapper(DatabaseWrapper):
        def __init__(self, *args, **kwargs):
            super(DatabaseWrapper, self).__init__(*args, **kwargs)
            self.operators['icontains'] = 'LIKE UPPER(unaccent(%s))'
            self.operators['istartswith'] = 'LIKE UPPER(unaccent(%s))'
            self.ops = DatabaseOperations(self)
    

    Use this file base.py in a folder and use this folder as db backend. icontains and istartswith are now case and accent insensitive.

    0 讨论(0)
  • 2021-02-04 12:46

    I managed to install unaccent from postgresql contrib, but this answer that patches django didn't work. load_backend on django.db.utils enforces that the backend name starts with django.db.backends.

    The solution that worked for me was inserting this code in one of my modules:

    from django.db.backends.postgresql_psycopg2.base import DatabaseOperations, DatabaseWrapper
    
    def lookup_cast(self, lookup_type, internal_type=None):
        if lookup_type in('icontains', 'istartswith'):
            return "UPPER(unaccent(%s::text))"
        else:
            return super(DatabaseOperations, self).lookup_cast(lookup_type, internal_type)
    
    def patch_unaccent():
        DatabaseOperations.lookup_cast = lookup_cast
        DatabaseWrapper.operators['icontains'] = 'LIKE UPPER(unaccent(%s))'
        DatabaseWrapper.operators['istartswith'] = 'LIKE UPPER(unaccent(%s))'
        print 'Unaccent patch'
    
    patch_unaccent()
    

    Now unaccent searches are working fine, even inside django admin! Thanks for your answer above!

    0 讨论(0)
  • 2021-02-04 12:49

    I'm working on an unaccent lookup field for django and postgreSQL. It's on github: https://github.com/marianobianchi/django-accent-free-lookup

    It's working fine for now, but it still need a lot of work. I'm using it and it doesn't show any problems for now.

    The way of using it is to make a new Manager for the model you want to have unaccents searches (look at the example stored at the end of managers.py file at the project).

    The lookups I have already implement are:

    "__aexact"

    "__aiexact"

    "__acontains"

    "__aicontains"

    They are equivalent to the common field lookups that come with django:

    "__exact"

    "__iexact"

    "__contains"

    "__icontains"

    with the difference that they are "accent insensitive" for the most common accented characters.

    0 讨论(0)
  • 2021-02-04 12:58

    I just released (a few days ago) the django-unaccent library that add operators to the django ORM for unaccent search. It monkeypatch the django ORM and uses the unaccent() function of postgres to do so.

    Please, check this out => https://github.com/djcoin/django-unaccent

    0 讨论(0)
提交回复
热议问题