SQLAlchemy ON DUPLICATE KEY UPDATE

前端 未结 9 1445
死守一世寂寞
死守一世寂寞 2020-11-27 15:53

Is there an elegant way to do an INSERT ... ON DUPLICATE KEY UPDATE in SQLAlchemy? I mean something with a syntax similar to inserter.insert().execute(lis

相关标签:
9条回答
  • 2020-11-27 16:28

    My way

    import typing
    from datetime import datetime
    from sqlalchemy.dialects import mysql
    
    class MyRepository:
    
        def model(self):
            return MySqlAlchemyModel
    
        def upsert(self, data: typing.List[typing.Dict]):
            if not data:
                return
            model = self.model()
            if hasattr(model, 'created_at'):
                for item in data:
                    item['created_at'] = datetime.now()
    
            stmt = mysql.insert(getattr(model, '__table__')).values(data)
            for_update = []
            for k, v in data[0].items():
                for_update.append(k)
    
            dup = {k: getattr(stmt.inserted, k) for k in for_update}
            stmt = stmt.on_duplicate_key_update(**dup)
            self.db.session.execute(stmt)
            self.db.session.commit()
    

    Usage:

    myrepo.upsert([
        {
            "field11": "value11",
            "field21": "value21",
            "field31": "value31",
        },
        {
            "field12": "value12",
            "field22": "value22",
            "field32": "value32",
        },
    ])
    
    0 讨论(0)
  • 2020-11-27 16:28

    As none of these solutions seem all the elegant. A brute force way is to query to see if the row exists. If it does delete the row and then insert otherwise just insert. Obviously some overhead involved but it does not rely on modifying the raw sql and it works on non orm stuff.

    0 讨论(0)
  • 2020-11-27 16:32

    The other answers have this covered but figured I'd reference another good example for mysql I found in this gist. This also includes the use of LAST_INSERT_ID, which may be useful depending on your innodb auto increment settings and whether your table has a unique key. Lifting the code here for easy reference but please give the author a star if you find it useful.

    from app import db
    from sqlalchemy import func
    from sqlalchemy.dialects.mysql import insert
    
    def upsert(model, insert_dict):
        """model can be a db.Model or a table(), insert_dict should contain a primary or unique key."""
        inserted = insert(model).values(**insert_dict)
        upserted = inserted.on_duplicate_key_update(
            id=func.LAST_INSERT_ID(model.id), **{k: inserted.inserted[k]
                                   for k, v in insert_dict.items()})
        res = db.engine.execute(upserted)
        return res.lastrowid
    
    0 讨论(0)
  • 2020-11-27 16:34

    Based on phsource's answer, and for the specific use-case of using MySQL and completely overriding the data for the same key without performing a DELETE statement, one can use the following @compiles decorated insert expression:

    from sqlalchemy.ext.compiler import compiles
    from sqlalchemy.sql.expression import Insert
    
    @compiles(Insert)
    def append_string(insert, compiler, **kw):
        s = compiler.visit_insert(insert, **kw)
        if insert.kwargs.get('on_duplicate_key_update'):
            fields = s[s.find("(") + 1:s.find(")")].replace(" ", "").split(",")
            generated_directive = ["{0}=VALUES({0})".format(field) for field in fields]
            return s + " ON DUPLICATE KEY UPDATE " + ",".join(generated_directive)
        return s
    
    0 讨论(0)
  • 2020-11-27 16:39

    ON DUPLICATE KEY UPDATE post version-1.2 for MySQL

    This functionality is now built into SQLAlchemy for MySQL only. somada141's answer below has the best solution: https://stackoverflow.com/a/48373874/319066

    ON DUPLICATE KEY UPDATE in the SQL statement

    If you want the generated SQL to actually include ON DUPLICATE KEY UPDATE, the simplest way involves using a @compiles decorator.

    The code (linked from a good thread on the subject on reddit) for an example can be found on github:

    from sqlalchemy.ext.compiler import compiles
    from sqlalchemy.sql.expression import Insert
    
    @compiles(Insert)
    def append_string(insert, compiler, **kw):
        s = compiler.visit_insert(insert, **kw)
        if 'append_string' in insert.kwargs:
            return s + " " + insert.kwargs['append_string']
        return s
    
    
    my_connection.execute(my_table.insert(append_string = 'ON DUPLICATE KEY UPDATE foo=foo'), my_values)
    

    But note that in this approach, you have to manually create the append_string. You could probably change the append_string function so that it automatically changes the insert string into an insert with 'ON DUPLICATE KEY UPDATE' string, but I'm not going to do that here due to laziness.

    ON DUPLICATE KEY UPDATE functionality within the ORM

    SQLAlchemy does not provide an interface to ON DUPLICATE KEY UPDATE or MERGE or any other similar functionality in its ORM layer. Nevertheless, it has the session.merge() function that can replicate the functionality only if the key in question is a primary key.

    session.merge(ModelObject) first checks if a row with the same primary key value exists by sending a SELECT query (or by looking it up locally). If it does, it sets a flag somewhere indicating that ModelObject is in the database already, and that SQLAlchemy should use an UPDATE query. Note that merge is quite a bit more complicated than this, but it replicates the functionality well with primary keys.

    But what if you want ON DUPLICATE KEY UPDATE functionality with a non-primary key (for example, another unique key)? Unfortunately, SQLAlchemy doesn't have any such function. Instead, you have to create something that resembles Django's get_or_create(). Another StackOverflow answer covers it, and I'll just paste a modified, working version of it here for convenience.

    def get_or_create(session, model, defaults=None, **kwargs):
        instance = session.query(model).filter_by(**kwargs).first()
        if instance:
            return instance
        else:
            params = dict((k, v) for k, v in kwargs.iteritems() if not isinstance(v, ClauseElement))
            if defaults:
                params.update(defaults)
            instance = model(**params)
            return instance
    
    0 讨论(0)
  • 2020-11-27 16:42

    I just used plain sql as:

    insert_stmt = "REPLACE INTO tablename (column1, column2) VALUES (:column_1_bind, :columnn_2_bind) "
    session.execute(insert_stmt, data)
    
    0 讨论(0)
提交回复
热议问题