Transactions with Python sqlite3

后端 未结 7 1641
梦如初夏
梦如初夏 2020-12-07 17:33

I\'m trying to port some code to Python that uses sqlite databases, and I\'m trying to get transactions to work, and I\'m getting really confused. I\'m really confused by th

相关标签:
7条回答
  • 2020-12-07 18:10

    This is a bit old thread but if it helps I've found that doing a rollback on the connection object does the trick.

    0 讨论(0)
  • 2020-12-07 18:16

    Normal .execute()'s work as expected with the comfortable default auto-commit mode and the with conn: ... context manager doing auto-commit OR rollback - except for protected read-modify-write transactions, which are explained at the end of this answer.

    sqlite3 module's non-standard conn_or_cursor.executescript() doesn't take part in the (default) auto-commit mode (and so doesn't work normally with the with conn: ... context manager) but forwards the script rather raw. Therefor it just commits a potentially pending auto-commit transactions at start, before "going raw".

    This also means that without a "BEGIN" inside the script executescript() works without a transaction, and thus no rollback option upon error or otherwise.

    So with executescript() we better use a explicit BEGIN (just as your inital schema creation script did for the "raw" mode sqlite command line tool). And this interaction shows step by step whats going on:

    >>> list(conn.execute('SELECT * FROM test'))
    [(99,)]
    >>> conn.executescript("BEGIN; UPDATE TEST SET i = 1; FNORD; COMMIT""")
    Traceback (most recent call last):
      File "<interactive input>", line 1, in <module>
    OperationalError: near "FNORD": syntax error
    >>> list(conn.execute('SELECT * FROM test'))
    [(1,)]
    >>> conn.rollback()
    >>> list(conn.execute('SELECT * FROM test'))
    [(99,)]
    >>> 
    

    The script didn't reach the "COMMIT". And thus we could the view the current intermediate state and decide for rollback (or commit nevertheless)

    Thus a working try-except-rollback via excecutescript() looks like this:

    >>> list(conn.execute('SELECT * FROM test'))
    [(99,)]
    >>> try: conn.executescript("BEGIN; UPDATE TEST SET i = 1; FNORD; COMMIT""")
    ... except Exception as ev: 
    ...     print("Error in executescript (%s). Rolling back" % ev)
    ...     conn.executescript('ROLLBACK')
    ... 
    Error in executescript (near "FNORD": syntax error). Rolling back
    <sqlite3.Cursor object at 0x011F56E0>
    >>> list(conn.execute('SELECT * FROM test'))
    [(99,)]
    >>> 
    

    (Note the rollback via script here, because no .execute() took over commit control)


    And here a note on the auto-commit mode in combination with the more difficult issue of a protected read-modify-write transaction - which made @Jeremie say "Out of all the many, many things written about transactions in sqlite/python, this is the only thing that let me do what I want (have an exclusive read lock on the database)." in a comment on an example which included a c.execute("begin"). Though sqlite3 normally does not make a long blocking exclusive read lock except for the duration of the actual write-back, but more clever 5-stage locks to achieve enough protection against overlapping changes.

    The with conn: auto-commit context does not already put or trigger a lock strong enough for protected read-modify-write in the 5-stage locking scheme of sqlite3. Such lock is made implicitely only when the first data-modifying command is issued - thus too late. Only an explicit BEGIN (DEFERRED) (TRANSACTION) triggers the wanted behavior:

    The first read operation against a database creates a SHARED lock and the first write operation creates a RESERVED lock.

    So a protected read-modify-write transaction which uses the programming language in general way (and not a special atomic SQL UPDATE clause) looks like this:

    with conn:
        conn.execute('BEGIN TRANSACTION')    # crucial !
        v = conn.execute('SELECT * FROM test').fetchone()[0]
        v = v + 1
        time.sleep(3)  # no read lock in effect, but only one concurrent modify succeeds
        conn.execute('UPDATE test SET i=?', (v,))
    

    Upon failure such read-modify-write transaction could be retried a couple of times.

    0 讨论(0)
  • 2020-12-07 18:17

    You can use the connection as a context manager. It will then automatically rollback the transactions in the event of an exception or commit them otherwise.

    try:
        with con:
            con.execute("insert into person(firstname) values (?)", ("Joe",))
    
    except sqlite3.IntegrityError:
        print("couldn't add Joe twice")
    

    See https://docs.python.org/3/library/sqlite3.html#using-the-connection-as-a-context-manager

    0 讨论(0)
  • 2020-12-07 18:24

    For anyone who'd like to work with the sqlite3 lib regardless of its shortcomings, I found that you can keep some control of transactions if you do these two things:

    1. set Connection.isolation_level = None (as per the docs, this means autocommit mode)
    2. avoid using executescript at all, because according to the docs it "issues a COMMIT statement first" - ie, trouble. Indeed I found it interferes with any manually set transactions

    So then, the following adaptation of your test works for me:

    import sqlite3
    
    sql = sqlite3.connect("/tmp/test.db")
    sql.isolation_level = None
    c = sql.cursor()
    c.execute("begin")
    try:
        c.execute("update test set i = 1")
        c.execute("fnord")
        c.execute("update test set i = 0")
        c.execute("commit")
    except sql.Error:
        print("failed!")
        c.execute("rollback")
    
    0 讨论(0)
  • 2020-12-07 18:25

    Per the docs,

    Connection objects can be used as context managers that automatically commit or rollback transactions. In the event of an exception, the transaction is rolled back; otherwise, the transaction is committed:

    Therefore, if you let Python exit the with-statement when an exception occurs, the transaction will be rolled back.

    import sqlite3
    
    filename = '/tmp/test.db'
    with sqlite3.connect(filename) as conn:
        cursor = conn.cursor()
        sqls = [
            'DROP TABLE IF EXISTS test',
            'CREATE TABLE test (i integer)',
            'INSERT INTO "test" VALUES(99)',]
        for sql in sqls:
            cursor.execute(sql)
    try:
        with sqlite3.connect(filename) as conn:
            cursor = conn.cursor()
            sqls = [
                'update test set i = 1',
                'fnord',   # <-- trigger error
                'update test set i = 0',]
            for sql in sqls:
                cursor.execute(sql)
    except sqlite3.OperationalError as err:
        print(err)
        # near "fnord": syntax error
    with sqlite3.connect(filename) as conn:
        cursor = conn.cursor()
        cursor.execute('SELECT * FROM test')
        for row in cursor:
            print(row)
            # (99,)
    

    yields

    (99,)
    

    as expected.

    0 讨论(0)
  • 2020-12-07 18:31

    Here's what I think is happening based on my reading of Python's sqlite3 bindings as well as official Sqlite3 docs. The short answer is that if you want a proper transaction, you should stick to this idiom:

    with connection:
        db.execute("BEGIN")
        # do other things, but do NOT use 'executescript'
    

    Contrary to my intuition, with connection does not call BEGIN upon entering the scope. In fact it doesn't do anything at all in __enter__. It only has an effect when you __exit__ the scope, choosing either COMMIT or ROLLBACK depending on whether the scope is exiting normally or with an exception.

    Therefore, the right thing to do is to always explicitly mark the beginning of your transactions using BEGIN. This renders isolation_level irrelevant within transactions, because thankfully it only has an effect while autocommit mode is enabled, and autocommit mode is always suppressed within transaction blocks.

    Another quirk is executescript, which always issues a COMMIT before running your script. This can easily mess up the transactions, so your choice is to either

    • use exactly one executescript within a transaction and nothing else, or
    • avoid executescript entirely; you can call execute as many times as you want, subject to the one-statement-per-execute limitation.
    0 讨论(0)
提交回复
热议问题