Database speed optimization: few tables with many rows, or many tables with few rows?

鱼传尺愫 2021-01-19 07:23

I have a question.

Let's take as an example a database for some company's orders.

Let's say that this company makes around 2000 orders per month, so roughly 24,000 new rows per year. Which is faster in the long run: keeping all the orders in one big table, or splitting them into a separate table per year (orders_2020, orders_2021, and so on)? The database is PostgreSQL.

7 answers
  • 2021-01-19 07:56

    Before you worry about query speed, consider the costs.

    If you split the data into separate tables, you will have to have code that handles it. Every bit of code you write has a chance to be wrong. You are asking for your code to be buggy in exchange for an unmeasured and imagined performance win.

    Also consider the cost of machine time vs. programmer time.

  • 2021-01-19 08:00

    Look into partitioning your tables in time slices. Partitioning is good for the log-like table case where no foreign keys point to the tables.
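
    As a rough illustration, this is what yearly time slices look like with PostgreSQL's declarative partitioning (available since version 10); the table and column names here are hypothetical:

        -- Parent table: rows are routed to a partition by order date.
        CREATE TABLE orders (
            order_id   bigint        NOT NULL,
            ordered_at timestamptz   NOT NULL,
            amount     numeric(10,2)
        ) PARTITION BY RANGE (ordered_at);

        -- One partition per year; bounds are inclusive-exclusive.
        CREATE TABLE orders_2020 PARTITION OF orders
            FOR VALUES FROM ('2020-01-01') TO ('2021-01-01');
        CREATE TABLE orders_2021 PARTITION OF orders
            FOR VALUES FROM ('2021-01-01') TO ('2022-01-01');

    For what it's worth, the no-foreign-keys caveat applies to older releases; PostgreSQL 12 added support for foreign keys that reference partitioned tables.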

  • 2021-01-19 08:01

    I agree that smaller tables are faster. But whether it makes sense to split a single entity over multiple tables depends on your business logic. If you need a lot of code to manage all the tables, then it might not be a good idea.

    What you can do about it also depends on the database. In Oracle, a table can be partitioned (on year, for example); the data is stored physically in different tablespaces, which should make it faster to address (as I would assume that all data of a single year is stored together). A sketch of this follows at the end of this answer.

    An index will speed things up, but if the data is scattered across the disk then a load of block reads is required, which can make it slow.
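
    To make the Oracle point concrete, here is a minimal sketch of range partitioning across tablespaces; every name in it is made up:

        -- Each yearly partition lives in its own tablespace, i.e. its
        -- own physical storage area.
        CREATE TABLE orders (
            order_id   NUMBER,
            ordered_at DATE
        )
        PARTITION BY RANGE (ordered_at) (
            PARTITION p2020 VALUES LESS THAN (DATE '2021-01-01') TABLESPACE ts_2020,
            PARTITION p2021 VALUES LESS THAN (DATE '2022-01-01') TABLESPACE ts_2021
        );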

  • 2021-01-19 08:05

    I would not split tables by year.

    Instead, I would archive data to a reporting database every year, and use that when needed (the yearly move is sketched below).

    Alternatively, you could partition the data across drives to maintain performance, although I'm unsure whether this is possible in PostgreSQL.
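
    A minimal sketch of the yearly archive step, assuming a hypothetical orders table with an ordered_at column and an identically shaped orders_archive table. In PostgreSQL a data-modifying CTE moves the rows in a single statement; copying them on to a separate reporting server would be an extra step:

        -- Move every row from before the current year into the archive.
        WITH moved AS (
            DELETE FROM orders
            WHERE ordered_at < date_trunc('year', now())
            RETURNING *
        )
        INSERT INTO orders_archive
        SELECT * FROM moved;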

  • 2021-01-19 08:07

    If you use indexes properly, you probably won't need to split the data into multiple tables. Most modern DBs will optimize access.

    Another option you might consider is to have a table for the current year, and at the end of the year append its data to another table that holds all the previous years. Both ideas are sketched below.
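
    Hedged sketches of both suggestions, with hypothetical table and index names:

        -- Proper indexing: an index on the date column turns date-range
        -- queries into index scans instead of full-table scans.
        CREATE INDEX orders_ordered_at_idx ON orders (ordered_at);

        -- Year-end roll-up: append the finished year to the history table.
        INSERT INTO orders_history
        SELECT * FROM orders
        WHERE ordered_at >= DATE '2020-01-01'
          AND ordered_at <  DATE '2021-01-01';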

  • 2021-01-19 08:10

    For the volume of data you're looking at, splitting the data seems like a lot of trouble for little gain. Postgres can do partitioning, but the fine manual [1] says that, as a rule of thumb, you should probably only consider it for tables that exceed the physical memory of the server. In my experience, that's at least a million rows.
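
    To apply that rule of thumb, you can compare the table's total on-disk size (heap plus indexes) with the server's RAM; pg_size_pretty and pg_total_relation_size are standard PostgreSQL functions, and the table name here is hypothetical:

        -- Total size of the table including its indexes and TOAST data.
        SELECT pg_size_pretty(pg_total_relation_size('orders'));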

    [1] http://www.postgresql.org/docs/current/static/ddl-partitioning.html