Best strategy to automating 'complex' calculation tasks on SQL Server 2012 from Excel [closed]

我只是一个虾纸丫 提交于 2020-04-17 21:30:39

问题


I have a workbook which has 4 spreadsheets.

The first spreadsheet uses parameters and data from the other three to calculate some functions and in the end it gives the net present value of some benefit plan conceded to an employee.

Description

First spreadsheet - Net Present Value of the Obligations

That same first spreadsheet uses 21 tables with ~30 columns each. That first table is the only different from the others. The other 20 tables have the same exact structure, only the parameters received by each table are different (age and interest rate).

Each table is calculated as follows:

1st step: the data from the user is retrieved from the third table using VLOOKUP. The age of the user is calculated from their birthdate. The current age is then incremented until 120, creating one row for each age from the current age until the user reaches the age of 120. Each variable from the table is calculated for each age using the variables' previous values.

I achieved that result using CTEs (common table expressions) for the first table.

But the other 20 tables whose strutucre is the same for all of them need to be calculated using data from that first CTE.

First table

Only the first table has the age and it's used by all the tables. I was able to translate this first table using CTE (and I use it with the fourth spreadsheet that I was able to translate to SQL. See below.) StackOverflow post.

Second table and the following tables

That pattern repeats for the other 19 tables (20 tables with the one below). Each row represents one age from the first table.

In this table I have to get the total sum of the social secutiry benefit, disability savings, pension and release earnings columns and sum those values to the columns with the same name/header in the other 19 tables, only the parameters are different.

I need to pass this workbook calculation to SQL, but I don't know what's the best approach.

Second spreadsheet - Parameters

These are the prameters used to calculate the first table in the first spreadsheet. Just VLOOKUPs to retrieve data from other tables with some probabilities.

Third spreadsheet - Benefit plan participant data

In this spreadsheet we have the information of the participant and their dependants. I have to calculate the variables from the first table for them both. I use a CTE and in its recursive part I am using a OUTER APPLY function, because when I used JOIN as soon as one of them reached the age of 120 the younger one would stop calculating. I don't have this problem using

Fourth spreadsheet - Comutation table

The commutation tables were done using once again CTEs. I used a stored procedure to produce a CTE and then when I call the calculation procedure I create a temptable and JOIN with the calculation table in SQL.

Approaches

Firstly, I don't even know if using CTEs are the best approach even for calculating the first table. I have serious doubts about my work, because I have nobody to whom I can ask questions, only Q&A (like StackExchange) sites and forums on the internet.

Secondly, I have the woorkbook on Google Sheets, but I'm reluctant to share it, because there are really sensitive data in it. But if you really need to see it, you can leave me your email so I can share the workbook via email.

Conclusion

I need some brainstorming on the best approach to solve this problem.

CTEs, SSIS, windowing, etc. I just don't know what to do.

Edit #1

There are no plugins or add-ins. The Excel suffers from performance. One of the spreasheets has participants' data (between 70,000 to 80,000. That number changes, because the participants are real people who can leave the benefit plan, die, retire, etc.).

There are a number of users that can access the Excel workbook also. I want to move everything from Excel to SQL Server.

One of the benefit plans that I could I moved to SQL Server using CTEs, that hasn't the complexity of the plan in this post, went from 4 hours executing in Excel to only 1.3 minutes in SQL Server.

The benefit plan I'm exposing here takes 6.5 hours in Excel.

The first table in the first spreadsheet I could easily move to SQL Server using a CTE, as I have done to others.

I could also build a CTE to the second, third, fourth... but that would be too much work and I think it's inefficient.

Even though the 20 tables have the same structure, the parameters are not the same and that's really the problem because I don't how to include those parameters in the code.

I'll follow the advice from @Nick.McDermaid and @billinkc and see if I can progress.

来源:https://stackoverflow.com/questions/61150117/best-strategy-to-automating-complex-calculation-tasks-on-sql-server-2012-from

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!