The link: http://www.geeksforgeeks.org/segment-tree-set-1-sum-of-given-range/. This is the quoted text:
We start with a segment arr[0 . . . n-1]. And ever
Let the size of input array is n.
All the input array elements will be leaf nodes in segment tree so the number of leaf nodes = n
Since the Segment tree is a complete tree so the Hight of Segment Tree h = ⌈ Log2n ⌉ + 1
Maximum number of nodes in a binary tree of height ‘h’ is 2h-1
So Number of nodes in a segment tree = 2⌈ Log2n ⌉ + 1 -1
Equals to 2*2⌈ Log2n ⌉ -1
What is happening here is, if you have an array of n elements, then the segment tree will have a leaf node for each of these n entries. Thus, we have (n) leaf nodes, and also (n-1) internal nodes.
Total number of nodes= n + (n-1) = 2n-1 Now, we know its a full binary tree and thus the height is: ceil(Log2(n)) +1
Total no. of nodes = 2^0 + 2^1 + 2^2 + … + 2^ceil(Log2(n)) // which is a geometric progression where 2^i denotes, the number of nodes at level i.
Formula of summation G.P. = a * (r^size - 1)/(r-1) where a=2^0
Total no. of nodes = 1*(2^(ceil(Log2(n))+1) -1)/(2-1)
= 2* [2^ceil(Log2(n))] -1 (you need space in the array for each of the internal as well as leaf nodes which are this count in number), thus it is the array of size.
= O(4 * n) approx..
You may also think of it this way, Let the below be the segment tree:
10
/ \
3 7
/\ /\
1 2 3 4
If the above is you segment tree, then you array of segment tree will be: 10,3,7,1,2,3,4 i.e. 0th element will store the sum of 1st and 2nd entries, 1st entry will store the sum of 3rd and 4th and 2nd will store the sum of 5th and 6th entry!!
Also, the better explanation is: if the array size n is a power of 2, then we have exactly n-1 internal nodes, summing up to 2n-1 total nodes. But not always, we have n as the power of 2, so we basically need the smallest power of 2 which is greater than n. That means this,
int s=1;
for(; s<n; s<<=1);
You may see my same answer here
Oddly enough, I was reading from the same source as the question when I came upon this. I'll try and answer my best.
Let's start with a basic difference in trees representations (in context only):
The almost "Worst Case" scenario. This one is not completely balanced and not really fun to traverse. Why? Because, with different inputs, different trees might be generated and hence time taken to traverse is not very predictable.
Our "Best Case" scenario. This one is totally balanced or complete and will take a predictable amount of time to traverse, always. Moreover, this tree is also better "hacked".
Now let's get back to our question. [Refer to the first image] We know that for every n-input array (The numbers in green), there will be n-1 internal nodes (The numbers in blue). So a maximum of 2n-1 node space must be allocated.
But the code here does something on the contrary. Why and how?
What you expect: You expect that the memory allocated for 2n-1 nodes should be sufficient. In other words, this should be done:
int *st = new int[2*n - 1];
Assuming the rest of the code works well, this is isn't a very good idea. That's because it creates our unbalanced tree, much like in our first case. Such a tree is not easy to traverse nor easy to apply to problem-solving.
What really happens: We add/pad extra memory with null
or 0
values. We do this:
int x = (int)(ceil(log2(n))); //Height of segment tree
int max_size = 2*(int)pow(2, x) - 1; //Maximum size of segment tree
int *st = new int[max_size];
That is we allocate enough space to generate a balanced complete tree. Such a tree is easy to traverse (using some special modifications) and can be applied to problems directly.
How did we allocate enough memory for case 2? Here's how:
We know there are at least three components in our balanced Segment Tree:
We also know that a balanced tree with k leaves will have:
Combining the two we get the desired outcome:
int x = (int)(ceil(log2(n))); //Height of segment tree
int max_size = 2*(int)pow(2, x) - 1; //Maximum size of segment tree
int *st = new int[max_size];
Trivia! Raising 2 to the power of x
above, ensures that we get the nearest ceiling integer which is:
n
(Number of elements in our input array).Segment tree is a full binary tree. But we are representing it in an array. Remember, representing any Binary Tree of height h in an array representation, will require the space equivalent to a Perfect Binary Tree of height h.
[ Maximum possible Height of a Binary Tree with n nodes] (h) = Ceil[ Log_2 (n+1) ] - 1
[ No. of nodes in a Perfect Binary Tree of height h] (n) = 2 ^ (h+1) - 1
The given array will represent the leaves of the segment tree. So. the size of the given array will be the no. of leaves.
In a segment tree, every pair of leaves will be joined by their parent in the previous level. And these parents will again be joined by their parents at the previous level. This goes on until the root.
* Say, if there are 4 leaves in a Binary Tree, then the maximum no. of interior nodes in the Binary Tree will be N-1. So, 3.
- Then the total number of nodes in the Binary Tree = No. of interior nodes + No. of leaves. So, 4+3 = 7.
- The max possible height of this Binary Tree will be 2. Formula: Maximum possible Height of a Binary Tree (h) = Ceil[ Log_2 (n+1) ] - 1 .
- Remember, the total space required in the Segment Tree Array will be nothing but the total no. of nodes of the Perfect Binary Tree at this height.
- So, the total no. of nodes of the Perfect Binary Tree at this height is (n) = 7. Formula: No. of nodes in a Perfect Binary Tree (n) = 2 ^ (h+1) - 1.
- Thus the Segment Tree Array should also be of the size 7.
* But if there is one more leaf, say 5 and remember that this leaf can be anywhere between the beginning of the level till the end of the level.
- Then the total number of nodes in the Binary Tree = No. of interior nodes + No. of leaves. So, 5+4 = 9.
- The max possible height of this Binary Tree will be 3. Maximum possible Height of a Binary Tree (h) = Ceil[ Log_2 (n+1) ] - 1 .
- Remember, the total space required in the Segment Tree Array will be nothing but the total no. of nodes of the Perfect Binary Tree at this height.
- So, the total no. of nodes of the Perfect Binary Tree at this height is (n) = 15. Formula: No. of nodes in a Perfect Binary Tree (n) = 2 ^ (h+1) - 1.
- Thus the Segment Tree Array should also be of the size 15.
Talking generally,
* Say, if there are N leaves in a Binary Tree, then the maximum no. of interior nodes in the Binary Tree will be N-1.
- Then the total number of nodes in the Binary Tree = No. of interior nodes + No. of leaves. So, 2N-1.
- The max possible height of this Binary Tree will be Ceil[ Log_2 (2N) ] - 1. Formula: Maximum possible Height of a Binary Tree (h) = Ceil[ Log_2 (n+1) ] - 1 .
- Remember, the total space required in the Segment Tree Array will be nothing but the total no. of nodes of the Perfect Binary Tree at this height.
- So, the total no. of nodes of the Perfect Binary Tree at this height is (n) = 2 ^ (Ceil[ Log_2 (2N) ] ) - 1. Formula: No. of nodes in a Perfect Binary Tree (n) = 2 ^ (h+1) - 1.
- Thus the Segment Tree Array should also be of the size 2 ^ (Ceil[ Log_2 (2N) ] ) - 1.
- This can also be written as [2*2 ^ (Ceil[ Log_2 (N) ] )] - 1.
Thus, Size of the Segment Tree Array = [2*2 ^ (Ceil[ Log_2 (N) ] )] - 1
Best Case Scenario: (No. of leaves (N) is a power of 2)
Say, the no. of leaves , N is 4.
Since N is a power of 2, the Segment tree will be a Perfect Binary Tree.
So the total no of nodes will be N+N-1 = 2N-1 = 7
So, the size of the Segment Tree Array = 7.
Not the Best Case Scenario: (No. of leaves (N) is not a power of 2)
If the no. of leaves , N is 5.
Since N is not a power of 2, the Segment Tree will need one more entire level to accommodate the extra 1 leaf, as this leaf can be anywhere from the beginning of the level till the end.
We know that in a Perfect binary tree, the no of nodes in every new level, will be equal to No. of all the previous level nodes + 1.
Now, total no. of nodes in the segment tree upto the previous power of 2. i.e. 8 is 8+7 = 15
So, the no. of nodes in the new level will be 15+1 = 16
So, the size of the Segment Tree Array = 15 + 16 = 31.
Talking generally,
Best Case Scenario: (No. of leaves (N) is a power of 2)
Since N is a power of 2, the Segment tree will be a Perfect Binary Tree.
So the total no of nodes will be N+N-1 = 2N-1
So, the size of the Segment Tree Array = 2N-1
Not the Best Case Scenario: (No. of leaves (N) is not a power of 2)
Since N is not a power of 2, the Segment Tree will need one more entire level to accommodate the extra leaves, as this leaf can be anywhere from the beginning of the level till the end.
We know that in a Perfect binary tree, the no of nodes in every new level, will be equal to No. of all the previous level nodes + 1.
Now, total no. of nodes in the segment tree upto the previous power of 2 will be 2N-1.
So, the no. of nodes in the new level will be 2N-1+1 = 2N
So, the size of the Segment Tree Array = 2N + 2N - 1 = 4N - 1 = 4N (approx.)
Thus, Size of the Segment Tree Array = 4N (approx.)
here are some links .. iterative implementation for building segment tree of size 2*n-1 from array of n(any number) length https://www.geeksforgeeks.org/segment-tree-efficient-implementation/ recursive implementation for building segment tree of size 2*n-1 from array of n(any number) length https://www.hackerearth.com/practice/notes/segment-tree-and-lazy-propagation/#c191521
iterative implementation for building segment tree of size less than 4*n from array of n(any number) https://codeforces.com/blog/entry/18051
Segment tree will be a full-binary tree where all leaves will denote the element in your input array. And as mention here
The number of nodes n in a full binary tree, is at least n = 2h+1 and at most n = 2^{h+1} - 1, where h is the height of the tree. And h = log_2n .
Note - log_2n indicates log base 2
Here is the python code for finding out maximum number of nodes in segment tree -
from math import pow, log, ceil
def initialize_seg_tree(input_arr):
n = len(input_arr)
height = ceil(log(n, 2))
# max_nodes = 2^(h+1) - 1, where h = log(n) // base 2
seg_tree_size = int(pow(2, height + 1) - 1)
seg_tree_arr = empty_1d_array(seg_tree_size)
return seg_tree_arr