One Hot Encoding using numpy [duplicate]

前端未结

关注

 9  1404

终归单人心

相关标签:

9条回答

猫巷女王i

2020-12-05 00:10
You could use List comprehension:
```
[0 if i !=5 else 1 for i in range(10)]
```
turns to
```
[0,0,0,0,0,1,0,0,0,0]
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
醉话见心

2020-12-05 00:10
The problem here is that you save your array nowhere. The put function works in place on the array and returns nothing. Since you never give your array a name you can not address it later. So this
```
one_pos = 5
x = np.zeros(10)
np.put(x, one_pos, 1)
```
would work, but then you could just use indexing:
```
one_pos = 5
x = np.zeros(10)
x[one_pos] = 1
```
In my opinion that would be the correct way to do this if no special reason exists to do this as a one liner. This might also be easier to read and readable code is good code.
0 讨论(0)
发布评论:

提交评论
- 加载中...
一生所求

2020-12-05 00:12
Usually, when you want to get a one-hot encoding for classification in machine learning, you have an array of indices.
```
import numpy as np
nb_classes = 6
targets = np.array([[2, 3, 4, 0]]).reshape(-1)
one_hot_targets = np.eye(nb_classes)[targets]
```
The one_hot_targets is now
```
array([[[ 0.,  0.,  1.,  0.,  0.,  0.],
        [ 0.,  0.,  0.,  1.,  0.,  0.],
        [ 0.,  0.,  0.,  0.,  1.,  0.],
        [ 1.,  0.,  0.,  0.,  0.,  0.]]])
```
The .reshape(-1) is there to make sure you have the right labels format (you might also have [[2], [3], [4], [0]]). The -1 is a special value which means "put all remaining stuff in this dimension". As there is only one, it flattens the array.

Copy-Paste solution
```
def get_one_hot(targets, nb_classes):
    res = np.eye(nb_classes)[np.array(targets).reshape(-1)]
    return res.reshape(list(targets.shape)+[nb_classes])
```
Package

You can use mpu.ml.indices2one_hot. It's tested and simple to use:
```
import mpu.ml
one_hot = mpu.ml.indices2one_hot([1, 3, 0], nb_classes=5)
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
梦如初夏

2020-12-05 00:14
I'm not sure the performance, but the following code works and it's neat.
```
x = np.array([0, 5])
x_onehot = np.identity(6)[x]
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
执笔经年

2020-12-05 00:15
Use np.identity or np.eye. You can try something like this with your input i, and the array size s:
```
np.identity(s)[i:i+1]
```
For example, print(np.identity(5)[0:1]) will result:
```
[[ 1.  0.  0.  0.  0.  0.  0.  0.  0.  0.]]
```
If you are using TensorFlow, you can use tf.one_hot: https://www.tensorflow.org/api_docs/python/array_ops/slicing_and_joining#one_hot
0 讨论(0)
发布评论:

提交评论
- 加载中...

南旧

2020-12-05 00:22

import time
start_time = time.time()
z=[]
for l in [1,2,3,4,5,6,1,2,3,4,4,6,]:
    a= np.repeat(0,10)
    np.put(a,l,1)
    z.append(a)
print("--- %s seconds ---" % (time.time() - start_time))

#--- 0.00174784660339 seconds ---

import time
start_time = time.time()
z=[]
for l in [1,2,3,4,5,6,1,2,3,4,4,6,]:
    z.append(np.array([int(i == l) for i in range(10)]))
print("--- %s seconds ---" % (time.time() - start_time))

#--- 0.000400066375732 seconds ---

0 讨论(0)

1 2 下一页

热议问题