AttributeError: 'Series' object has no attribute 'reshape'

前端 未结 3 1640
情歌与酒
情歌与酒 2021-02-03 17:45

I\'m using sci-kit learn linear regression algorithm. While scaling Y target feature with:

Ys = scaler.fit_transform(Y)

I got

相关标签:
3条回答
  • 2021-02-03 18:05

    The solution is indeed to do:

    Y.values.reshape(-1,1)

    This extracts a numpy array with the values of your pandas Series object and then reshapes it to a 2D array.

    The reason you need to do this is that pandas Series objects are by design one dimensional. Another solution if you would like to stay within the pandas library would be to convert the Series to a DataFrame which would then be 2D:

    Y = pd.Series([1,2,3,1,2,3,4,32,2,3,42,3])
    
    scaler = StandardScaler()
    
    Ys = scaler.fit_transform(pd.DataFrame(Y))
    
    0 讨论(0)
  • 2021-02-03 18:20

    Solution was linked on reshaped method on documentation page.

    Insted of Y.reshape(-1,1) you need to use:

    Y.values.reshape(-1,1)
    
    0 讨论(0)
  • 2021-02-03 18:29

    You cannot reshape a pandas series, so you need to perform the operation on a numpy array. As others have suggested, you can use y.values.reshape(-1, 1), but if you want to impress your friends, you can use:

    y.values[Ellipsis, None]
    

    Which is equivalent to:

    y.values[..., None]
    

    It basically means all dimensions as they where, then a new dimension for the last one. Here's a fully working example:

    import numpy as np
    import pandas as pd
    from sklearn.preprocessing import StandardScaler
    
    y = pd.Series(np.random.rand(5))
    
    0    0.497165
    1    0.818659
    2    0.327064
    3    0.772548
    4    0.095715
    dtype: float64
    
    scaler = StandardScaler()
    
    scaler.fit_transform(y.values[Ellipsis, None])
    
    array([[-0.019],
           [ 1.165],
           [-0.645],
           [ 0.995],
           [-1.496]])
    
    0 讨论(0)
提交回复
热议问题