How to mark slope changes in LOESS curve using ggplot2?

前端 未结 2 864
无人共我
无人共我 2020-12-29 12:54

I have some time-series data that I\'m fitting a loess curve in ggplot2, as seen attached. The data takes the shape of an \"S\" curve. What I really need to find out is the

相关标签:
2条回答
  • 2020-12-29 13:27

    It all depends on what you mean by "where the data starts to level off". You need to put this into math. LOESS curves can be really bumpy depending on what bandwidth you use. You might want to modify the line below the comment marked "line A" to specify what you mean. For example, you might want to look at more than just the first previous value. You could, for example, look at the sum of the previous 5 the_diff values.

    library(ggplot2)
    christi <- read.table("stover_data.txt",header=TRUE)
    the_fit  = loess(org_count ~ date, data=christi)
    pred = predict(the_fit, christi, se=FALSE) #could change data with larger grid
    with_loess <- cbind(christi,pred)
    
    p<-qplot(date,org_count, data=christi)
    the_plot <- p+stat_smooth(method="loess",size=1.5)
    
    the_diff <- diff(with_loess$pred)
    
    tolerance <- .1
    
    #line A: the following line is what you want to modify.
    vline <- min(with_loess$date[the_diff < -tolerance])
    new_plot <- the_plot + geom_vline(xintercept=vline)
    

    plot with vline

    for example, you could do the following (replace the last part of the above code starting with the the_diff line)

    the_diff <- diff(with_loess$pred, lag=20)
    
    tolerance <- 1
    vline <- min(with_loess$date[the_diff < -tolerance])
    new_plot <- the_plot + geom_vline(xintercept=vline)
    

    enter image description here

    Also note that you might want to shift the the_diff vector depending on what you mean by "start to level off" (ie in the future it's going to level off, or it's already starting to level off, etc.). Also remember that the_diff is a shorter by the length of its lag argument than your dataset.

    0 讨论(0)
  • 2020-12-29 13:28

    If you are asking for a way of determining the point where the curve is a maximum (i.e. flat), this is the same as finding the point where the slope of the line is at its maximum (from basic calculus).

    First, read your data:

    christi <- read.table("http://dl.dropbox.com/u/75403/stover_data.txt", sep="\t", header=TRUE)
    

    Next, use loess to fit a smoothed model:

    fit <- loess(org_count~date, data=christi)
    

    Then, predict the values in your range of x-values (with predict.loess), determine the slope (diff is close enough`), and find the

    x <- 200:800
    px <- predict(fit, newdata=x)
    px1 <- diff(px)
    
    which.max(px1)
    [1] 367
    

    Since the start value of x is 200, this means the curve is flat at position 200+367=567.


    If you wanted to plot this:

    par(mfrow=c(1, 2))
    plot(x, px, main="loess model")
    
    plot(x[-1], px1, main="diff(loess model)")
    abline(v=567, col="red")
    

    enter image description here

    0 讨论(0)
提交回复
热议问题