Nothing to see here… move along (regression discontinuity edition)

2020 January 9
by Daniel Lakeland

From Gelman’s blog, he shows yet another regression discontinuity. Apparently people have never heard of the Runge phenomenon, or the basis for why it happens. Here’s some R code, and a PDF of the output…

## regression discontinuity such as:

#generally is garbage that ignores what's essentially a correlary to
##the well known Runge phenomenon... we demonstrate here.



datasets = list()
for (i in 1:20) {
    datasets[[i]] = data.frame(x=runif(20,0,2),y=rt(20,5))

plotgraph = function(d){
    g = ggplot(d,aes(x,y)) + geom_point() + geom_smooth(data=d[d$x < 1,],method="lm") + geom_smooth(data=d[d$x >= 1,],method="lm")

graphs = lapply(datasets,plotgraph)

In almost every plot there is “something going on” at the discontinuity, either the level of the function has changed, or the slope, or both. And yet, the whole thing is random t-distributed noise…

I don’t know what that paper did to calculate its p values, but it probably wasn’t simulations like this, and it should have been.

2 Responses leave one →
  1. January 10, 2020

    There’s a typo in the code. See if you can find it without running it!

    (I have gotten as far as downloading NetLogo but I’m finding myself very distracted lately.)

    • Daniel Lakeland
      January 10, 2020

      Weird, I fixed it, 1:20 got converted to 1:0 somehow, I must have hit backspace and it went into the wrong window at some point. Obviously the code worked fine at the point when I ran it, so it happened after generating the PDFs.

      NetLogo is cool, check it out.

Leave a Reply

Note: You can use basic XHTML in your comments. Your email address will never be published.

Subscribe to this comment feed via RSS