Reproducible Quantitative Methods

Lesson 7

Programming in R, continued /Authorship and citation practices for non-manuscript research products

yeah bar

Topics and Resources

  1. Even more with R (writing simple software)

    Review everything you did last week, and do some more! This is where things start getting very freeform and directed by your data- start really working towards the scientific questions your students asked of the data.

    Depending on where you are with your data and analysis, we may want to introduce plotting with ggplot2- we can work on this in the seminar.

  2. Authorship in the digital age

    We talked about the 'traditional' definition of authorship on scienctific products a couple weeks ago, but now that we're a little deeper, you can see there's (even) more layers than what we discussed. Please complete the readings below and we'll have a class discussion on Wednesday.

Exercises

  1. Loops, Conditionals and functions
  2. And student directed analyses! Look! See what you find! This is the fun part :)

Discussion

Authorship on data, authorship in collaboration

Now that you're a little deeper into the course, you're ready for a more in-depth discussion about scientific authorship. The readings here are meant to present some extreme views and really provoke discussion. Please read these readings prior to class on Thursday.

Readings

Pubwication of software papers, and authorship on them. This is a blog post by Titus Brown, a professor of data-intensive biology at UC Davis. Titus’ group built a software product, khmer, out in the open, and invited contributions from the community. Long story short, he awarded authorship to everyone who contributed when he published a paper on the subject, no matter what the contribution was. This sparked a big debate in the community about what authorship really meant.

Data Sharing (an editorial from the New England Journal of Medicine).This editorial presents another view of authorship- namely, making the argument that data is essentially owned by those who collected it, and that no derivative works should be created without the data owner’s permission.

A Fundamental Difference of Opinion is blog post by me in response to the above editorial.

Questions

Why is there a discrepancy between the way credit is offered to data and software?

Are data and software research products?

What is the best way to credit data and software creators?

Previous Lesson | Home | Next Lesson