Confusion Between 'sample' and 'rbinom' in R

Question

Why are these not equivalent?

#First generate 10 numbers between 0 and .5
set.seed(1)
x <- runif(10, 0, .5)

These are the two statements I'm confused by:

#First    
sample(rep(c(0,1), length(x)), size = 10, prob = c(rbind(1-x,x)), replace = F)
#Second
rbinom(length(x), size = 1, prob=x)

I was originally trying to use 'sample'. What I thought I was doing was generating ten (0,1) pairs, then assigning the probability that each would return either a 0 or a 1.

The second one works and gives me the output I need (trying to run a sim). So I've been able to solve my problem. I'm just curious as to what's going on under the hood with 'sample' so that I can understand R better.

IRTFM · Accepted Answer

The first area of difference is the location of the length of the vector specification in the parameter list. The names size have different meanings in these two functions. (I hadn't thought about that source of confusion before, and I'm sure I have made this error myself many times.)

The random number generators (starting with r and having a distribution suffix) have that choice as the first parameter, whereas sample has it as the second parameter. So the length of the second one is 10 and the length of the first is 1. In sample the draw is from the values in the first argument, while 'size' is the length of the vector to create. In the rbinom function, n is the length of the vector to create, while size is the number of items to hypothetically draw from a theoretical urn having a distribution determined by 'prob'. The result returned is the number of "ones". Try:

rbinom(length(x), size = 10, prob=x)

Regarding the argument to prob: I don't think you need the c().

Confusion Between 'sample' and 'rbinom' in R

Answers (2)

Related Questions

Confusion Between &#39;sample&#39; and &#39;rbinom&#39; in R

Answers (2)

Related Questions

Confusion Between 'sample' and 'rbinom' in R