Create species matrix with abundance (counts)

Question

I have a dataset (as.data.frame) like this one:

Site	Species	Count
a	Abies	14
b	Alnus	1
c	Pinus	1
c	Artem	2
n	...	...

, n of sites = 26000. I need to convert it into a matrix like this one in R:

	Abies	Alnus	Pinus	Artem
a	14	0	0	0
b	0	1	0	0
c	0	0	1	2
n	...	...	...	...

I came across the 'fossil' package, with the create.matrix fuction. This function creates the matrix I need but only with the presence (1) or absence (0) of each species for each site. However, I need the abundance (count), not the presence-absence (1-0).

Alejandro Ruiz-Garc&#237;a · Accepted Answer

I hope I'm not too late to answer your question.

If you type ?create.matrix in the RStudio console you can get the documentation about the function. There it's said that you can actually use your original raw data to make an abundance matrix, but you have to include a couple of extra arguments(tax.name to indicate the species names, locality to indicate the sites, abund.col to indicate the count of each species and abund = TRUE just to let the function know we're working with abundance data).

In your case...

df <- create.matrix(x, tax.name = "Species",
   locality = "Site",
   abund.col = "Count",
   abund = TRUE)

Where x is the name of your data.frame containg those three columns (Site, Species and Count). However, this will create a data.frame where the rows are the species and the columns are the sites. If you want to transpose it, just use the function t(df) to change the species to the columns and the sites to the rows!

Hope this was helpful, also you can check the rest of the documentation right here.

Also it is important to know that the output of the function create.matrix is not a data.frame, so you might want to convert it to a data frame using as.data.frame while doing the transposition...

abundance.matrix <- as.data.frame(t(df))

Create species matrix with abundance (counts)

Answers (2)

Related Questions