Likert in R with unequal number of factor levels

Question

I have some survey data on a 5-point Likert scale. However, in some response columns, some factor levels are missing. Here is the data:

Increased student engagement ,Instructional time effectiveness increased,Increased student confidence,Increased student performance in class assignments,Increased learning of the students,Added unique learning activities

Strongly agree,Strongly agree,Strongly agree,Strongly agree,Strongly agree,Strongly agree

Neither agree nor disagree,Neither agree nor disagree,Neither agree nor disagree,Neither agree nor disagree,Neither agree nor disagree,Neither agree nor disagree

Disagree,Strongly disagree,Neither agree nor disagree,Disagree,Disagree,Neither agree nor disagree

As you can see, some response columns have missing levels; e.g. in the first column, Agree and Strongly disagree are missing (for simplicity, I have pasted a subset of the actual data set).

I am using the following code in R:

library(readxl)  # read_excel()
library(likert)  # likert()
facultyData <- read_excel("FacultyResponsesForR.xlsx")
facultyData[] <- lapply( facultyData, factor)
facultyData[1:6] <- lapply( facultyData[1:6], factor, levels=1:5)
likertData <- likert(facultyData, nlevels = 5)
plot(likertData)

However, this is leading to the following error:

Error in mean(as.numeric(items[, i]), na.rm = TRUE) : 
  (list) object cannot be coerced to type 'double'

I have tried the solution mentioned in other posts (the one in the commented-out line of code, facultyData[] <- lapply(facultyData[], factor, levels=1:5)), but it doesn't work either.

Apparently, before executing this lapply, the data contains:

# A tibble: 14 × 1
   `Increased student engagement`
                           
1                  Strongly agree
2                           Agree
3                           Agree
4                           Agree
5                           Agree
6                           Agree
7                           Agree
8                           Agree
9                           Agree
10     Neither agree nor disagree
11     Neither agree nor disagree
12     Neither agree nor disagree
13     Neither agree nor disagree
14                       Disagree

After executing it, the data is overwritten with NA values. Why is this happening?

> facultyData[1:6] <- lapply( facultyData[1:6], factor, levels=1:5)
> facultyData[,1]
# A tibble: 14 × 1
   `Increased student engagement`
                           
1                              NA
2                              NA
3                              NA
4                              NA
5                              NA
6                              NA
7                              NA
8                              NA
9                              NA
10                             NA
11                             NA
12                             NA
13                             NA
14                             NA

After changing the code as follows, the data is retained (it doesn't become NA), yet I get the same error:

mylevels <- c('Strongly disagree', 'Disagree', 'Neither agree nor disagree', 'Agree', 'Strongly agree')
facultyData <- read_excel("FacultyResponsesForR.xlsx")
facultyData[] <- lapply( facultyData, factor)
facultyData[1:6] <- lapply( facultyData[1:6], factor, levels=mylevels)

This solution doesn't work for me either: https://github.com/jbryer/likert/blob/master/demo/UnusedLevels.R

Evan Friedland · Accepted Answer

Rewriting your data was no fun, and this took a bit to figure out but I think this will help you. Someone may have a shorter way. Let me know if it helps.

df <- rbind(c("Strongly agree","Strongly agree","Strongly agree","Strongly agree","Strongly agree","Strongly agree"),
            c("Neither agree nor disagree","Neither agree nor disagree","Neither agree nor disagree","Neither agree nor disagree","Neither agree nor disagree","Neither agree nor disagree"),
            c("Disagree","Strongly disagree","Neither agree nor disagree","Disagree","Disagree","Neither agree nor disagree"))
df <- as.data.frame(df)
colnames(df) <- c("Increased student engagement", "Instructional time effectiveness increased", "Increased student confidence", "Increased student performance in class assignments", "Increased learning of the students", "Added unique learning activities")

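# Lookup table pairing each integer code with its label, in scale order
lookup <- data.frame(levels = 1:5, mylabels = c('Strongly disagree', 'Disagree', 'Neither agree nor disagree', 'Agree', 'Strongly agree'))

# Replace each text response with its integer code (position in the lookup)
df.1 <- as.data.frame(apply(df, 2, function(x) match(x, lookup$mylabels)))
# Rebuild every column as a factor carrying all five levels, used or not
df.new <- as.data.frame(lapply(as.list(df.1), factor, levels = lookup$levels, labels = lookup$mylabels))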

str(df.new)
'data.frame':   3 obs. of  6 variables:
 $ Increased.student.engagement                      : Factor w/ 5 levels "Strongly disagree",..: 5 3 2
 $ Instructional.time.effectiveness.increased        : Factor w/ 5 levels "Strongly disagree",..: 5 3 1
 $ Increased.student.confidence                      : Factor w/ 5 levels "Strongly disagree",..: 5 3 3
 $ Increased.student.performance.in.class.assignments: Factor w/ 5 levels "Strongly disagree",..: 5 3 2
 $ Increased.learning.of.the.students                : Factor w/ 5 levels "Strongly disagree",..: 5 3 2
 $ Added.unique.learning.activities                  : Factor w/ 5 levels "Strongly disagree",..: 5 3 3
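
As for why the original lapply call produced only NA values: factor() drops any value that is not listed in levels, and none of the text labels equals "1" through "5". Below is a minimal sketch of that behaviour and of a shorter conversion that passes the labels as levels directly. The vector responses and the column names q1 and q2 are hypothetical stand-ins for your data. The commented-out lines at the end rest on an assumption I have not verified against your file: that likert() needs a plain data.frame, and that the tibble returned by read_excel is what triggers the "(list) object cannot be coerced" error.

```r
# Hypothetical stand-in for one survey column (character labels)
responses <- c("Strongly agree", "Neither agree nor disagree", "Disagree")

mylevels <- c("Strongly disagree", "Disagree", "Neither agree nor disagree",
              "Agree", "Strongly agree")

# levels = 1:5 matches none of the text labels, so every value becomes NA
bad <- factor(responses, levels = 1:5)

# levels = mylevels keeps the values and registers all five levels,
# including those that never occur in this column
good <- factor(responses, levels = mylevels)

# Apply the same conversion to every column of a plain data.frame
df <- data.frame(q1 = responses, q2 = rev(responses), stringsAsFactors = FALSE)
df[] <- lapply(df, factor, levels = mylevels)

# Assumption: likert() expects a plain data.frame, so convert the tibble first.
# facultyData <- as.data.frame(facultyData)
# plot(likert::likert(facultyData))
```

If that assumption holds, converting with as.data.frame() before calling likert() should be enough once every column shares the same five levels.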
