---
title: "Related work and grouper"
output:
  rmarkdown::html_vignette:
    toc: true
    toc_depth: 2
vignette: >
  %\VignetteIndexEntry{Related work and grouper}
  %\VignetteEngine{knitr::rmarkdown}
  %\VignetteEncoding{UTF-8}
bibliography: references.bib
--- 


```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)
```

# Introduction

In this vignette, we provide a brief, non-exhaustive overview of related work to 
assign students to projects, in an educational setting. Finally, we introduce 
our `grouper` package.

## Balancing staff workload and student preferences

One of the earlier papers found was by @anwar2003student. The goal of the educator
in that paper was to assign students to two types of project: individual ones 
and group ones. For the individual projects, students were asked to rank their top
four projects from a list of project briefs. Each student was then assigned to a 
project and a corresponding staff member.

The assignment was carried out using two models. In the first model, the
objective was to minimise the total number of projects that each staff managed.
In the second model, the feasible solutions from the first model were used to
decide upon an assignment that maximised the number of students obtaining their
first choice projects. 

Compared to the models in `grouper`, the primary difference is in the objective
function. In our case, we only work with the student preference; there is no
need to balance staff workload since the projects are run within a single
course.

For the group project, students were asked to rank their top 3 projects
*individually*. They were then allocated groups that maximised their preferences.
Compared to the model in our package, this approach does not allow for
self-formed groups, or for larger projects that involve sub-groups.

## OptAssign

In many ways, the model described in @meyer2009optassign is similar to ours. The
objective function there maximises student preference for project topics. The
model in the paper also discusses an extension whereby some groups of students
can be favoured for particular topics. This is carried out through the inclusion
of weights in the objective function. Another additional feature adds
contstraints to ensure that students with certain skills are included in
particular topics. For instance, a project topic on "Marketing surveys" may
require at least two students with a statistics background in the group.

Overall, this model formulation is very similar to the one in our package.
Again, one difference is in the affordance for self-formed groups, and for
sub-groups in the project team.

## Students to elective courses

This paper @berovs2015integer deals with assignment of students to elective
courses in Croatia. In short, the institution offers a fixed number of such
courses. Every student must be enrolled in a pre-determined number of courses.
Each student selects and ranks a certain number of electives. The integer
programming solution that is derived here maximises the total student
satisfaction, based on these rankings.

## Assigning students to groups based on traits

The master's thesis from @mcguirk2020assigning describes a very complex integer
linear programming model. The assignment was used to assign students to
discussion groups at the University of New Hampshire chemistry department. The
complete objective function can be written as:

\begin{eqnarray}
\text{Minmise:}\; \text{preferences} + \text{sizes} + \text{single-trait} + \text{shared-trait}
\end{eqnarray}

The first component, preferences, incorporates the ranking that put students put 
down for a group. The second component was used to control the overall group size. Note 
that in our formulation, this appears as a constraint. The third component arises 
as a result of the educators experience in teaching this course. They realised that 
if a student is the only one with a certain trait, e.g. gender, year-of-study, etc.,
that student would be disinclined to participate in discussions. This is a very 
interesting point! The final portion of the objective function applies a penalty if
all students in the group share the same trait.

In our formulation, we have avoided using multi-objective functions for the most
part. The reason for this decision is that it is very difficult to decide how each
component should be weighted. It typically requires a further step of post-assignment
sensitivity analysis.

Another similarity with our findings is the recommendation to use Gurobi over 
open source solvers.

## Student-to-project supervisor assignment

The next article we discuss is @ramotsisi2022optimization. The formulation of this 
model was the result of a thorough investigation into the process by which 
students are assigned projects that are supervised by faculty members at the 
University of Botswana. However, similar to @anwar2003student, the objective 
function in this study aims to maximise the total workload of supervisors. It 
does not approach the problem solely from the angle of student preferences.

## Automated group assignments in academic setting

In the final work that we discuss, @bonfert2020improving use a probabilistic 
weighted optimisation routine to perform the group assignment, based on survey 
input data. Questions in the survey pertain to scheduling, desired attributes 
of team members, and indications of whether a student may be isolated. For each
question, a homogeneity score is computed using the answers from all members of a 
group. This score is summed over all questions to result in a "fitness" score 
for each group. A probabilistic hill-climbing algorithm is then used to maximise 
the average fitness score. Out of the papers discussed on this page, this is 
the only one that does not use an integer linear programming model.

# Introducing `grouper` package

`grouper` provides two distinct optimisation models, both of which are are
formulated as integer linear programs.

The Preference-Based Assignment (PBA) model allows educators to assign student
groups to topics primarily to maximise overall student preferences for those
topics. The topics can be viewed as project titles. The model allows for
repetitions of each project title. It is also possible to allow each project
team to be comprised of multiple sub-groups in this formulation.

To execute the optimisation routine, an instructor needs to prepare:

1.  A group composition table listing the member students within each
    self-formed group.
2.  A preference matrix containing the preference that each self-formed 
    group has for each topic.
3.  A YAML file defining the remaining parameters of the model.

The Diversity-Based Assignment (DBA) model enables educators to assign students
to groups and topics with the dual aims of maximising diversity (based on
student attributes) within groups and balancing specific skill levels across
different groups. These are formulated in a single objective function with
weights assigned to each component.

To execute the DBA optimisation routine, the instructor needs to prepare:

1.  A group composition table containing:
    *   the member students within each self-formed group,
    *   the demographics that will be used to compute pairwise dissimilarity
        between students, and
    *   a numeric measure of each student's skill.
2.  A YAML file defining the remaining parameters of the model.

Refer to the remaining vignettes for more information about the model definition
and application. 

In our package, we use the `ompr` framework for optimisation, which means that
you are free to use any solver. In our small examples, we demonstrate the
package using the `gplk` solver. However, for more complicated problems, we
highly recommend the `gurobi` solver. There is a free academic license for this 
tool. However, the `ROI.plugin.gurobi` and `gurobi` packages are not on CRAN. The 
following links will aid in installing the `gurobi` package:

* [Installing the R package gurobi](https://docs.gurobi.com/projects/optimizer/en/current/reference/r/setup.html)
* [Guide to installing gurobi optimiser](https://cran.r-project.org/package=prioritizr/vignettes/gurobi_installation_guide.html)

The following github repository provides instructions on install the gurobi solver:

* [Installing `ROI.plugin.gurobi`](https://github.com/roigrp/ROI.plugin.gurobi)


# References