Linear Optimization with duplication allowed / issue with adding constraints

Question

The problem I'm trying to solve is fairly complicated. It doesn't have to be an exact solution but more so, one that's as close as I can get.

The overview of the problem is as follows, I have 125,000 rows of different combinations of names. Sample of that data listed below:

0          James Harden  Jalen Suggs       Josh Giddey    Scottie Barnes    Christian Wood   Shai Gilgeous-Alexander  Michael Porter Jr.           Nikola Vucevic
1          James Harden  Jalen Suggs        Kyle Kuzma    Scottie Barnes    Christian Wood   Shai Gilgeous-Alexander         Josh Giddey           Nikola Vucevic
2           Jalen Suggs  Jalen Green       Josh Giddey     Tobias Harris    Christian Wood              James Harden  Michael Porter Jr.  Shai Gilgeous-Alexander
3           Jalen Suggs  Jalen Green    Scottie Barnes  Domantas Sabonis    Christian Wood              James Harden         Josh Giddey  Shai Gilgeous-Alexander
4           Jalen Suggs  Jalen Green        Kyle Kuzma     Tobias Harris    Christian Wood              James Harden         Josh Giddey  Shai Gilgeous-Alexander

I also have a list of the unique names and the maximum amount of times they can appear in the output subset. Example below:

79              Jalen Suggs    PG/SG        ORL          43.0
78              Josh Giddey       SF        OKC          41.0
33  Shai Gilgeous-Alexander       PG        OKC          39.0
27           Christian Wood        C        HOU          36.0
69              Jalen Green    PG/SG        HOU          32.0

The solver would choose the subset of rows until it meets all constraints above. In this example, 43 rows of the output subset would have "Jalen Suggs" in them, 41 would have "Josh Giddey", so on and so forth. Eventually it would come to a solution that meets all of the name constraints and output a subset of rows that I can use. The kicker to all of this is, the same row CAN be used multiple times if it fits within the name constraints. If we simplify the problem to a super basic example, let's say all I needed was the solver to find me 1 lineup with both "Jalen Suggs" & "Josh Giddey" and return me that row.

I'm hoping my output looks identical to my first snippet of code, except all constraints from my second snippet are met.

As far as code goes, I haven't got very far, but here's what I have:

prob = LpProblem('LineUp Optimization', LpMaximize)
## make all rows into nested list
lineups = df.values.tolist()

cap = dict(zip(players, player_df['Cap']))

lineup_var = LpVariable.dicts('Lineups', lineups, 0, cat='Integer')

Maybe I'm going about this problem in the wrong way, I'm not sure.

Linear Optimization with duplication allowed / issue with adding constraints

Answers (1)

Related Questions