Reconciliation between two List of type List

Question

I am attempting to reconcile or match the data between two lists of maps of type List> query1results and query2results.

We will refer to these datasets as left-hand side (LHS) for query1, and right-hand side (RHS) for query2. The goal is to record how many matches we have and how many breaks we have.

Records in both the LHS and the RHS dataset are considered matches. I would like to maintain matching maps from both LHS and RHS.
Records in LHS that are not in the RHS are right-hand-breaks.
Whatever remains unmatched in the RHS after the first two passes are left-hand-breaks.

Here is my code with some of my previous attempts.

ATTEMPT #1 -> Only does LHS matching, and no breaks:

@Override
public void reconcile(LocalDate date) {
    List> query1Records = executeQuery1(date).collect(Collectors.toList());
    List> query2Records = executeQuery2(date).collect(Collectors.toList());
    
    List> matching = query1Records.parallelStream().filter(searchData ->
            query2Records.parallelStream().anyMatch(inputMap ->
                searchData.get("instrument").equals(inputMap.get("instrument"))
                    && String.valueOf(searchData.get("entity")).equals(inputMap.get("entity"))
                    && searchData.get("party").equals(inputMap.get("party"))
                    && ((BigDecimal) searchData.get("quantity")).compareTo((BigDecimal) inputMap.get("quantity")) == 0))
        .collect(Collectors.toList());
    
}

ATTEMPT #2 -> Should only match if all values match on LHS and RHS

List keys = Arrays.asList("entity", "instrument", "party", "quantity");

Function, List> getKey = m -> 
    keys.stream().map(m::get).collect(Collectors.toList());

Map, Map> bpsKeys = query1Records.stream()
    .collect(Collectors.toMap(
        getKey,
        m -> m,
        (a, b) -> {
            throw new IllegalStateException("duplicate " + a + " and " + b);
        },
        LinkedHashMap::new));

List> matchinRecords = query2Records.stream()
    .filter(m -> bpsKeys.containsKey(getKey.apply(m)))
    .collect(Collectors.toList());

matchinRecords.forEach(m -> bpsKeys.remove(getKey.apply(m)));
List> notMatchingRecords = new ArrayList<>(bpsKeys.values());

Note: some of the keys need to be ignored during comparison.

Alexander Ivanchenko · Accepted Answer

Here's another solution for the case when during comparison of the two maps, some of the keys have to be ignored (i.e. their values should not be taken into account while determining if a map from the first dataset has a matching map in the second dataset).

We can start by creating two intermediate maps having a list List as a key, comprised of values mapped to keys that should be compared.

And since while generating the intersection of two datasets we need to include matching maps from both we can make use of the built-in Collector partitioningBy(). After partitioning both data sets we would be able to obtain intersection and difference.

Implementation might look like that:

public static void main(String args[]) {
    List> lhs = executeQuery1(date).collect(Collectors.toList()); // .toList() for Java 16+
    List> rhs = executeQuery2(date).collect(Collectors.toSet());
    
    //TODO: Dynamically generate, pull out ID field so not used in reconciliation
    List keys = Arrays.asList("entity", "instrument", "party", "quantity");
    
    Function, List> getKey = m ->
        keys.stream().map(m::get).collect(Collectors.toList());
    
    Map, Map> lhsMap = groupByKey(lhs, getKey);
    Map, Map> rhsMap = groupByKey(rhs, getKey);
    
    Map>> intersectionDiffLeft = lhsMap.entrySet().stream() // would contain those maps from LHS that has matching maps in the RHS mapped to TRUE (difference would be mapped to FALSE)
        .collect(Collectors.partitioningBy(
            entry -> rhsMap.containsKey(entry.getKey()),
            Collectors.mapping(Map.Entry::getValue,
                Collectors.toList())
        ));

    Map>> intersectionDiffRight = rhsMap.entrySet().stream() // would contain those maps from RHS that has matching maps in the LHS mapped to TRUE (difference would be mapped to FALSE)
        .collect(Collectors.partitioningBy(
            entry -> lhsMap.containsKey(entry.getKey()),
            Collectors.mapping(Map.Entry::getValue,
                Collectors.toList())
        ));

    List> intersection = new ArrayList<>();
    intersection.addAll(intersectionDiffLeft.get(true));
    intersection.addAll(intersectionDiffRight.get(true));

    List> difference = new ArrayList<>();
    difference.addAll(intersectionDiffLeft.get(false));
    difference.addAll(intersectionDiffRight.get(false));
}

public static Map, Map> groupByKey(Collection> source,
                                                           Function, List> getKey) {        
    return source.stream()
        .collect(Collectors.toMap(
            getKey,
            Function.identity(),
            (a, b) -> {
                throw new IllegalStateException("duplicate " + a + " and " + b);
            }
        ));
}

Reconciliation between two List of type List<Map<String, Object>>

Answers (2)

Related Questions

Reconciliation between two List of type List&lt;Map&lt;String, Object&gt;&gt;

Answers (2)

Related Questions

Reconciliation between two List of type List<Map<String, Object>>