Array intersections and differences
In maths and computer science we sometimes use sets, unordered collections of unique entries. NSMutableSet
has good basic support for set-operations like unions, intersections, relative complements (also called difference) with operations like unionSet:
, minusSet:
and intersectSet:
. There is also support for subsets.
The other extremely common data type is an array, an ordered collection of entries. We don’t usually talk about “array intersections” or “array complements” but they can sometimes come to good use. While there is no built in methods for this, NSPredicate
makes it really easy.
It is not clearly defined if the union of two arrays should contain duplicates or not. As we will see, we can easily achieve both.
Examples
Imagine that we have two arrays: (3,4,5,6,7) and (0,2,4,6,8) and what to know the intersection and the relative complement between these.
Array intersection
The intersection of two collections is the entries that exist in both of them.
With arrays this can be achieved by filtering either of the arrays (preferably the smaller for performance reasons) to only include objects that exist in the other array. A predicate that will do this reads as clearly as this:
NSPredicate *intersectPredicate =
[NSPredicate predicateWithFormat:@"SELF IN %@", otherArray];
NSArray *intersect =
[firstArray filteredArrayUsingPredicate:intersectPredicate];
Relative complement
For the relative complement (the difference) the order matters. It helps to think of it as a subtraction, we remove the common entries from either array.
This is only a slight modification from the example above. This time we want to keep the entries that does not exist in the other array:
NSPredicate *relativeComplementPredicate =
[NSPredicate predicateWithFormat:@"NOT SELF IN %@", otherArray];
NSArray *relativeComplement =
[firstArray filteredArrayUsingPredicate:relativeComplementPredicate];
Union
The union of two sets is the entires for both the first and the second set. Sets only contain unique values but arrays don’t always do that. The question is if the “union” of two arrays should add the intersect twice or not.
Adding all objects
The method arrayByAddingObjectsFromArray:
makes this trivial:
NSArray *union =
[firstArray arrayByAddingObjectsFromArray:otherArray];
Only adding the intersect once
The filled circles (known as Venn diagrams) come to great aid here. We can easily see that what we want is the relative complement added to all the entries in the second array.
// Calculating the relative complement was shown above ...
NSArray *union =
[relativeComplement arrayByAddingObjectsFromArray:otherArray];
Note that this still allows for duplicate entries since either of the two arrays can contain an entry more then once. Also the intersect can contain the same entry more then once.