Python library for creating data pipelines with chain functional programming
The primary goal of this release was to implement a number of functions which were missing from the ScalaFunctional
library.
union
, intersection
, difference
, symmetric_difference
(Key, Value)
pairs using inner, outer, left, and right joinsmin
/max
with min_by
and max_by
With these changes, the API is both stable and reasonably complete. The goal is to support all major operations from Scala
arrays and Spark
. That goal seems to be complete.
The next release (small or larger version) will be focused on improving performance using generators and extending the API if necessary.