Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rather than invoking the Python interpreter, Tuplex generates optimized LLVM bytecode for the given pipeline and input data set.
scripts/build_macos_wheels.sh
)-DBUILD_WITH_ORC
where ORC libraries may have been build with different Protobuf version than Tuplex causing issues when delocating the wheel under macOS (Intel/ARM).toPythonString()
conversion in Row
.pytest.parametrize
.setup.py
when Python executable is not in /usr/local
.scripts/create_lambda_zip.sh
.