Koheesio: Nike’s Python-based framework to build advanced data-pipelines

Koheesio, a Python framework for data pipelines, emphasizes modularity and collaboration to create complex pipelines from simple components. It offers support for various data processing libraries and frameworks, ensuring versatility. Using Pydantic for typing and settings management, Koheesio prioritizes type safety and structured configurations. Not competing but aiming for support and utility, Koheesio focuses on data engineering expertise, PySpark integration, and specific tasks like ETL jobs. The framework invites contributions for collaboration and innovation. Key components include Step, Context, and Logger. Installation is possible via pip, Hatch, or Poetry. Additional features like Spark Expectations, Box, and SFTP are available as extras. Contributions are welcomed following code standards, testing, and release processes, along with adherence to Nike’s guidelines.

https://github.com/Nike-Inc/koheesio

To top