Free and open source ETL for Big data


Pentaho
Pentaho is a commercial open-source BI suite that includes a data integration product called Kettle.
It uses a metadata-driven approach and has a strong, very easy-to-use GUI.
Kettle's development dates back to around 2001; Pentaho itself was founded in 2004 and took Kettle into its suite in 2006.
It has a strong community of 13,500 registered users.
It has a stand-alone Java engine that processes the jobs and tasks for moving data between many different databases and files.
It can run tasks on a schedule, but you need an external scheduler such as cron for that.
It can run remote jobs on “slave servers” on other machines.
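
To make the cron point concrete, here is a minimal sketch in Python that builds a crontab line for running a Kettle job through Kitchen, Kettle's command-line job runner. The install path, the job file, the log file and the 2 a.m. schedule are all made-up examples, and the two helper functions are hypothetical names of my own, not part of Pentaho.

```python
import shlex


def kitchen_command(kitchen_path, job_file, log_level="Basic"):
    """Build the argument list for running a Kettle job file with Kitchen.

    -file and -level are Kitchen options; the paths passed in are placeholders.
    """
    return [kitchen_path, f"-file={job_file}", f"-level={log_level}"]


def crontab_line(schedule, command, log_file):
    """Format one crontab entry that runs `command` and appends its output to `log_file`."""
    quoted = " ".join(shlex.quote(part) for part in command)
    return f"{schedule} {quoted} >> {shlex.quote(log_file)} 2>&1"


if __name__ == "__main__":
    # Assumed locations, adjust to your own installation.
    cmd = kitchen_command(
        "/opt/pentaho/data-integration/kitchen.sh",
        "/etl/jobs/nightly_load.kjb",
    )
    # "0 2 * * *" means every day at 02:00.
    print(crontab_line("0 2 * * *", cmd, "/var/log/etl/nightly_load.log"))
```

The printed line can be pasted into `crontab -e`. Kettle only runs the job here; deciding when it runs is left entirely to cron.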

Talend

  • Talend is an open-source data integration tool; the full suite also covers ESB, MDM, BPM and data quality (DQ).
  • It uses a code-generating approach: you design jobs in an intuitive GUI built on Eclipse RCP, and Talend generates the code that runs them.
  • It has a very large community and more than 800 connectors (the biggest connector library).
