News

Show HN: Data Bonsai: a Python package to clean your data with LLMs

  • alvin_r_h--Github.com
  • published date: 2024-04-27 22:59:03 UTC

I've been doing some data cleaning for my fine-tuning projects using LLMs, and decided to just build a package for it as a side project. Check it out here: https://github.com/databonsai/databonsai

Some features: categorization (labelling), transformation and …

databonsai is a Python library that uses LLMs to perform data cleaning tasks.

  • Suite of tools for data processing using LLMs, including categorization, transformation, and extraction
  • …