Exploring the Power of Natural Language Data Manipulation with PandasAI

How generative AI supercharges Pandas capabilities


I recently discovered an intriguing new tool that marries my interests in data science and generative AI. It lets you use ChatGPT to “talk” to your data directly in natural language from inside a Jupyter notebook.

Enter PandasAI 🐼✨, a free, open-source Python library.*

Image created by me, using DALL-E

What is PandasAI?

DataCamp describes PandasAI as “a Python library that uses genAI models to supercharge pandas capabilities. It was created to complement the pandas library, a widely-used tool for data analysis and manipulation.” [1]

Below is an example of how you can leverage the impressive capabilities of PandasAI, even with limited knowledge of Python and Pandas:

Image created by the author, in Jupyter Lab, using pandasai to “talk” to a transactions dataset.
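
To give a sense of what a natural-language question replaces, here is a minimal sketch of the hand-written pandas you would otherwise need for a question like “Which customer spent the most in total?”. The dataset, its column names (`customer`, `category`, `amount`) and the question itself are made up for illustration; they are not taken from the screenshot above.

```python
import pandas as pd

# Hypothetical transactions data; the column names are assumptions for illustration.
transactions = pd.DataFrame(
    {
        "customer": ["Ana", "Ben", "Ana", "Cy", "Ben"],
        "category": ["grocery", "travel", "travel", "grocery", "grocery"],
        "amount": [20.0, 150.0, 80.0, 35.0, 10.0],
    }
)

# Hand-written pandas for the question "Which customer spent the most in total?"
total_by_customer = transactions.groupby("customer")["amount"].sum()
top_customer = total_by_customer.idxmax()  # customer with the largest total
top_amount = total_by_customer.max()       # that customer's total spend

print(f"{top_customer} spent the most: {top_amount:.2f}")
```

With a tool like PandasAI, the idea is that you type the question in plain English and the library asks the language model to come up with code along these lines for you.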

How to get set up

This part does require signing up for an OpenAI developer account, but fear not! OpenAI makes this step quite easy: it mainly requires generating an API token that you will need to include in your notebook.
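
One common way to include the token without pasting it into a notebook cell is to read it from an environment variable. The sketch below assumes you have stored the token in a variable named `OPENAI_API_KEY`; the helper `load_openai_token` is a hypothetical name of my own, not part of PandasAI or the OpenAI SDK.

```python
import os


def load_openai_token(env_var: str = "OPENAI_API_KEY") -> str:
    """Return the API token stored in the given environment variable.

    The variable name is an assumption; use whichever name you exported.
    """
    token = os.environ.get(env_var, "").strip()
    if not token:
        # Fail early with a readable message instead of a confusing API error later.
        raise RuntimeError(
            f"Set the {env_var} environment variable to your OpenAI API token."
        )
    return token


# Typical notebook usage (uncomment once the variable is set):
# api_token = load_openai_token()
```

You would then hand `api_token` to PandasAI's OpenAI wrapper. In early PandasAI releases this looked like `OpenAI(api_token=api_token)`, imported from `pandasai.llm.openai`, but the import path and class names may differ in the version you install, so check its documentation.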