Introduction
The English SDK for Apache Spark is an extremely simple yet powerful tool. It takes English instructions and compile them into PySpark objects like DataFrames. Its goal is to make Spark more user-friendly and accessible, allowing you to focus your efforts on extracting insights from your data.
Getting Started
DataFrame Transformation
Given the following DataFrame df
, you can write English to transform it to another DataFrame. For example:
df.ai.transform("What are the best-selling and the second best-selling products in every category?").show()
product | category | revenue |
---|---|---|
Foldable | Cellphone | 6500 |
Nromal | Cellphone | 6000 |
Mini | Tablet | 5500 |
Pro | Tablet | 4000 |