Skip to content

DuckDBDataTransformer

The DuckDBDataTransformer*class is a Python class that inherits from the BaseDataTransformer class. The class provides a method to transform data by executing a given SQL query using the DuckDBDataConnector utility to connect to a DuckDB database.

Configuration

Required Configuration

  • duckdb_path: The filepath to the duckdb instance.

Optional Configuration

There is no optional configuration.

Default Configuration

There is no default configuration.

Methods

transform

This method transforms data using DuckDBDataConnector.

def transform(self, sql, *args, **kwargs)

Arguments:

  • sql: (str) The SQL query to transform the data.

Returns:

  • data: (pandas.DataFrame) The transformed data as a Pandas DataFrame.

Usage

rom lolpop.component import DuckDBDataTransformer

config = {
   #insert component config here
}

transformer = DuckDBDataTransformer(conf=copnfig)
sql_query = "SELECT * FROM sample_table LIMIT 10"

result = transformer.transform(sql_query)

print(result.head())