Skip to content

BigTable

Bases: Spout

__init__(output, state, **kwargs)

Initialize the Bigtable class.

Parameters:

Name Type Description Default
output BatchOutput

An instance of the BatchOutput class for saving the data.

required
state State

An instance of the State class for maintaining the state.

required
**kwargs

Additional keyword arguments.

{}

Using geniusrise to invoke via command line

genius Bigtable rise \
    batch \
        --output_s3_bucket my_bucket \
        --output_s3_folder s3/folder \
    none \
    fetch \
        --args project_id=my_project instance_id=my_instance table_id=my_table

Using geniusrise to invoke via YAML file

version: "1"
spouts:
    my_bigtable_spout:
        name: "Bigtable"
        method: "fetch"
        args:
            project_id: "my_project"
            instance_id: "my_instance"
            table_id: "my_table"
        output:
            type: "batch"
            args:
                bucket: "my_bucket"
                s3_folder: "s3/folder"

fetch(project_id, instance_id, table_id)

📖 Fetch data from a Google Cloud Bigtable and save it in batch.

Parameters:

Name Type Description Default
project_id str

The Google Cloud Project ID.

required
instance_id str

The Bigtable instance ID.

required
table_id str

The Bigtable table ID.

required

Raises:

Type Description
Exception

If unable to connect to the Bigtable server or fetch the data.