Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Support more options in create index statement #26

Closed
Tracked by #3
dai-chen opened this issue Aug 30, 2023 · 2 comments
Closed
Tracked by #3

[FEATURE] Support more options in create index statement #26

dai-chen opened this issue Aug 30, 2023 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@dai-chen
Copy link
Collaborator

Is your feature request related to a problem?

Currently Flint streaming job uses Foreach Batch sink which creates temporary checkpoint folder by default. The checkpoint will be lost after restart.

What solution would you like?

Add another option in create statement to allow user configure where they want to store the checkpoint data. Without the location provided, we can assume user doesn't concern about this and stick with temporary checkpoint as before.

For example:

CREATE SKIPPING INDEX ON alb_logs
(
  client_ip VALUE_SET
)
WITH (
  auto_refresh = true,
  checkpoint_location = 's3://...'
)

What alternatives have you considered?

Use some default location of current database. For example, because we map data source concept to SparkSQL catalog, we may use the root folder of a S3 data source. But this needs user awareness and write permission.

Do you have any additional context?

N/A

@dai-chen dai-chen added enhancement New feature or request untriaged labels Aug 30, 2023
@dai-chen dai-chen changed the title [FEATURE] Add checkpoint location option to create statement [FEATURE] Support more options in create index statement Sep 20, 2023
@dai-chen
Copy link
Collaborator Author

Similarly, we want to support other options such as refresh interval, index setting etc.

CREATE SKIPPING/INDEX/MV ...
WITH (
  auto_refresh = [true | false],
  auto_start = [true | false],
  refresh_interval = '1s',
  checkpoint_location = 's3://...',
  index_settings = '{ "key": value ... }'
);

@dai-chen
Copy link
Collaborator Author

dai-chen commented Oct 4, 2023

auto_start is low priority and will be tracked separately.

@dai-chen dai-chen closed this as completed Oct 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant