Skip to content

Commit

Permalink
Better fix for add-back-references (#33207)
Browse files Browse the repository at this point in the history
This is a better fix for add-back-references breeze command, as
the previous fix had a bug and processed more than just providers
when providers --gen-type was selected. See #33149.

This version fixes it in a much more consistent way:

* you have to specify what you want to work on as option
* you can  select few of the provider via auto-complete
* you can also select apache-airflow and helm-chart
* you can mix and match them if you need

All the scripts and references where `--gen-type` was used
previously have been updated and documentation was changed to
reflect it
  • Loading branch information
potiuk committed Aug 9, 2023
1 parent c39359e commit 8c28454
Show file tree
Hide file tree
Showing 21 changed files with 424 additions and 430 deletions.
26 changes: 19 additions & 7 deletions BREEZE.rst
Original file line number Diff line number Diff line change
Expand Up @@ -2034,23 +2034,35 @@ To add back references to the documentation generated by ``build-docs`` in Breez
use the ``release-management add-back-references`` command. This is important to support backward compatibility
the airflow documentation.
.. code-block:: bash
breeze release-management add-back-references
You have to specify which packages you run it on. For example you can run it for all providers:
.. code-block:: bash
release-management add-back-references --airflow-site-directory DIRECTORY
release-management add-back-references --airflow-site-directory DIRECTORY all-providers
The flag ``--airflow-site-directory`` takes the path of the cloned ``airflow-site``. The command will
not proceed if this is an invalid path.
You can also run the command for apache-airflow (core documentation):
.. code-block:: bash
breeze release-management publish-docs --airflow-site-directory DIRECTORY apache-airflow
Also for helm-chart package:
.. code-block:: bash
breeze release-management publish-docs --airflow-site-directory DIRECTORY helm-chart
You can also manually specify (it's auto-completable) list of packages to run the command for including individual
providers - you can mix apache-airflow, helm-chart and provider packages this way:
.. code-block:: bash
breeze release-management publish-docs --gen-type [airflow | providers | helm]
breeze release-management publish-docs --airflow-site-directory DIRECTORY apache.airflow apache.beam google
The flag ``--gen-type`` is a string of which type of back references to generate. It can take the values of
airflow, providers, or helm. The command will fail if this is incorrect.
Those are all available flags of ``release-management add-back-references`` command:
Expand Down
2 changes: 1 addition & 1 deletion dev/README_RELEASE_AIRFLOW.md
Original file line number Diff line number Diff line change
Expand Up @@ -818,7 +818,7 @@ Documentation for providers can be found in the ``/docs/apache-airflow`` directo
```shell script
breeze release-management publish-docs --package-filter apache-airflow --package-filter docker-stack
breeze release-management add-back-references --gen-type airflow
breeze release-management add-back-references apache-airflow
git add .
git commit -m "Add documentation for Apache Airflow ${VERSION}"
git push
Expand Down
2 changes: 1 addition & 1 deletion dev/README_RELEASE_HELM_CHART.md
Original file line number Diff line number Diff line change
Expand Up @@ -633,7 +633,7 @@ between the two repositories to be able to build the documentation.
Regenerate `index.yaml` so it can be added to the Airflow website to allow: `helm repo add https://airflow.apache.org`.

```shell
breeze release-management add-back-references --gen-type helm
breeze release-management add-back-references helm-chart
curl https://dist.apache.org/repos/dist/dev/airflow/helm-chart/$RC/index.yaml -o index.yaml
cp ${AIRFLOW_SVN_RELEASE_HELM}/${VERSION}/airflow-${VERSION}.tgz .
helm repo index --merge ./index.yaml . --url "https://downloads.apache.org/airflow/helm-chart/$VERSION"
Expand Down
2 changes: 1 addition & 1 deletion dev/README_RELEASE_PROVIDER_PACKAGES.md
Original file line number Diff line number Diff line change
Expand Up @@ -392,7 +392,7 @@ breeze release-management publish-docs \
--package-filter 'apache-airflow-providers-*' \
--override-versioned

breeze release-management add-back-references --gen-type providers
breeze release-management add-back-references all-providers
```

If you see `ModuleNotFoundError: No module named 'docs'`, set:
Expand Down
4 changes: 2 additions & 2 deletions dev/breeze/src/airflow_breeze/commands/developer_commands.py
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@
DEFAULT_PYTHON_MAJOR_MINOR_VERSION,
DOCKER_DEFAULT_PLATFORM,
MOUNT_SELECTED,
get_available_documentation_provider_packages,
get_available_documentation_packages,
)
from airflow_breeze.params.build_ci_params import BuildCiParams
from airflow_breeze.params.doc_build_params import DocBuildParams
Expand Down Expand Up @@ -362,7 +362,7 @@ def start_airflow(
@click.option(
"--package-filter",
help="List of packages to consider.",
type=NotVerifiedBetterChoice(get_available_documentation_provider_packages()),
type=NotVerifiedBetterChoice(get_available_documentation_packages()),
multiple=True,
)
@click.option(
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -43,16 +43,16 @@
MOUNT_ALL,
MOUNT_SELECTED,
MULTI_PLATFORM,
get_available_documentation_provider_packages,
get_available_documentation_packages,
)
from airflow_breeze.params.shell_params import ShellParams
from airflow_breeze.utils.add_back_references import (
GenerationType,
start_generating_back_references,
)
from airflow_breeze.utils.ci_group import ci_group
from airflow_breeze.utils.common_options import (
argument_packages,
argument_packages_plus_all_providers,
option_airflow_constraints_mode_ci,
option_airflow_constraints_reference,
option_airflow_extras,
Expand Down Expand Up @@ -783,7 +783,7 @@ def alias_image(image_from: str, image_to: str):
@click.option(
"--package-filter",
help="List of packages to consider.",
type=NotVerifiedBetterChoice(get_available_documentation_provider_packages()),
type=NotVerifiedBetterChoice(get_available_documentation_packages()),
multiple=True,
)
@option_verbose
Expand Down Expand Up @@ -815,7 +815,7 @@ def publish_docs(

@release_management.command(
name="add-back-references",
help="Command to add back references for documentation to make it backward compatible",
help="Command to add back references for documentation to make it backward compatible.",
)
@click.option(
"-a",
Expand All @@ -825,23 +825,12 @@ def publish_docs(
help="Local directory path of cloned airflow-site repo.",
required=True,
)
@click.option(
"-g",
"--gen-type",
show_default=True,
help="Type of back references to generate. Forced to providers if providers specified as arguments.",
type=BetterChoice(
[e.name for e in GenerationType],
),
default=GenerationType.airflow.name,
)
@argument_packages
@argument_packages_plus_all_providers
@option_verbose
@option_dry_run
def add_back_references(
airflow_site_directory: str,
gen_type: str,
packages: list[str],
packages_plus_all_providers: tuple[str],
):
"""Adds back references for documentation generated by build-docs and publish-docs"""
site_path = Path(airflow_site_directory)
Expand All @@ -851,22 +840,16 @@ def add_back_references(
"Provide the path of cloned airflow-site repo\n"
)
sys.exit(1)
if len(packages) != 0 and gen_type != GenerationType.providers.name:
if len(packages_plus_all_providers) == 0:
get_console().print(
[
f"[warning]Forcing gen type to "
f"{GenerationType.providers} as some provider_packages are selected."
]
)
gen_type = GenerationType.providers.name
gen = GenerationType[gen_type]
if gen not in GenerationType:
get_console().print(
"\n[error]invalid type of doc generation required. Pass one of [airflow | providers | helm]\n"
"\n[error]You need to specify at least one package to generate back references for\n"
)
sys.exit(1)

start_generating_back_references(gen, site_path, packages)
packages = list(packages_plus_all_providers)
if "all-providers" in packages_plus_all_providers:
packages.remove("all-providers")
packages.extend(get_available_documentation_packages(only_providers=True, short_version=True))
start_generating_back_references(site_path, packages)


@release_management.command(
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -174,7 +174,7 @@
"breeze release-management add-back-references": [
{
"name": "Add Back References to Docs",
"options": ["--airflow-site-directory", "--gen-type"],
"options": ["--airflow-site-directory"],
},
],
"breeze release-management generate-issue-content-providers": [
Expand Down
14 changes: 8 additions & 6 deletions dev/breeze/src/airflow_breeze/global_constants.py
Original file line number Diff line number Diff line change
Expand Up @@ -145,18 +145,20 @@ def all_helm_test_packages() -> list[str]:
ALL_HISTORICAL_PYTHON_VERSIONS = ["3.6", "3.7", "3.8", "3.9", "3.10", "3.11"]


def get_available_documentation_provider_packages(short_version=False) -> list[str]:
def get_available_documentation_packages(short_version=False, only_providers: bool = False) -> list[str]:
provider_names: list[str] = list(json.loads(PROVIDER_DEPENDENCIES_JSON_FILE_PATH.read_text()).keys())
doc_provider_names = [provider_name.replace(".", "-") for provider_name in provider_names]
available_packages = [f"apache-airflow-providers-{doc_provider}" for doc_provider in doc_provider_names]
available_packages.extend(["apache-airflow", "docker-stack", "helm-chart"])
available_packages.sort()
available_packages = []
if not only_providers:
available_packages.extend(["apache-airflow", "docker-stack", "helm-chart"])
all_providers = [f"apache-airflow-providers-{doc_provider}" for doc_provider in doc_provider_names]
all_providers.sort()
available_packages.extend(all_providers)
if short_version:
prefix_len = len("apache-airflow-providers-")
available_packages = [
package[prefix_len:].replace("-", ".")
package[prefix_len:].replace("-", ".") if len(package) > prefix_len else package
for package in available_packages
if len(package) > prefix_len
]
return available_packages

Expand Down
37 changes: 13 additions & 24 deletions dev/breeze/src/airflow_breeze/utils/add_back_references.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,6 @@
# under the License.
from __future__ import annotations

import enum
import os
import re
import tempfile
Expand All @@ -26,21 +25,14 @@

from rich import print

from airflow_breeze.global_constants import get_available_documentation_provider_packages
from airflow_breeze.utils.console import get_console

airflow_redirects_link = (
"https://raw.githubusercontent.com/apache/airflow/main/docs/apache-airflow/redirects.txt"
)
helm_redirects_link = "https://raw.githubusercontent.com/apache/airflow/main/docs/helm-chart/redirects.txt"


# types of generations supported
class GenerationType(enum.Enum):
airflow = 1
helm = 2
providers = 3


def download_file(url):
try:
temp_dir = Path(tempfile.mkdtemp(prefix="temp_dir", suffix=""))
Expand Down Expand Up @@ -149,26 +141,23 @@ def generate_back_references(link: str, base_path: Path):
create_back_reference_html(relative_path, dest_file_path)


def start_generating_back_references(
gen_type: GenerationType, airflow_site_directory: Path, short_provider_package_ids: list[str]
):
# Either packages or gen_type should be provided
def start_generating_back_references(airflow_site_directory: Path, short_provider_package_ids: list[str]):
docs_archive_path = airflow_site_directory / "docs-archive"
airflow_docs_path = docs_archive_path / "apache-airflow"
helm_docs_path = docs_archive_path / "helm-chart"

if gen_type == GenerationType.airflow:
if "apache-airflow" in short_provider_package_ids:
generate_back_references(airflow_redirects_link, airflow_docs_path)
elif gen_type == GenerationType.helm:
short_provider_package_ids.remove("apache-airflow")
if "helm-chart" in short_provider_package_ids:
generate_back_references(helm_redirects_link, helm_docs_path)
elif gen_type == GenerationType.providers:
if short_provider_package_ids:
all_providers = [
f"apache-airflow-providers-{package.replace('.','-')}"
for package in short_provider_package_ids
]
else:
all_providers = get_available_documentation_provider_packages()
short_provider_package_ids.remove("helm-chart")
if "docker-stack" in short_provider_package_ids:
get_console().print("[info]Skipping docker-stack package. No back-reference needed.")
short_provider_package_ids.remove("docker-stack")
if short_provider_package_ids:
all_providers = [
f"apache-airflow-providers-{package.replace('.','-')}" for package in short_provider_package_ids
]
for p in all_providers:
print(f"Processing airflow provider: {p}")
generate_back_references(get_github_redirects_url(p), docs_archive_path / p)
10 changes: 8 additions & 2 deletions dev/breeze/src/airflow_breeze/utils/common_options.py
Original file line number Diff line number Diff line change
Expand Up @@ -43,7 +43,7 @@
SINGLE_PLATFORMS,
START_AIRFLOW_ALLOWED_EXECUTORS,
START_AIRFLOW_DEFAULT_ALLOWED_EXECUTORS,
get_available_documentation_provider_packages,
get_available_documentation_packages,
)
from airflow_breeze.utils.custom_param_types import (
AnswerChoice,
Expand Down Expand Up @@ -448,7 +448,13 @@ def _set_default_from_parent(ctx: click.core.Context, option: click.core.Option,
"packages",
nargs=-1,
required=False,
type=BetterChoice(get_available_documentation_provider_packages(short_version=True)),
type=BetterChoice(get_available_documentation_packages(short_version=True)),
)
argument_packages_plus_all_providers = click.argument(
"packages_plus_all_providers",
nargs=-1,
required=False,
type=BetterChoice(["all-providers"] + get_available_documentation_packages(short_version=True)),
)
option_airflow_constraints_reference = click.option(
"--airflow-constraints-reference",
Expand Down
6 changes: 3 additions & 3 deletions dev/breeze/tests/test_global_constants.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,12 +16,12 @@
# under the License.
from __future__ import annotations

from airflow_breeze.global_constants import get_available_documentation_provider_packages
from airflow_breeze.global_constants import get_available_documentation_packages

AVAILABLE_PACKAGES_STARTING_LIST = ("apache-airflow", "helm-chart", "docker-stack")


def test_get_available_packages():
assert len(get_available_documentation_provider_packages()) > 70
for package in get_available_documentation_provider_packages():
assert len(get_available_documentation_packages()) > 70
for package in get_available_documentation_packages():
assert package.startswith(AVAILABLE_PACKAGES_STARTING_LIST)
2 changes: 2 additions & 0 deletions dev/provider_packages/publish_provider_documentation.sh
Original file line number Diff line number Diff line change
Expand Up @@ -30,4 +30,6 @@ done

breeze build-docs "${provider_filters[@]}"

breeze release-management add-back-references "${@}"

cd "${AIRFLOW_SITE_DIRECTORY}"
14 changes: 7 additions & 7 deletions images/breeze/output-commands-hash.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
# Please do not solve it but run `breeze setup regenerate-command-images`.
# This command should fix the conflict and regenerate help images that you have conflict with.
main:344261ca3aa7ff31e098b1d88280566a
build-docs:45ccfec7bb1bd73eb635c3363d0353a6
build-docs:bfc91db55c595516fe84d56c96a5a8e0
ci:fix-ownership:3e5a73533cc96045e72cb258783cfc96
ci:free-space:49af17b032039c05c41a7a8283f365cc
ci:get-workflow-info:8246038093359b9c3c110043419473e2
Expand Down Expand Up @@ -35,21 +35,21 @@ prod-image:build:85bb45de92e86a55474238c3f04def92
prod-image:pull:76f1f27e6119928412abecf153fce4bb
prod-image:verify:bd2b78738a7c388dbad6076c41a9f906
prod-image:7b3369e182724fc155b3399c95d0fd73
release-management:add-back-references:0d4eb5ed82e5381bc630b343ba605a72
release-management:add-back-references:919b3bd706acf794e0e5e129fc587f6d
release-management:create-minor-branch:a3834afc4aa5d1e98002c9e9e7a9931d
release-management:generate-constraints:b8fcaf8f0acd35ed5dbd48659bdb6485
release-management:generate-issue-content-providers:6b0d954cb6dbdec0da0a7988feec58f0
release-management:generate-issue-content-providers:dc7149a2c6fc43d323e7c83865c0ca0b
release-management:generate-providers-metadata:d4e8e5cfaa024e3963af02d7a873048d
release-management:install-provider-packages:a89493be1ae961c13469b5a25a605069
release-management:prepare-airflow-package:85d01c57e5b5ee0fb9e5f9d9706ed3b5
release-management:prepare-provider-documentation:b8defe8b94bf790d88d055c3e4c0ffba
release-management:prepare-provider-packages:4599a06e636c6fe8fda41b5a751c1429
release-management:publish-docs:00eae3f9acc2134c14c9a94d789423ce
release-management:prepare-provider-documentation:71dd952212206ca5c3b24bf8deaf13d6
release-management:prepare-provider-packages:ed727969a3fa27e41c75c08381c9ebf4
release-management:publish-docs:6d8ea0362b7a4394d8367e2105516cf8
release-management:release-prod-images:cfbfe8b19fee91fd90718f98ef2fd078
release-management:start-rc-process:b27bd524dd3c89f50a747b60a7e892c1
release-management:start-release:419f48f6a4ff4457cb9de7ff496aebbe
release-management:verify-provider-packages:96dce5644aad6b37080acf77b3d8de3a
release-management:dc4897917210deefb4338f2038f8cb33
release-management:376b788c9d4d2d431d391313482fe791
sbom:generate-provider-requirements:9abe53200ea5f40e0bf7c27f6087f27f
sbom:update-sbom-information:0ce56884e5f842e3e80d6619df1ccc64
sbom:935d041028e847d3faf763a95b51063e
Expand Down
Loading

0 comments on commit 8c28454

Please sign in to comment.