Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ci(*:skip) Improve automated changelog generation #3356

Merged
merged 45 commits into from
Nov 21, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
45 commits
Select commit Hold shift + click to select a range
7be18c4
feat(framework:scripts): Improve automated changelog generation
charlesbvll Apr 29, 2024
cebc3d3
Remove deprecated script and update documentation
charlesbvll Apr 29, 2024
1c50ff2
Sort imports
charlesbvll Apr 29, 2024
d7844ed
Fix linting errors and add entry test
charlesbvll Apr 29, 2024
b6f84a2
Merge branch 'main' into improve-changelog-generation
charlesbvll Apr 29, 2024
811f149
Fix inconsistent returns
charlesbvll Apr 29, 2024
2c29ca4
Add type hints
charlesbvll Apr 29, 2024
f0b9455
Sort imports
charlesbvll Apr 29, 2024
6b8695b
Merge branch 'main' of https://github.com/adap/flower into improve-ch…
charlesbvll May 15, 2024
3202880
Use new format
charlesbvll May 15, 2024
71a74b9
Add unknown header
charlesbvll May 15, 2024
d3b8c99
Replace match statement with if-else
charlesbvll May 16, 2024
5501132
Merge branch 'main' into improve-changelog-generation
charlesbvll May 28, 2024
021f25e
Merge branch 'main' into improve-changelog-generation
charlesbvll May 28, 2024
416d09d
Merge branch 'main' into improve-changelog-generation
charlesbvll Jun 11, 2024
bff6611
Move script to dev and fix pattern
charlesbvll Jun 11, 2024
c251084
Add missing header
charlesbvll Jun 11, 2024
56737fd
Merge branch 'main' into improve-changelog-generation
charlesbvll Jun 14, 2024
2bdf2f6
Merge branch 'main' into improve-changelog-generation
charlesbvll Jun 18, 2024
6caa6f4
Merge branch 'main' into improve-changelog-generation
charlesbvll Jul 8, 2024
58808e7
Merge branch 'main' into improve-changelog-generation
danieljanes Jul 22, 2024
08b7978
Merge branch 'main' into improve-changelog-generation
charlesbvll Jul 22, 2024
8f1d357
Merge branch 'main' into improve-changelog-generation
danieljanes Jul 22, 2024
3de7f1b
Merge branch 'main' into improve-changelog-generation
charlesbvll Jul 23, 2024
970569a
Merge branch 'main' into improve-changelog-generation
danieljanes Jul 23, 2024
525b41d
Update dev/update_changelog.py
danieljanes Jul 23, 2024
749dbf5
Update doc/source/contributor-how-to-release-flower.rst
danieljanes Jul 23, 2024
1c8888f
Strip titles
charlesbvll Jul 23, 2024
afed1b7
Try new formatting
charlesbvll Jul 23, 2024
ab8003d
Working formatting
charlesbvll Jul 24, 2024
b5665e0
Merge branch 'main' into improve-changelog-generation
charlesbvll Jul 24, 2024
ce5a7f4
Merge branch 'main' into improve-changelog-generation
charlesbvll Jul 24, 2024
2e021a0
Merge branch 'main' into improve-changelog-generation
charlesbvll Sep 9, 2024
d93fb2c
Merge branch 'main' of https://github.com/adap/flower into improve-ch…
charlesbvll Sep 11, 2024
b58b16e
Merge branch 'main' into improve-changelog-generation
charlesbvll Sep 13, 2024
c90c570
Merge branch 'main' into improve-changelog-generation
charlesbvll Sep 13, 2024
70cbba4
Merge branch 'main' into improve-changelog-generation
charlesbvll Sep 13, 2024
3cdcb15
Merge branch 'main' into improve-changelog-generation
charlesbvll Sep 24, 2024
70ed2d6
Merge branch 'main' into improve-changelog-generation
charlesbvll Sep 28, 2024
246a514
Format doc page
charlesbvll Sep 28, 2024
8ac4e74
Merge branch 'main' into improve-changelog-generation
charlesbvll Nov 21, 2024
321031f
Merge branch 'main' into improve-changelog-generation
charlesbvll Nov 21, 2024
6aa0ae3
Fix changelog script
charlesbvll Nov 21, 2024
f14f2e7
Don't skip skip
charlesbvll Nov 21, 2024
25a158f
Apply suggestions
charlesbvll Nov 21, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
25 changes: 0 additions & 25 deletions dev/prepare-release-changelog.sh

This file was deleted.

297 changes: 297 additions & 0 deletions dev/update_changelog.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,297 @@
# mypy: ignore-errors
# Copyright 2023 Flower Labs GmbH. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""Update the changelog using PR titles."""


import pathlib
import re

try:
import tomllib
except ModuleNotFoundError:
import tomli as tomllib
from datetime import date
from sys import argv
from typing import Optional

from github import Github
from github.PullRequest import PullRequest
from github.Repository import Repository
from github.Tag import Tag

REPO_NAME = "adap/flower"
CHANGELOG_FILE = "doc/source/ref-changelog.md"
CHANGELOG_SECTION_HEADER = "### Changelog entry"

# Load the TOML configuration
with (pathlib.Path(__file__).parent.resolve() / "changelog_config.toml").open(
"rb"
) as file:
CONFIG = tomllib.load(file)

# Extract types, project, and scope from the config
TYPES = "|".join(CONFIG["type"])
PROJECTS = "|".join(CONFIG["project"]) + "|\\*"
SCOPE = CONFIG["scope"]
ALLOWED_VERBS = CONFIG["allowed_verbs"]

# Construct the pattern
PATTERN_TEMPLATE = CONFIG["pattern_template"]
PATTERN = PATTERN_TEMPLATE.format(types=TYPES, projects=PROJECTS, scope=SCOPE)


def _get_latest_tag(gh_api: Github) -> tuple[Repository, Optional[Tag]]:
"""Retrieve the latest tag from the GitHub repository."""
repo = gh_api.get_repo(REPO_NAME)
tags = repo.get_tags()
return repo, tags[0] if tags.totalCount > 0 else None


def _add_shortlog(new_version: str, shortlog: str) -> None:
"""Update the markdown file with the new version or update existing logs."""
token = f"<!---TOKEN_{new_version}-->"
entry = (
"\n### Thanks to our contributors\n\n"
"We would like to give our special thanks to all the contributors "
"who made the new version of Flower possible "
f"(in `git shortlog` order):\n\n{shortlog} {token}"
)
current_date = date.today()

with open(CHANGELOG_FILE, encoding="utf-8") as file:
content = file.readlines()

token_exists = any(token in line for line in content)

with open(CHANGELOG_FILE, "w", encoding="utf-8") as file:
for line in content:
if token in line:
token_exists = True
file.write(line)
elif "## Unreleased" in line and not token_exists:
# Add the new entry under "## Unreleased"
file.write(f"## {new_version} ({current_date})\n{entry}\n")
token_exists = True
else:
file.write(line)


def _get_pull_requests_since_tag(
repo: Repository, tag: Tag
) -> tuple[str, set[PullRequest]]:
"""Get a list of pull requests merged into the main branch since a given tag."""
commit_shas = set()
contributors = set()
prs = set()

for commit in repo.compare(tag.commit.sha, "main").commits:
commit_shas.add(commit.sha)
if commit.author.name is None:
continue
if "[bot]" in commit.author.name:
continue
contributors.add(commit.author.name)

for pr_info in repo.get_pulls(
state="closed", sort="created", direction="desc", base="main"
):
if pr_info.merge_commit_sha in commit_shas:
prs.add(pr_info)
if len(prs) == len(commit_shas):
break

shortlog = ", ".join([f"`{name}`" for name in sorted(contributors)])
return shortlog, prs


def _format_pr_reference(title: str, number: int, url: str) -> str:
"""Format a pull request reference as a markdown list item."""
parts = title.strip().replace("*", "").split("`")
formatted_parts = []

for i, part in enumerate(parts):
if i % 2 == 0:
# Even index parts are normal text, ensure we do not add extra bold if empty
if part.strip():
formatted_parts.append(f"**{part.strip()}**")
else:
formatted_parts.append("")
else:
# Odd index parts are inline code
formatted_parts.append(f"`{part.strip()}`")

# Join parts with spaces but avoid extra spaces
formatted_title = " ".join(filter(None, formatted_parts))
return f"- {formatted_title} ([#{number}]({url}))"


def _extract_changelog_entry(
pr_info: PullRequest,
) -> dict[str, str]:
"""Extract the changelog entry from a pull request's body."""
# Use regex search to find matches
match = re.search(PATTERN, pr_info.title)
if match:
# Extract components from the regex groups
pr_type = match.group(1)
pr_project = match.group(2)
pr_scope = match.group(3) # Correctly capture optional sub-scope
pr_subject = match.group(
4
) # Capture subject starting with uppercase and no terminal period
return {
"type": pr_type,
"project": pr_project,
"scope": pr_scope,
"subject": pr_subject,
}

return {
"type": "unknown",
"project": "unknown",
"scope": "unknown",
"subject": "unknown",
}


def _update_changelog(prs: set[PullRequest]) -> bool:
"""Update the changelog file with entries from provided pull requests."""
breaking_changes = False
unknown_changes = False

with open(CHANGELOG_FILE, "r+", encoding="utf-8") as file:
content = file.read()
unreleased_index = content.find("## Unreleased")

if unreleased_index == -1:
print("Unreleased header not found in the changelog.")
return False

# Find the end of the Unreleased section
next_header_index = content.find("## ", unreleased_index + 1)
next_header_index = (
next_header_index if next_header_index != -1 else len(content)
)

for pr_info in prs:
parsed_title = _extract_changelog_entry(pr_info)

# Skip if PR should be skipped or already in changelog
if (
parsed_title.get("scope", "unknown") == "skip"
or f"#{pr_info.number}]" in content
):
continue

pr_type = parsed_title.get("type", "unknown")
if pr_type == "feat":
insert_content_index = content.find("### What", unreleased_index + 1)
elif pr_type == "docs":
insert_content_index = content.find(
"### Documentation improvements", unreleased_index + 1
)
elif pr_type == "break":
breaking_changes = True
insert_content_index = content.find(
"### Incompatible changes", unreleased_index + 1
)
elif pr_type in {"ci", "fix", "refactor"}:
insert_content_index = content.find(
"### Other changes", unreleased_index + 1
)
else:
unknown_changes = True
insert_content_index = unreleased_index

pr_reference = _format_pr_reference(
pr_info.title, pr_info.number, pr_info.html_url
)

content = _insert_entry_no_desc(
content,
pr_reference,
insert_content_index,
)

next_header_index = content.find("## ", unreleased_index + 1)
next_header_index = (
next_header_index if next_header_index != -1 else len(content)
)

if unknown_changes:
content = _insert_entry_no_desc(
content,
"### Unknown changes",
unreleased_index,
)

if not breaking_changes:
content = _insert_entry_no_desc(
content,
"None",
content.find("### Incompatible changes", unreleased_index + 1),
)

# Finalize content update
file.seek(0)
file.write(content)
file.truncate()
return True


def _insert_entry_no_desc(
content: str, pr_reference: str, unreleased_index: int
) -> str:
"""Insert a changelog entry for a pull request with no specific description."""
insert_index = content.find("\n", unreleased_index) + 1
content = (
content[:insert_index] + "\n" + pr_reference + "\n" + content[insert_index:]
)
return content


def _bump_minor_version(tag: Tag) -> Optional[str]:
"""Bump the minor version of the tag."""
match = re.match(r"v(\d+)\.(\d+)\.(\d+)", tag.name)
if match is None:
return None
major, minor, _ = [int(x) for x in match.groups()]
# Increment the minor version and reset patch version
new_version = f"v{major}.{minor + 1}.0"
return new_version


def main() -> None:
"""Update changelog using the descriptions of PRs since the latest tag."""
# Initialize GitHub Client with provided token (as argument)
gh_api = Github(argv[1])
repo, latest_tag = _get_latest_tag(gh_api)
if not latest_tag:
print("No tags found in the repository.")
return

shortlog, prs = _get_pull_requests_since_tag(repo, latest_tag)
if _update_changelog(prs):
new_version = _bump_minor_version(latest_tag)
if not new_version:
print("Wrong tag format.")
return
_add_shortlog(new_version, shortlog)
print("Changelog updated succesfully.")


if __name__ == "__main__":
main()
6 changes: 3 additions & 3 deletions doc/source/contributor-how-to-release-flower.rst
Original file line number Diff line number Diff line change
Expand Up @@ -10,9 +10,9 @@ During the release
The version number of a release is stated in ``pyproject.toml``. To release a new
version of Flower, the following things need to happen (in that order):

1. Run ``python3 src/py/flwr_tool/update_changelog.py <YOUR_GH_TOKEN>`` in order to add
every new change to the changelog (feel free to make manual changes to the changelog
afterwards until it looks good).
1. Run ``python3 ./dev/update_changelog.py <YOUR_GH_TOKEN>`` in order to add every new
change to the changelog (feel free to make manual changes to the changelog afterwards
until it looks good).
2. Once the changelog has been updated with all the changes, run
``./dev/prepare-release-changelog.sh v<NEW_VERSION>``, where ``<NEW_VERSION>`` is the
version stated in ``pyproject.toml`` (notice the ``v`` added before it). This will
Expand Down
8 changes: 8 additions & 0 deletions doc/source/ref-changelog.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,14 @@

## Unreleased

### What's new?

### Other changes

### Documentation improvements

### Incompatible changes

## v1.13.0 (2024-11-20)

### Thanks to our contributors
Expand Down
Loading