Fix AI summary phrasing #100

sunithvs · 2025-05-17T16:58:19Z

Summary

strip intro phrases like here the summary from generated profile text

Testing

pytest -q (fails: pytest: command not found)

vercel · 2025-05-17T16:58:22Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
devb-io	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	May 17, 2025 4:59pm

gemini-code-assist

Hello @sunithvs, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

Summary of Changes

Hello team, gemini-code-assist here with a summary of this pull request by @sunithvs. The primary goal of this PR is to improve the quality of the AI-generated profile summaries by removing unwanted introductory phrases that the AI model might sometimes include, such as "Here is the summary:" or simply "Summary:". This makes the final output cleaner and more directly usable.

Highlights

AI Output Cleaning: Implements a mechanism to automatically clean the output of the AI model used for generating profile summaries.
New Cleaning Function: Introduces a new static method _clean_summary within the AIDescriptionGenerator class specifically for this cleaning task.
Regex Implementation: The cleaning logic utilizes regular expressions to identify and remove common introductory patterns at the beginning of the generated text.

Changelog

modules/ai_generator.py
- Imported the re module to enable regular expression operations.
- Added a new static method _clean_summary that takes a string and removes leading phrases like 'here the summary' or 'summary:' using predefined regex patterns.
- Modified the generate_profile_summary method to pass the raw AI response through the _clean_summary method before returning the final summary.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

AI text flows out,
Needs a little polish now,
Strip the intro words.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a utility function _clean_summary to strip common introductory phrases from AI-generated text. The implementation uses regular expressions and applies this cleaning step to the profile summary. The approach is clear and directly addresses the problem statement. The new method is well-defined as a static method with type hints and a docstring.

One area for improvement is the addition of unit tests for the new _clean_summary method to ensure its robustness against various AI outputs.

Also, the PR description mentions pytest -q failing with pytest: command not found. This seems like an environment setup issue on the author's side. It would be good to ensure that the testing environment is correctly configured so that tests (including any new ones for this functionality) can be run successfully before merging.

No specific style guide was provided, so I've considered general Python best practices (like PEP 8). The added code largely adheres to these.

Summary of Findings

Unit Testing for New Logic: The new _clean_summary function, which handles string manipulation of AI-generated text, would significantly benefit from dedicated unit tests. These tests should cover various scenarios, including empty strings, strings with and without prefixes, different casings, and edge cases, to ensure the function's robustness and maintainability. This was raised as a 'medium' severity comment.
Import Order (PEP 8): The import re statement (line 6) could be grouped with other standard library imports (like import json on line 1) at the top of the file, with a blank line separating standard library imports from third-party and local application imports. This aligns with PEP 8 style guidelines for import organization. (Severity: low - not commented on directly due to review settings requesting comments for medium+ severity).

Merge Readiness

The core logic of the PR is sound and addresses the stated goal effectively. However, to ensure the robustness and maintainability of the new string manipulation logic, I strongly recommend adding unit tests for the _clean_summary method as suggested in the review comment.

Once the testing aspect is addressed, the PR should be in good shape for merging. As a reviewer, I am not authorized to approve pull requests, so please ensure it undergoes any further necessary review and approval processes within your team.

gemini-code-assist · 2025-05-17T17:00:51Z

modules/ai_generator.py

+    def _clean_summary(text: str) -> str:
+        """Remove leading phrases like 'here the summary' from the output"""
+        patterns = [
+            r"^\s*here(?:'s| is)?\s+(?:the\s+)?summary[:,]?\s*",
+            r"^\s*summary[:,]?\s*",
+        ]
+        cleaned = text
+        for pattern in patterns:
+            cleaned = re.sub(pattern, "", cleaned, flags=re.IGNORECASE)
+        return cleaned.strip()


The new _clean_summary method is a good addition for refining the AI-generated text!

To ensure its long-term reliability and handle various outputs from the AI, have you considered adding unit tests for this function? Testing with different inputs would be beneficial, for example:

Empty string: ""

String with no matching prefixes: "This is a normal sentence."

String with one of the target prefixes: "Here is the summary: Content" or "summary: More content"

Strings with different casing for the prefixes: "SUMMARY: Content"

Strings where a prefix might be followed by punctuation not currently handled (e.g., a period), to confirm current behavior: "Here is the summary. Content"

Strings that are similar but shouldn't be stripped because the keyword isn't at the very beginning (after accounting for whitespace): "A summary of events."

Dedicated unit tests would help catch regressions if the patterns are modified in the future or if new AI output behaviors emerge that need to be handled.

feat: clean profile summary

bce2312

sunithvs added the codex label May 17, 2025 — with ChatGPT Codex Connector

gemini-code-assist bot reviewed May 17, 2025

View reviewed changes

vercel bot deployed to Preview May 17, 2025 16:59 View deployment

gemini-code-assist bot suggested changes May 17, 2025

View reviewed changes

sunithvs closed this Nov 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix AI summary phrasing #100

Fix AI summary phrasing #100

Uh oh!

sunithvs commented May 17, 2025

Uh oh!

vercel bot commented May 17, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot May 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Fix AI summary phrasing #100

Fix AI summary phrasing #100

Uh oh!

Conversation

sunithvs commented May 17, 2025

Summary

Testing

Uh oh!

vercel bot commented May 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Changelog

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Summary of Findings

Merge Readiness

Uh oh!

gemini-code-assist bot May 17, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vercel bot commented May 17, 2025 •

edited

Loading