feat(youtube_downloader.py): Implement YoutubeDownloader class functionality #21

domabyte · 2025-02-21T19:03:57Z

#19 For downloading videos and retrieving channel info

This change introduces a new class, YoutubeDownloader, which provides functionality to download videos from YouTube and fetch channel information using the YouTube Data API. The implementation includes logging, execution time measurement, and concurrent execution for fetching video details. This enhancement aims to streamline the process of interacting with YouTube content programmatically.

Summary by CodeRabbit

New Features
- Introduced a new functionality for downloading YouTube videos and retrieving detailed channel and video information.
- Enhanced the experience with concurrent processing for faster data retrieval and comprehensive logging for transparency during operations.
- Added a sample environment configuration file for easier setup.
- Implemented notification features in workflows to provide immediate feedback on code reviews and linting results.
Chores
- Updated .gitignore to include environment configuration files and log directories.
- Added new dependencies for improved functionality in the project.
- Modified configuration instructions to focus on design patterns and best practices.

…wnloading videos and retrieving channel info This change introduces a new class, `YoutubeDownloader`, which provides functionality to download videos from YouTube and fetch channel information using the YouTube Data API. The implementation includes logging, execution time measurement, and concurrent execution for fetching video details. This enhancement aims to streamline the process of interacting with YouTube content programmatically.

coderabbitai · 2025-02-21T19:04:04Z

Warning

`.coderabbit.yaml` has a parsing error

The CodeRabbit configuration file in this repository has a parsing error and default settings were used instead. Please fix the error(s) in the configuration file. You can initialize chat with CodeRabbit to get help with the configuration file.

💥 Parsing errors (1)

Validation error: Unrecognized key(s) in object: 'version', 'style', 'post_review_actions'

⚙️ Configuration instructions

Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Walkthrough

A new Python module has been introduced that implements the YoutubeDownloader class to retrieve and log YouTube video data. The module utilizes methods to measure execution time, run yt-dlp commands concurrently, and gather video and channel information, with logging configured via an environment-based setup. Helper functions for log directory resolution and a main() entry point have also been added to facilitate execution.

Changes

File	Change Summary
`reelfeeder/core/youtube_downloader.py`	Added `YoutubeDownloader` class with methods: • `__measure_execution_time` for timing functions • `__run_yt_dlp_command` to execute yt-dlp commands • `__get_video_info` for concurrent info retrieval • `get_channel_info` and `channel_info` for handling video/channel details Also added `logs_dir` and `main` as standalone helper functions.
`.gitignore`	Added entries for `.env` and `logs/` to ignore environment configuration files and log files; confirmed existing entry for `data/processed_videos/*`.
`pyproject.toml`	Added dependencies: `dotenv = "^0.9.9"` and `google-api-python-client = "^2.161.0"` under `[tool.poetry.dependencies]`.
`.coderabbit.yaml`	Updated `tone_instructions` to remove reference to "SOLID principles," focusing solely on "design patterns and best practices."
`.env.example`	Introduced a new file with environment variable declarations: `CHANNEL_ID`, `GOOGLE_API_KEY`, and `YOUTUBE_URL`.
`.github/workflows/lint.yml`	Added a "Notify result" job step to send notifications based on the success or failure of the linting process, utilizing the `NTFY_TOPIC` environment variable.
`.github/workflows/coderabbit-review-ntfy.yml`	Introduced a new workflow to send notifications upon completion of code reviews triggered by specific issue comments.

Sequence Diagram(s)

sequenceDiagram
    participant M as main()
    participant YT as YoutubeDownloader
    participant TP as ThreadPool
    participant YTD as yt-dlp

    M->>YT: Instantiate with URL and config
    Note over YT: Initializes attributes and logging
    M->>YT: Call channel_info()
    YT->>YT: Invoke __get_video_info() (measures execution time)
    YT->>TP: Dispatch __run_yt_dlp_command() tasks concurrently
    TP->>YTD: Execute yt-dlp command(s)
    YTD-->>TP: Return video info
    TP-->>YT: Return aggregated data
    YT->>YT: Log and print execution time

Poem

Hoppity hop, I'm a coding bunny,
Leaping through bytes and code so sunny,
With new methods that run and log,
And yt-dlp commands cutting through fog,
I nibble on data, swift and keen—
Celebrating a change so bright and clean!
🐰✨

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

…rings to improve code readability and maintainability

coderabbitai

Actionable comments posted: 7

🔭 Outside diff range comments (1)

reelfeeder/core/youtube_downloader.py (1)
1-125: ⚠️ Potential issue

Fix formatting issues.

The Ruff formatting check has failed. Please run the formatter to fix the issues.

Run the following command to fix formatting:
ruff format reelfeeder/core/youtube_downloader.py
🧰 Tools

🪛 GitHub Actions: Lint

[error] 1-1: Ruff formatting check failed. 1 file would be reformatted.

🧹 Nitpick comments (1)

reelfeeder/core/youtube_downloader.py (1)
1-10: Consider reorganizing imports and moving load_dotenv.

The imports could be better organized by grouping standard library imports separately from third-party imports. Additionally, consider moving the load_dotenv() call to a more appropriate location.
# Standard library imports
import concurrent.futures
import logging
import os
import subprocess
import time

# Third-party imports
from dotenv import load_dotenv
from googleapiclient.discovery import build

-load_dotenv()
Move the load_dotenv() call to either:

The main() function if this module is meant to be run as a script

An initialization function if this module is meant to be imported

🧰 Tools

🪛 GitHub Actions: Lint

[error] 1-1: Ruff formatting check failed. 1 file would be reformatted.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5a654ea and 70fedfb.

📒 Files selected for processing (1)

reelfeeder/core/youtube_downloader.py (1 hunks)

🧰 Additional context used

🪛 GitHub Actions: Lint

reelfeeder/core/youtube_downloader.py

[error] 1-1: Ruff formatting check failed. 1 file would be reformatted.

coderabbitai · 2025-02-21T19:08:17Z

reelfeeder/core/youtube_downloader.py

+    def channel_info(self) -> dict:
+        """
+        Get channel information
+        """
+        video_info, duration = self.__measure_execution_time(self.__get_video_info)
+        self.logger.info("Video Info: %s", video_info)
+        print("Execution Time: ", duration)
+


🛠️ Refactor suggestion

Improve channel_info method implementation.

The method lacks proper error handling and return type consistency.

Consider this improved implementation:

- def channel_info(self) -> dict: + def channel_info(self) -> tuple[dict, float]: """ - Get channel information + Get video and execution time information. + + Returns: + Tuple of (video information dict, execution time in seconds) + + Raises: + RuntimeError: If video information cannot be retrieved """ - video_info, duration = self.__measure_execution_time(self.__get_video_info) - self.logger.info("Video Info: %s", video_info) - print("Execution Time: ", duration) + try: + video_info, duration = self.__measure_execution_time(self.__get_video_info) + self.logger.info("Video Info: %s", video_info) + self.logger.info("Execution Time: %s seconds", duration) + return video_info, duration + except Exception as e: + self.logger.error("Failed to get channel info: %s", str(e)) + raise RuntimeError(f"Failed to get channel info: {str(e)}")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def channel_info(self) -> dict:

"""

Get channel information

"""

video_info, duration = self.__measure_execution_time(self.__get_video_info)

self.logger.info("Video Info: %s", video_info)

print("Execution Time: ", duration)

def channel_info(self) -> tuple[dict, float]:

"""

Get video and execution time information.

Returns:

Tuple of (video information dict, execution time in seconds)

Raises:

RuntimeError: If video information cannot be retrieved

"""

try:

video_info, duration = self.__measure_execution_time(self.__get_video_info)

self.logger.info("Video Info: %s", video_info)

self.logger.info("Execution Time: %s seconds", duration)

return video_info, duration

except Exception as e:

self.logger.error("Failed to get channel info: %s", str(e))

raise RuntimeError(f"Failed to get channel info: {str(e)}")

coderabbitai · 2025-02-21T19:08:17Z

reelfeeder/core/youtube_downloader.py

+def logs_dir():
+    current_dir = os.getcwd()
+    parent = os.path.join(os.path.join(current_dir, os.pardir), os.pardir)
+    return os.path.abspath(parent)
+


🛠️ Refactor suggestion

Improve logs directory resolution.

The current implementation has several issues:

Hard-coded path traversal

No directory creation

No error handling

Consider this improved implementation:

-def logs_dir(): +def logs_dir() -> str: + """ + Resolve and create logs directory. + + Returns: + Absolute path to logs directory + + Raises: + OSError: If directory creation fails + """ - current_dir = os.getcwd() - parent = os.path.join(os.path.join(current_dir, os.pardir), os.pardir) - return os.path.abspath(parent) + try: + # Use a more reliable method to get the project root + project_root = os.path.dirname(os.path.dirname(os.path.dirname(__file__))) + logs_path = os.path.join(project_root, "logs") + + # Create logs directory if it doesn't exist + os.makedirs(logs_path, exist_ok=True) + + return logs_path + except OSError as e: + raise OSError(f"Failed to create logs directory: {str(e)}")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def logs_dir():

current_dir = os.getcwd()

parent = os.path.join(os.path.join(current_dir, os.pardir), os.pardir)

return os.path.abspath(parent)

def logs_dir() -> str:

"""

Resolve and create logs directory.

Returns:

Absolute path to logs directory

Raises:

OSError: If directory creation fails

"""

try:

# Use a more reliable method to get the project root

project_root = os.path.dirname(os.path.dirname(os.path.dirname(__file__)))

logs_path = os.path.join(project_root, "logs")

# Create logs directory if it doesn't exist

os.makedirs(logs_path, exist_ok=True)

return logs_path

except OSError as e:

raise OSError(f"Failed to create logs directory: {str(e)}")

coderabbitai · 2025-02-21T19:08:17Z

reelfeeder/core/youtube_downloader.py

+    def __run_yt_dlp_command(self, flag):
+        command = ["yt-dlp", flag, self.url]
+        result = subprocess.run(command, capture_output=True, text=True)
+        return flag, result.stdout.strip()
+


⚠️ Potential issue

Improve security and error handling in yt-dlp command execution.

The current implementation has security risks and lacks proper error handling:

No URL validation before shell command execution

No timeout for subprocess execution

No proper error handling for command failures

Consider this safer implementation:

- def __run_yt_dlp_command(self, flag): - command = ["yt-dlp", flag, self.url] - result = subprocess.run(command, capture_output=True, text=True) - return flag, result.stdout.strip() + def __run_yt_dlp_command(self, flag: str, timeout: int = 30) -> tuple[str, str]: + """ + Execute yt-dlp command with the given flag. + + Args: + flag: yt-dlp command flag + timeout: Command timeout in seconds + + Returns: + Tuple of (flag, command output) + + Raises: + ValueError: If command execution fails + TimeoutError: If command execution times out + """ + if not isinstance(self.url, str) or not self.url.startswith(('http://', 'https://')): + raise ValueError(f"Invalid URL: {self.url}") + + command = ["yt-dlp", flag, self.url] + try: + result = subprocess.run( + command, + capture_output=True, + text=True, + timeout=timeout, + check=True + ) + return flag, result.stdout.strip() + except subprocess.TimeoutExpired: + self.logger.error(f"Command timed out after {timeout} seconds: {command}") + raise TimeoutError(f"Command timed out: {command}") + except subprocess.CalledProcessError as e: + self.logger.error(f"Command failed with exit code {e.returncode}: {command}") + self.logger.error(f"Error output: {e.stderr}") + raise ValueError(f"Command failed: {e.stderr}")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def __run_yt_dlp_command(self, flag):

command = ["yt-dlp", flag, self.url]

result = subprocess.run(command, capture_output=True, text=True)

return flag, result.stdout.strip()

def __run_yt_dlp_command(self, flag: str, timeout: int = 30) -> tuple[str, str]:

"""

Execute yt-dlp command with the given flag.

Args:

flag: yt-dlp command flag

timeout: Command timeout in seconds

Returns:

Tuple of (flag, command output)

Raises:

ValueError: If command execution fails

TimeoutError: If command execution times out

"""

if not isinstance(self.url, str) or not self.url.startswith(('http://', 'https://')):

raise ValueError(f"Invalid URL: {self.url}")

command = ["yt-dlp", flag, self.url]

try:

result = subprocess.run(

command,

capture_output=True,

text=True,

timeout=timeout,

check=True

)

return flag, result.stdout.strip()

except subprocess.TimeoutExpired:

self.logger.error(f"Command timed out after {timeout} seconds: {command}")

raise TimeoutError(f"Command timed out: {command}")

except subprocess.CalledProcessError as e:

self.logger.error(f"Command failed with exit code {e.returncode}: {command}")

self.logger.error(f"Error output: {e.stderr}")

raise ValueError(f"Command failed: {e.stderr}")

coderabbitai · 2025-02-21T19:08:17Z

reelfeeder/core/youtube_downloader.py

+def main():
+    directory = logs_dir()
+    logname = os.path.join(directory, "logs", "youtube_downloader.log")
+    downloader = YoutubeDownloader(
+        "https://youtube.com/shorts/GT2m7AwLg9o?si=iuViBBk4DXG92BNR", logname, "output"
+    )
+    downloader.channel_info()
+
+
+if __name__ == "__main__":
+    main()


🛠️ Refactor suggestion

Improve main function implementation.

The current implementation has hard-coded values and lacks error handling.

Consider this improved implementation:

-def main(): +def main() -> None: + """ + Main entry point for the YouTube downloader. + + Environment variables required: + - CHANNEL_ID: YouTube channel ID + - GOOGLE_API_KEY: Google API key for YouTube Data API + """ + try: + # Load environment variables + load_dotenv() + directory = logs_dir() logname = os.path.join(directory, "logs", "youtube_downloader.log") + + # Configure logging for the main function + logging.basicConfig( + level=logging.INFO, + format='%(asctime)s - %(name)s - %(levelname)s - %(message)s' + ) + logger = logging.getLogger(__name__) + + # Validate environment variables + if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY")]): + raise EnvironmentError("Missing required environment variables") + downloader = YoutubeDownloader( - "https://youtube.com/shorts/GT2m7AwLg9o?si=iuViBBk4DXG92BNR", logname, "output" + url=os.getenv("YOUTUBE_URL", "https://youtube.com/shorts/GT2m7AwLg9o"), + logname=logname, + output_path=os.path.join(directory, "downloads"), + channel_ids=[os.getenv("CHANNEL_ID")] ) - downloader.channel_info() + + # Get channel information + video_info, duration = downloader.channel_info() + logger.info("Successfully retrieved video information in %.2f seconds", duration) + + except Exception as e: + logging.error("Failed to execute main: %s", str(e)) + raise if __name__ == "__main__": - main() + try: + main() + except Exception as e: + logging.error("Application failed: %s", str(e)) + sys.exit(1)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def main():

directory = logs_dir()

logname = os.path.join(directory, "logs", "youtube_downloader.log")

downloader = YoutubeDownloader(

"https://youtube.com/shorts/GT2m7AwLg9o?si=iuViBBk4DXG92BNR", logname, "output"

)

downloader.channel_info()

if __name__ == "__main__":

main()

def main() -> None:

"""

Main entry point for the YouTube downloader.

Environment variables required:

- CHANNEL_ID: YouTube channel ID

- GOOGLE_API_KEY: Google API key for YouTube Data API

"""

try:

# Load environment variables

load_dotenv()

directory = logs_dir()

logname = os.path.join(directory, "logs", "youtube_downloader.log")

# Configure logging for the main function

logging.basicConfig(

level=logging.INFO,

format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'

)

logger = logging.getLogger(__name__)

# Validate environment variables

if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY")]):

raise EnvironmentError("Missing required environment variables")

downloader = YoutubeDownloader(

url=os.getenv("YOUTUBE_URL", "https://youtube.com/shorts/GT2m7AwLg9o"),

logname=logname,

output_path=os.path.join(directory, "downloads"),

channel_ids=[os.getenv("CHANNEL_ID")]

)

# Get channel information

video_info, duration = downloader.channel_info()

logger.info("Successfully retrieved video information in %.2f seconds", duration)

except Exception as e:

logging.error("Failed to execute main: %s", str(e))

raise

if __name__ == "__main__":

try:

main()

except Exception as e:

logging.error("Application failed: %s", str(e))

sys.exit(1)

coderabbitai · 2025-02-21T19:08:17Z

reelfeeder/core/youtube_downloader.py

+class YoutubeDownloader:
+    def __init__(self, url, logname, output_path) -> None:
+        """
+        Initialization of YoutubeDownloader class
+        """
+        self.channel_ids = ["Oenoclips"]
+        self.url = url
+        self.output_path = output_path
+        self.channel_id = os.getenv("CHANNEL_ID")
+        self.logger = logging
+        self.logger.basicConfig(
+            filename=logname,
+            filemode="a",
+            format="%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s",
+            datefmt="%Y-%m-%d %H:%M:%S",
+            level=logging.DEBUG,
+        )
+        self.youtube = build('youtube', 'v3', developerKey=os.getenv("GOOGLE_API_KEY"))
+


🛠️ Refactor suggestion

Several improvements needed in class initialization.

The hard-coded channel_ids list seems out of place and should be passed as a parameter.

Missing input validation for constructor parameters.

No error handling for missing environment variables.

Logging configuration in constructor could affect global logging state.

Consider this improved implementation:

class YoutubeDownloader: - def __init__(self, url, logname, output_path) -> None: + def __init__(self, url: str, logname: str, output_path: str, channel_ids: list[str] = None) -> None: """ Initialization of YoutubeDownloader class + + Args: + url: YouTube video URL + logname: Path to log file + output_path: Path for downloaded content + channel_ids: List of channel IDs to process + + Raises: + ValueError: If required parameters are invalid + EnvironmentError: If required environment variables are missing """ - self.channel_ids = ["Oenoclips"] + if not all([url, logname, output_path]): + raise ValueError("All parameters must be non-empty") + + self.channel_ids = channel_ids or [] self.url = url self.output_path = output_path - self.channel_id = os.getenv("CHANNEL_ID") + + # Validate environment variables + self.channel_id = os.getenv("CHANNEL_ID") + self.api_key = os.getenv("GOOGLE_API_KEY") + if not all([self.channel_id, self.api_key]): + raise EnvironmentError("Missing required environment variables: CHANNEL_ID, GOOGLE_API_KEY") + + # Setup logging with a unique logger name self.logger = logging - self.logger.basicConfig( - filename=logname, - filemode="a", - format="%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s", - datefmt="%Y-%m-%d %H:%M:%S", - level=logging.DEBUG, - ) - self.youtube = build('youtube', 'v3', developerKey=os.getenv("GOOGLE_API_KEY")) + self.logger = logging.getLogger(__name__) + if not self.logger.handlers: + handler = logging.FileHandler(logname) + handler.setFormatter( + logging.Formatter( + "%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s", + datefmt="%Y-%m-%d %H:%M:%S" + ) + ) + self.logger.addHandler(handler) + self.logger.setLevel(logging.DEBUG) + + self.youtube = build('youtube', 'v3', developerKey=self.api_key)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

class YoutubeDownloader:

def __init__(self, url, logname, output_path) -> None:

"""

Initialization of YoutubeDownloader class

"""

self.channel_ids = ["Oenoclips"]

self.url = url

self.output_path = output_path

self.channel_id = os.getenv("CHANNEL_ID")

self.logger = logging

self.logger.basicConfig(

filename=logname,

filemode="a",

format="%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s",

datefmt="%Y-%m-%d %H:%M:%S",

level=logging.DEBUG,

)

self.youtube = build('youtube', 'v3', developerKey=os.getenv("GOOGLE_API_KEY"))

class YoutubeDownloader:

def __init__(self, url: str, logname: str, output_path: str, channel_ids: list[str] = None) -> None:

"""

Initialization of YoutubeDownloader class

Args:

url: YouTube video URL

logname: Path to log file

output_path: Path for downloaded content

channel_ids: List of channel IDs to process

Raises:

ValueError: If required parameters are invalid

EnvironmentError: If required environment variables are missing

"""

if not all([url, logname, output_path]):

raise ValueError("All parameters must be non-empty")

self.channel_ids = channel_ids or []

self.url = url

self.output_path = output_path

# Validate environment variables

self.channel_id = os.getenv("CHANNEL_ID")

self.api_key = os.getenv("GOOGLE_API_KEY")

if not all([self.channel_id, self.api_key]):

raise EnvironmentError("Missing required environment variables: CHANNEL_ID, GOOGLE_API_KEY")

# Setup logging with a unique logger name

self.logger = logging.getLogger(__name__)

if not self.logger.handlers:

handler = logging.FileHandler(logname)

handler.setFormatter(

logging.Formatter(

"%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s",

datefmt="%Y-%m-%d %H:%M:%S"

)

)

self.logger.addHandler(handler)

self.logger.setLevel(logging.DEBUG)

self.youtube = build('youtube', 'v3', developerKey=self.api_key)

coderabbitai · 2025-02-21T19:08:17Z

reelfeeder/core/youtube_downloader.py

+    def __get_video_info(self):
+        flags = {
+            "--get-id": "id",
+            "--get-title": "title",
+            "--get-duration": "duration",
+            "--get-thumbnail": "thumbnail",
+            "--get-format": "format",
+            "--get-url": "url",
+        }
+        video_info = {}
+        with concurrent.futures.ThreadPoolExecutor(max_workers=len(flags)) as executor:
+            future_to_flag = {
+                executor.submit(self.__run_yt_dlp_command, flag): key
+                for flag, key in flags.items()
+            }
+            for future in concurrent.futures.as_completed(future_to_flag):
+                flag = future_to_flag[future]
+                try:
+                    flag, result = future.result()
+                    if flag in flags:
+                        video_info[flags[flag]] = result
+                except Exception as exc:
+                    print(f"{flag} generated an exception: {exc}")
+        return video_info
+


🛠️ Refactor suggestion

Improve error handling and add validation in video info retrieval.

The current implementation has several issues:

Generic exception handling without specific error recovery

No validation of retrieved video information

No timeout for the entire operation

Consider this improved implementation:

- def __get_video_info(self): + def __get_video_info(self, timeout: int = 60) -> dict: + """ + Retrieve video information using yt-dlp commands concurrently. + + Args: + timeout: Overall operation timeout in seconds + + Returns: + Dictionary containing video information + + Raises: + TimeoutError: If operation times out + ValueError: If required information cannot be retrieved + """ flags = { "--get-id": "id", "--get-title": "title", "--get-duration": "duration", "--get-thumbnail": "thumbnail", "--get-format": "format", "--get-url": "url", } video_info = {} - with concurrent.futures.ThreadPoolExecutor(max_workers=len(flags)) as executor: - future_to_flag = { - executor.submit(self.__run_yt_dlp_command, flag): key - for flag, key in flags.items() - } - for future in concurrent.futures.as_completed(future_to_flag): - flag = future_to_flag[future] - try: - flag, result = future.result() - if flag in flags: - video_info[flags[flag]] = result - except Exception as exc: - print(f"{flag} generated an exception: {exc}") + + required_fields = {"id", "title"} # Minimum required information + + try: + with concurrent.futures.ThreadPoolExecutor(max_workers=len(flags)) as executor: + future_to_flag = { + executor.submit(self.__run_yt_dlp_command, flag): key + for flag, key in flags.items() + } + + # Wait for all futures with timeout + done, not_done = concurrent.futures.wait( + future_to_flag, + timeout=timeout, + return_when=concurrent.futures.ALL_COMPLETED + ) + + # Cancel any pending futures + for future in not_done: + future.cancel() + if not_done: + raise TimeoutError(f"Operation timed out after {timeout} seconds") + + # Process completed futures + for future in done: + flag = future_to_flag[future] + try: + flag, result = future.result() + if flag in flags: + video_info[flags[flag]] = result + except (ValueError, TimeoutError) as exc: + self.logger.error(f"Failed to get {flag}: {exc}") + + # Validate required fields + missing_fields = required_fields - set(video_info.keys()) + if missing_fields: + raise ValueError(f"Missing required video information: {missing_fields}") + + return video_info + + except Exception as exc: + self.logger.error(f"Failed to get video information: {exc}") + raise

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def __get_video_info(self):

flags = {

"--get-id": "id",

"--get-title": "title",

"--get-duration": "duration",

"--get-thumbnail": "thumbnail",

"--get-format": "format",

"--get-url": "url",

}

video_info = {}

with concurrent.futures.ThreadPoolExecutor(max_workers=len(flags)) as executor:

future_to_flag = {

executor.submit(self.__run_yt_dlp_command, flag): key

for flag, key in flags.items()

}

for future in concurrent.futures.as_completed(future_to_flag):

flag = future_to_flag[future]

try:

flag, result = future.result()

if flag in flags:

video_info[flags[flag]] = result

except Exception as exc:

print(f"{flag} generated an exception: {exc}")

return video_info

def __get_video_info(self, timeout: int = 60) -> dict:

"""

Retrieve video information using yt-dlp commands concurrently.

Args:

timeout: Overall operation timeout in seconds

Returns:

Dictionary containing video information

Raises:

TimeoutError: If operation times out

ValueError: If required information cannot be retrieved

"""

flags = {

"--get-id": "id",

"--get-title": "title",

"--get-duration": "duration",

"--get-thumbnail": "thumbnail",

"--get-format": "format",

"--get-url": "url",

}

video_info = {}

required_fields = {"id", "title"} # Minimum required information

try:

with concurrent.futures.ThreadPoolExecutor(max_workers=len(flags)) as executor:

future_to_flag = {

executor.submit(self.__run_yt_dlp_command, flag): key

for flag, key in flags.items()

}

# Wait for all futures with timeout

done, not_done = concurrent.futures.wait(

future_to_flag,

timeout=timeout,

return_when=concurrent.futures.ALL_COMPLETED

)

# Cancel any pending futures

for future in not_done:

future.cancel()

if not_done:

raise TimeoutError(f"Operation timed out after {timeout} seconds")

# Process completed futures

for future in done:

flag = future_to_flag[future]

try:

flag, result = future.result()

if flag in flags:

video_info[flags[flag]] = result

except (ValueError, TimeoutError) as exc:

self.logger.error(f"Failed to get {flag}: {exc}")

# Validate required fields

missing_fields = required_fields - set(video_info.keys())

if missing_fields:

raise ValueError(f"Missing required video information: {missing_fields}")

return video_info

except Exception as exc:

self.logger.error(f"Failed to get video information: {exc}")

raise

coderabbitai · 2025-02-21T19:08:18Z

reelfeeder/core/youtube_downloader.py

+    def get_channel_info(self, channel_id):
+        request = self.youtube.channels().list(
+            part="snippet,statistics",
+            id=channel_id
+        )
+        response = request.execute()
+
+        if 'items' in response:
+            channel = response['items'][0]
+            return {
+                'title': channel['snippet']['title'],
+                'description': channel['snippet']['description'],
+                'subscriber_count': channel['statistics']['subscriberCount'],
+                'view_count': channel['statistics']['viewCount'],
+                'video_count': channel['statistics']['videoCount'],
+                'published_at': channel['snippet']['publishedAt']
+            }
+        else:
+            return None
+


🛠️ Refactor suggestion

Improve error handling and validation in channel info retrieval.

The current implementation lacks proper error handling and validation for API interactions.

Consider this improved implementation:

- def get_channel_info(self, channel_id): + def get_channel_info(self, channel_id: str) -> dict: + """ + Retrieve channel information from YouTube API. + + Args: + channel_id: YouTube channel ID + + Returns: + Dictionary containing channel information + + Raises: + ValueError: If channel_id is invalid or channel not found + RuntimeError: If API request fails + """ + if not channel_id: + raise ValueError("channel_id cannot be empty") + + self.logger.info(f"Fetching info for channel: {channel_id}") + + try: request = self.youtube.channels().list( part="snippet,statistics", id=channel_id ) response = request.execute() if 'items' in response: channel = response['items'][0] - return { + info = { 'title': channel['snippet']['title'], 'description': channel['snippet']['description'], 'subscriber_count': channel['statistics']['subscriberCount'], 'view_count': channel['statistics']['viewCount'], 'video_count': channel['statistics']['videoCount'], 'published_at': channel['snippet']['publishedAt'] } + self.logger.info(f"Successfully retrieved channel info: {info['title']}") + return info else: - return None + raise ValueError(f"Channel not found: {channel_id}") + except Exception as e: + self.logger.error(f"Failed to get channel info: {str(e)}") + raise RuntimeError(f"API request failed: {str(e)}")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def get_channel_info(self, channel_id):

request = self.youtube.channels().list(

part="snippet,statistics",

id=channel_id

)

response = request.execute()

if 'items' in response:

channel = response['items'][0]

return {

'title': channel['snippet']['title'],

'description': channel['snippet']['description'],

'subscriber_count': channel['statistics']['subscriberCount'],

'view_count': channel['statistics']['viewCount'],

'video_count': channel['statistics']['videoCount'],

'published_at': channel['snippet']['publishedAt']

}

else:

return None

def get_channel_info(self, channel_id: str) -> dict:

"""

Retrieve channel information from YouTube API.

Args:

channel_id: YouTube channel ID

Returns:

Dictionary containing channel information

Raises:

ValueError: If channel_id is invalid or channel not found

RuntimeError: If API request fails

"""

if not channel_id:

raise ValueError("channel_id cannot be empty")

self.logger.info(f"Fetching info for channel: {channel_id}")

try:

request = self.youtube.channels().list(

part="snippet,statistics",

id=channel_id

)

response = request.execute()

if 'items' in response:

channel = response['items'][0]

info = {

'title': channel['snippet']['title'],

'description': channel['snippet']['description'],

'subscriber_count': channel['statistics']['subscriberCount'],

'view_count': channel['statistics']['viewCount'],

'video_count': channel['statistics']['videoCount'],

'published_at': channel['snippet']['publishedAt']

}

self.logger.info(f"Successfully retrieved channel info: {info['title']}")

return info

else:

raise ValueError(f"Channel not found: {channel_id}")

except Exception as e:

self.logger.error(f"Failed to get channel info: {str(e)}")

raise RuntimeError(f"API request failed: {str(e)}")

…instructions to streamline guidance for code reviews

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (7)

reelfeeder/core/youtube_downloader.py (7)

13-31: ⚠️ Potential issue

Several critical improvements needed in class initialization.

The initialization has multiple issues that need to be addressed:

Hard-coded channel ID list

No validation of input parameters

Global logging configuration that could affect other modules

Missing error handling for environment variables

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

49-53: ⚠️ Potential issue

Improve security and error handling in yt-dlp command execution.

The current implementation has security risks and lacks proper error handling.

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

54-78: ⚠️ Potential issue

Improve error handling and add validation in video info retrieval.

The current implementation lacks proper error handling and validation.

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

79-95: ⚠️ Potential issue

Improve error handling and validation in channel info retrieval.

The current implementation lacks proper error handling and validation for API interactions.

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

96-103: ⚠️ Potential issue

Improve channel_info method implementation.

The method lacks proper error handling and return type consistency.

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

105-109: ⚠️ Potential issue

Improve logs directory resolution.

The current implementation has several issues with path traversal and directory creation.

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

111-121: ⚠️ Potential issue

Improve main function implementation.

The current implementation has hard-coded values and lacks error handling.

The previous review comment already provided a comprehensive solution for these issues. Please refer to that implementation.

🧹 Nitpick comments (2)

reelfeeder/core/youtube_downloader.py (2)

32-48: Add type hints and improve docstring format.

The __measure_execution_time method needs type hints and PEP 257 compliant docstrings.

     @staticmethod
-    def __measure_execution_time(func, *args, **kwargs):
+    def __measure_execution_time(func: callable, *args, **kwargs) -> tuple[any, float]:
         """
         Measures the execution time of a function.
-
-        :param func: The function to measure.
-        :param args: Positional arguments to pass to the function.
-        :param kwargs: Keyword arguments to pass to the function.
-        :return: A tuple containing the result of the function and the time taken in
-        seconds.
+        
+        Args:
+            func: The function to measure
+            *args: Positional arguments to pass to the function
+            **kwargs: Keyword arguments to pass to the function
+            
+        Returns:
+            Tuple containing (function result, execution time in seconds)
         """

32-47: Improve type hints and error handling in execution time measurement.

The method could benefit from better type hints and error handling.

Consider this improved implementation:

     @staticmethod
-    def __measure_execution_time(func, *args, **kwargs):
+    def __measure_execution_time(func: callable, *args: Any, **kwargs: Any) -> Tuple[Any, float]:
         """
         Measures the execution time of a function.
 
-        :param func: The function to measure.
-        :param args: Positional arguments to pass to the function.
-        :param kwargs: Keyword arguments to pass to the function.
-        :return: A tuple containing the result of the function and the time taken in
-        seconds.
+        Args:
+            func: The function to measure
+            args: Positional arguments to pass to the function
+            kwargs: Keyword arguments to pass to the function
+            
+        Returns:
+            Tuple of (function result, execution time in seconds)
+            
+        Raises:
+            TypeError: If func is not callable
         """
+        if not callable(func):
+            raise TypeError("func must be callable")
+            
         start_time = time.time()
-        result = func(*args, **kwargs)
-        end_time = time.time()
-        duration = end_time - start_time
-        return result, duration
+        try:
+            result = func(*args, **kwargs)
+            return result, time.time() - start_time
+        except Exception as e:
+            logging.error(f"Error executing {func.__name__}: {str(e)}")
+            raise

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 70fedfb and d6875dd.

📒 Files selected for processing (3)

.gitignore (1 hunks)
pyproject.toml (1 hunks)
reelfeeder/core/youtube_downloader.py (1 hunks)

🔇 Additional comments (6)

.gitignore (1)

6-8: LGTM! Appropriate entries added for security and maintenance.

The additions properly exclude sensitive configuration files (.env) and generated content (logs/).
pyproject.toml (1)

16-17: ⚠️ Potential issue
✅ Verification successful

Fix incorrect package name and verify versions.

The package name dotenv is incorrect. It should be python-dotenv.

Let's verify if these are the latest stable versions.

Apply this fix:
-dotenv = "^0.9.9"
+python-dotenv = "^1.0.0"
 google-api-python-client = "^2.161.0"
🌐 Web query:
What is the correct package name and latest version for Python dotenv package?
💡 Result:

The correct Python package for managing .env files is python-dotenv[1][4][6][7], which should be installed via:
pip install python-dotenv
As of February 21, 2025¹, version 1.0.1 is listed as stable in Debian repositories[9], while PyPI shows an unrelated deprecated "dotenv" package (v0.9.9)[10]. The primary GitHub repository maintained by @theskumar confirms v1.x releases are current[2][7].

Key details:

Avoid confusion with Node.js' dotenv (npm)[5]

Python versions ≥3.5 are supported since v0.19[2]

Latest GitHub release: v1.0.1 (January 22, 2024)[2]

Citations:

1: https://www.python-engineer.com/posts/dotenv-python/

2: https://github.com/theskumar/python-dotenv/releases

3: ImportError: cannot import name load_dotenv theskumar/python-dotenv#6

4: https://configu.com/blog/using-py-dotenv-python-dotenv-package-to-manage-env-variables/

5: https://www.npmjs.com/package/dotenv

6: https://solutions.posit.co/write-code/minimum-viable-python/installing-packages/

7: https://github.com/theskumar/python-dotenv

8: https://www.piwheels.org/project/python-dotenv/

9: https://packages.debian.org/source/sid/python-dotenv

10: https://pypi.org/project/dotenv/

🌐 Web query:
What is the latest stable version of google-api-python-client?
💡 Result:

The latest stable version of google-api-python-client as of February 2025 is 2.161.0, released on January 21, 2025[7]. This version includes updates across various Google APIs such as BigQuery, Cloud Functions, and Chat[7].

Key details about recent releases:

Version 2.160.0 was released on January 21, 2025[7]

Version 2.158.0 followed on January 7, 2025 with security center and cloud identity updates[7]

The library requires Python ≥3.7 and caches discovery documents locally instead of fetching them dynamically[1][4]

For production environments using Python ≥3.7, upgrading to v2.x is recommended for improved reliability[1][4]. The package size increased by ~50MB due to embedded discovery documents[1][4].

Citations:

1: https://pypi.org/project/google-api-python-client/

2: https://cloud.google.com/python/docs/reference/language/latest/changelog

3: https://cloud.google.com/python/docs/reference/google-cloud-core/latest/changelog

4: https://github.com/googleapis/google-api-python-client/blob/main/UPGRADING.md

5: https://googleapis.dev/python/google-api-core/latest/changelog.html

6: https://cloud.google.com/python/docs/reference/bigquery/latest

7: https://github.com/googleapis/google-api-python-client/releases

8: https://github.com/googleapis/google-api-python-client/blob/main/CHANGELOG.md

9: https://pypi.org/project/google-api-core/

10: https://developers.google.com/google-ads/api/docs/client-libs
Update dependency configuration in pyproject.toml

The dotenv entry is incorrect and has been replaced with python-dotenv, with its version bumped to the latest stable release. The google-api-python-client version remains up-to-date.

Changes to be applied:

Update package name:

From: dotenv = "^0.9.9"

To: python-dotenv = "^1.0.1"

Retain: google-api-python-client = "^2.161.0"
reelfeeder/core/youtube_downloader.py (4)

13-31: Several improvements needed in class initialization.

The current implementation has multiple issues with initialization, validation, and logging setup that need to be addressed.

Please refer to the previous review comment for the detailed implementation suggestions.

49-53: Multiple improvements needed in core functionality.

The implementation of several core methods needs improvement in areas such as error handling, validation, and security.

Please refer to the previous review comments for detailed implementation suggestions for:

__run_yt_dlp_command

__get_video_info

get_channel_info

channel_info

Also applies to: 54-78, 79-95, 96-103

105-121: Improve utility functions and main entry point.

The implementation of logs_dir and main functions needs improvement in areas such as error handling, path resolution, and environment validation.

Please refer to the previous review comments for detailed implementation suggestions.

49-52:
❓ Verification inconclusive

Verify URL validation and command injection prevention.

Let's verify the security of URL handling and command execution.

🏁 Script executed:
#!/bin/bash
# Description: Check for potential command injection vulnerabilities and URL validation

# Test 1: Search for URL validation
echo "Checking for URL validation..."
rg -l 'urllib\.parse\.urlparse|validators\.url|^https?://' 

# Test 2: Search for potential command injection vulnerabilities
echo "Checking for command injection vulnerabilities..."
ast-grep --pattern 'subprocess.run($cmd, $$$)'

# Test 3: Check if yt-dlp is being used securely
echo "Checking yt-dlp usage patterns..."
rg -l 'yt-dlp.*shell\s*=\s*True|subprocess\.Popen.*shell\s*=\s*True'
Length of output: 437
Action Required: Confirm URL Validation

After reviewing the code in reelfeeder/core/youtube_downloader.py (lines 49-52), the use of a list with subprocess.run (without enabling shell=True) mitigates typical command injection risks when executing external commands. However, the snippet does not include any URL validation. Please ensure that the value of self.url is properly validated elsewhere in the codebase to prevent potential issues with malformed or untrusted URLs.

Command Injection: No vulnerabilities detected since the command is executed as a list.

URL Validation: Confirm that self.url is validated or sanitized prior to being used in this function.

Current date provided in query context: Friday, February 21, 2025 ↩

coderabbitai · 2025-02-21T19:11:53Z

reelfeeder/core/youtube_downloader.py

+import concurrent.futures
+import logging
+import os
+import subprocess
+import time
+
+from dotenv import load_dotenv
+from googleapiclient.discovery import build
+
+load_dotenv()
+


🛠️ Refactor suggestion

Improve global setup and imports.

The current implementation has several issues:

load_dotenv() at global scope can affect other modules

Missing error handling for environment variable loading

Missing type hints for imports

Consider this improved implementation:

import concurrent.futures import logging import os import subprocess import time +from typing import Dict, List, Optional, Tuple, Any from dotenv import load_dotenv from googleapiclient.discovery import build -load_dotenv() +def load_environment() -> None: + """Load environment variables from .env file.""" + if not load_dotenv(): + logging.warning(".env file not found or couldn't be loaded")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

import concurrent.futures

import logging

import os

import subprocess

import time

from dotenv import load_dotenv

from googleapiclient.discovery import build

load_dotenv()

import concurrent.futures

import logging

import os

import subprocess

import time

from typing import Dict, List, Optional, Tuple, Any

from dotenv import load_dotenv

from googleapiclient.discovery import build

def load_environment() -> None:

"""Load environment variables from .env file."""

if not load_dotenv():

logging.warning(".env file not found or couldn't be loaded")

…or handling and environment variable validation fix(youtube_downloader.py): improve logging and exception handling for better debugging refactor(youtube_downloader.py): restructure main function to load environment variables and handle errors gracefully

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (2)

reelfeeder/core/youtube_downloader.py (2)

113-134: 🛠️ Refactor suggestion

Add validation for required video information.

The method should validate that minimum required fields are present in the video info.

         video_info = {}
+        required_fields = {"id", "title"}  # Minimum required information
         try:
             with concurrent.futures.ThreadPoolExecutor(
                 max_workers=len(flags)
             ) as executor:
                 # ... existing code ...
 
+            # Validate required fields
+            missing_fields = required_fields - set(video_info.keys())
+            if missing_fields:
+                raise ValueError(f"Missing required video information: {missing_fields}")
+
             return video_info

131-133: 🛠️ Refactor suggestion

Improve exception handling in video info retrieval.

Using print for error logging and broad exception handling could mask specific errors.

-                    except Exception as exc:
-                        print(f"{flag} generated an exception: {exc}")
+                    except (ValueError, TimeoutError) as exc:
+                        self.logger.error(f"Failed to get {flag}: {exc}")

🧹 Nitpick comments (2)

reelfeeder/core/youtube_downloader.py (2)
1-9: Add type hints for imports.

Consider adding type hints for better code maintainability and IDE support.
 import concurrent.futures
 import logging
 import os
 import subprocess
 import sys
 import time
+from typing import Dict, List, Optional, Tuple, Any
 
 from dotenv import load_dotenv
 from googleapiclient.discovery import build
141-141: Fix typo in docstring.

There's a typo in the docstring: "ca" should be removed.
-        Retrieve ca channel information using the Youtube API.
+        Retrieve channel information using the Youtube API.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d6875dd and 9d44a0b.

📒 Files selected for processing (2)

.coderabbit.yaml (1 hunks)
reelfeeder/core/youtube_downloader.py (1 hunks)

✅ Files skipped from review due to trivial changes (1)

.coderabbit.yaml

coderabbitai · 2025-02-23T07:29:14Z

reelfeeder/core/youtube_downloader.py

+        if not all([os.getenv("GOOGLE_API_KEY")]):
+            raise OSError("Missing required environment variables")


⚠️ Potential issue

Validate all required environment variables.

The environment validation is incomplete. CHANNEL_ID is used later but not validated here.

- if not all([os.getenv("GOOGLE_API_KEY")]): + if not all([os.getenv("GOOGLE_API_KEY"), os.getenv("CHANNEL_ID")]): raise OSError("Missing required environment variables")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

if not all([os.getenv("GOOGLE_API_KEY")]):

raise OSError("Missing required environment variables")

if not all([os.getenv("GOOGLE_API_KEY"), os.getenv("CHANNEL_ID")]):

raise OSError("Missing required environment variables")

coderabbitai · 2025-02-23T07:29:15Z

reelfeeder/core/youtube_downloader.py

+        self.logger = logging
+        self.logger.basicConfig(
+            filename=logname,
+            filemode="a",
+            format="%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s",
+            datefmt="%Y-%m-%d %H:%M:%S",
+            level=logging.DEBUG,
+        )


🛠️ Refactor suggestion

Avoid global logging configuration.

Using basicConfig affects global logging state and could interfere with other modules.

- self.logger = logging - self.logger.basicConfig( - filename=logname, - filemode="a", - format="%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s", - datefmt="%Y-%m-%d %H:%M:%S", - level=logging.DEBUG, - ) + self.logger = logging.getLogger(__name__) + if not self.logger.handlers: + handler = logging.FileHandler(logname) + handler.setFormatter( + logging.Formatter( + "%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s", + datefmt="%Y-%m-%d %H:%M:%S" + ) + ) + self.logger.addHandler(handler) + self.logger.setLevel(logging.DEBUG)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

self.logger = logging

self.logger.basicConfig(

filename=logname,

filemode="a",

format="%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s",

datefmt="%Y-%m-%d %H:%M:%S",

level=logging.DEBUG,

)

self.logger = logging.getLogger(__name__)

if not self.logger.handlers:

handler = logging.FileHandler(logname)

handler.setFormatter(

logging.Formatter(

"%(asctime)s,%(msecs)03d %(name)s %(levelname)s %(message)s",

datefmt="%Y-%m-%d %H:%M:%S"

)

)

self.logger.addHandler(handler)

self.logger.setLevel(logging.DEBUG)

coderabbitai · 2025-02-23T07:29:15Z

reelfeeder/core/youtube_downloader.py

+        if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY")]):
+            raise OSError("Missing required environment variables")
+        downloader = YoutubeDownloader(
+            url=os.getenv("YOUTUBE_URL"),


⚠️ Potential issue

Validate YOUTUBE_URL environment variable.

The URL is retrieved from environment but not validated. This could cause silent failures.

- url=os.getenv("YOUTUBE_URL"), + url=os.getenv("YOUTUBE_URL") or "https://youtube.com/example", # Add default URL

Also add YOUTUBE_URL to the environment validation:

- if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY")]): + if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY"), os.getenv("YOUTUBE_URL")]):

Committable suggestion skipped: line range outside the PR's diff.

… logging for file and console output refactor(youtube_downloader.py): replace basicConfig logging setup with setup_logger for better modularity fix(youtube_downloader.py): handle KeyboardInterrupt to allow graceful application termination

coderabbitai

Actionable comments posted: 2

♻️ Duplicate comments (3)

reelfeeder/core/youtube_downloader.py (3)

269-276: ⚠️ Potential issue

Validate YOUTUBE_URL environment variable.

The URL is retrieved from environment but not validated. This could cause silent failures.

-        if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY")]):
+        if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY"), os.getenv("YOUTUBE_URL")]):
             raise OSError("Missing required environment variables")
         downloader = YoutubeDownloader(
-            url=os.getenv("YOUTUBE_URL"),
+            url=os.getenv("YOUTUBE_URL") or "https://youtube.com/example",  # Add default URL
             logname=logname,
             output_path=os.path.join(directory, "downloads"),
             channel_ids=[os.getenv("CHANNEL_ID")],
         )

47-71: ⚠️ Potential issue

Validate CHANNEL_ID environment variable.

The environment validation is incomplete. CHANNEL_ID is used later but not validated here.

-        if not all([os.getenv("GOOGLE_API_KEY")]):
+        if not all([os.getenv("GOOGLE_API_KEY"), os.getenv("CHANNEL_ID")]):
             raise OSError("Missing required environment variables")

159-161: 🛠️ Refactor suggestion

Replace print with logger in exception handling.

Using print for error logging is inconsistent with the rest of the codebase.

-                        print(f"{flag} generated an exception: {exc}")
+                        self.logger.error(f"{flag} generated an exception: {exc}")

🧹 Nitpick comments (2)

reelfeeder/core/youtube_downloader.py (2)
1-11: Add type hints for imports and consider organizing imports.

The imports look good, but consider adding type hints from typing module for better type safety:
+from typing import Dict, List, Optional, Tuple, Any
286-287: Consider adding cleanup in error handling.

The error handling could benefit from cleanup operations before re-raising the exception.
     except Exception as e:
-        logging.error("Failed to execute main: %s", str(e))
+        logging.error("Failed to execute main: %s", str(e))
+        # Cleanup operations if needed (e.g., close connections, remove temporary files)
         raise

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 9d44a0b and 6d2e958.

📒 Files selected for processing (1)

reelfeeder/core/youtube_downloader.py (1 hunks)

🔇 Additional comments (2)

reelfeeder/core/youtube_downloader.py (2)

12-45: LGTM! Well-structured logging setup.

The setup_logger function is well-implemented with:

Proper separation of file and console handlers

Comprehensive logging format

Clear docstring with type hints

293-295: LGTM! Good handling of KeyboardInterrupt.

The implementation properly handles user interruption with appropriate logging and exit code.

coderabbitai · 2025-02-23T07:44:58Z

reelfeeder/core/youtube_downloader.py

+        except Exception as exec:
+            self.logger.error(f"Failed to get video information: {str(exec)}")
+            raise


⚠️ Potential issue

Fix variable name typo in exception handling.

The variable name exec is a typo and shadows the built-in exec function.

- except Exception as exec: - self.logger.error(f"Failed to get video information: {str(exec)}") + except Exception as exc: + self.logger.error(f"Failed to get video information: {str(exc)}")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

except Exception as exec:

self.logger.error(f"Failed to get video information: {str(exec)}")

raise

except Exception as exc:

self.logger.error(f"Failed to get video information: {str(exc)}")

raise

coderabbitai · 2025-02-23T07:44:58Z

reelfeeder/core/youtube_downloader.py

+        logging.info(
+            "Successfully retrieved video information in %.2f seconds", duration
+        )
+        logging.info("Video Info: %s", video_info)


🛠️ Refactor suggestion

Use class logger instead of root logger.

Using the root logger here is inconsistent with the rest of the code that uses the class logger.

- logging.info( + downloader.logger.info( "Successfully retrieved video information in %.2f seconds", duration ) - logging.info("Video Info: %s", video_info) + downloader.logger.info("Video Info: %s", video_info)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

logging.info(

"Successfully retrieved video information in %.2f seconds", duration

)

logging.info("Video Info: %s", video_info)

downloader.logger.info(

"Successfully retrieved video information in %.2f seconds", duration

)

downloader.logger.info("Video Info: %s", video_info)

…t variables feat(lint): add notification step to GitHub Actions lint workflow for success and failure refactor(youtube_downloader.py): improve exception handling by using consistent variable names for exceptions

coderabbitai

Actionable comments posted: 2

♻️ Duplicate comments (2)

reelfeeder/core/youtube_downloader.py (2)
279-282: ⚠️ Potential issue

Fix incorrect logger usage.

The code incorrectly accesses the logging module through the downloader instance.
-        downloader.logging.info(
+        downloader.logger.info(
             "Successfully retrieved video information in %.2f seconds", duration
         )
-        downloader.logging.info("Video Info: %s", video_info)
+        downloader.logger.info("Video Info: %s", video_info)
67-68: ⚠️ Potential issue

Validate all required environment variables.

The environment validation is incomplete. CHANNEL_ID is used later but not validated here.
-        if not all([os.getenv("GOOGLE_API_KEY")]):
+        if not all([os.getenv("GOOGLE_API_KEY"), os.getenv("CHANNEL_ID")]):
             raise OSError("Missing required environment variables")

🧹 Nitpick comments (1)

.github/workflows/lint.yml (1)
37-37: Remove trailing space.

Remove the trailing space after env: for consistency.
-        env: 
+        env:
🧰 Tools

🪛 YAMLlint (1.35.1)

[error] 37-37: trailing spaces

(trailing-spaces)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 6d2e958 and 2df8f25.

📒 Files selected for processing (3)

.env.example (1 hunks)
.github/workflows/lint.yml (1 hunks)
reelfeeder/core/youtube_downloader.py (1 hunks)

✅ Files skipped from review due to trivial changes (1)

.env.example

🧰 Additional context used

🪛 YAMLlint (1.35.1)

.github/workflows/lint.yml

[error] 37-37: trailing spaces

(trailing-spaces)

🔇 Additional comments (5)

reelfeeder/core/youtube_downloader.py (5)

1-45: LGTM! Well-structured logging setup.

The imports are appropriate, and the setup_logger function follows best practices by:

Setting up both file and console handlers

Using appropriate log formatters

Including comprehensive docstrings

72-88: LGTM! Well-implemented execution time measurement.

The static method is well-documented and correctly implements time measurement functionality.

89-122: LGTM! Robust command execution implementation.

The method includes:

URL validation

Proper error handling with specific exceptions

Comprehensive logging

166-222: LGTM! Well-structured channel information retrieval.

Both methods include:

Comprehensive documentation

Proper error handling

Appropriate logging

224-251: LGTM! Well-implemented utility functions.

Both functions include proper error handling and documentation.

coderabbitai · 2025-02-23T08:02:26Z

reelfeeder/core/youtube_downloader.py

+                    except Exception as exc:
+                        print(f"{flag} generated an exception: {exc}")


🛠️ Refactor suggestion

Use logger instead of print for consistency.

Replace print statement with logger for consistent error handling.

- print(f"{flag} generated an exception: {exc}") + self.logger.error(f"{flag} generated an exception: {exc}")

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

except Exception as exc:

print(f"{flag} generated an exception: {exc}")

except Exception as exc:

self.logger.error(f"{flag} generated an exception: {exc}")

coderabbitai · 2025-02-23T08:02:27Z

.github/workflows/lint.yml

+            LINT_ERRORS=$(cat lint_results.txt | head -n 5)
+            curl -H "Title: Lint Failed" -d "Linting failed for PR #${{ github.event.pull_request.number }}. First few errors: $LINT_ERRORS" ntfy.sh/$NTFY_TOPIC


⚠️ Potential issue

Verify lint_results.txt file creation.

The script attempts to read from lint_results.txt, but there's no step that creates this file.

- name: Lint with Ruff - run: docker run focusfeed poetry run ruff check . + run: docker run focusfeed poetry run ruff check . > lint_results.txt || true

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

LINT_ERRORS=$(cat lint_results.txt | head -n 5)

curl -H "Title: Lint Failed" -d "Linting failed for PR #${{ github.event.pull_request.number }}. First few errors: $LINT_ERRORS" ntfy.sh/$NTFY_TOPIC

LINT_ERRORS=$(cat lint_results.txt | head -n 5)

curl -H "Title: Lint Failed" -d "Linting failed for PR #${{ github.event.pull_request.number }}. First few errors: $LINT_ERRORS" ntfy.sh/$NTFY_TOPIC

- name: Lint with Ruff

run: docker run focusfeed poetry run ruff check . > lint_results.txt || true

…on for completed reviews to enhance communication

… review is finished to streamline communication and updates on pull requests

coderabbitai

Actionable comments posted: 2

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2df8f25 and 0de6bfb.

📒 Files selected for processing (2)

.coderabbit.yaml (2 hunks)
.github/workflows/coderabbit-review-ntfy.yml (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

.coderabbit.yaml

🧰 Additional context used

🪛 actionlint (1.7.4)

.github/workflows/coderabbit-review-ntfy.yml

19-19: "github.event.issue.title" is potentially untrusted. avoid using it directly in inline scripts. instead, pass it through an environment variable. see https://docs.github.com/en/actions/security-guides/security-hardening-for-github-actions for more details

(expression)

🔇 Additional comments (1)

.github/workflows/coderabbit-review-ntfy.yml (1)

1-6: Workflow Trigger Configuration Approved
The workflow is set to trigger on issue comments of type "created" and is clearly named. This setup aligns well with the intended behavior.

coderabbitai · 2025-02-23T08:56:23Z

.github/workflows/coderabbit-review-ntfy.yml

+    if: >
+      github.event.issue.pull_request &&
+      github.event.comment.user.login == 'coderabbitai[bot]' &&
+      (contains(github.event.comment.body, '') || contains(github.event.comment.body, 'Actionable comments posted'))


🛠️ Refactor suggestion

⚠️ Potential issue

Revisit the Job Condition Logic
The condition (contains(github.event.comment.body, '') || contains(github.event.comment.body, 'Actionable comments posted')) uses contains(..., ''), which always evaluates to true. If the intention is to check for an empty comment, consider using an equality check (e.g., github.event.comment.body == '') instead. This will make the intent explicit and avoid redundancy.

coderabbitai · 2025-02-23T08:56:23Z

.github/workflows/coderabbit-review-ntfy.yml

+          env:
+            NTFY_TOPIC: ${{ secrets.NTFY_TOPIC }}
+          run: |
+            curl -X POST \
+              -H "Title: CodeRabbit Review Finished" \
+              -H "Priority: default" \
+              -d "CodeRabbit has finished reviewing Pull Request #${{ github.event.issue.number }}: ${{ github.event.issue.title }}" \
+              https://ntfy.sh/$NTFY_TOPIC


🛠️ Refactor suggestion

⚠️ Potential issue

Mitigate Injection Risks in the Inline Script
The inline script directly interpolates ${{ github.event.issue.title }} in the curl command. As noted by the static analysis hint, this value is potentially untrusted. It is recommended to pass the PR title through an environment variable to mitigate any injection risks. For example, add a PR_TITLE environment variable and reference it in the script.

Below is a suggested diff to implement this change:

- - name: Send ntfy notification - env: - NTFY_TOPIC: ${{ secrets.NTFY_TOPIC }} - run: | - curl -X POST \ - -H "Title: CodeRabbit Review Finished" \ - -H "Priority: default" \ - -d "CodeRabbit has finished reviewing Pull Request #${{ github.event.issue.number }}: ${{ github.event.issue.title }}" \ - https://ntfy.sh/$NTFY_TOPIC + - name: Send ntfy notification + env: + NTFY_TOPIC: ${{ secrets.NTFY_TOPIC }} + PR_TITLE: ${{ github.event.issue.title }} + run: | + curl -X POST \ + -H "Title: CodeRabbit Review Finished" \ + -H "Priority: default" \ + -d "CodeRabbit has finished reviewing Pull Request #${{ github.event.issue.number }}: $PR_TITLE" \ + https://ntfy.sh/$NTFY_TOPIC

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

env:

NTFY_TOPIC: ${{ secrets.NTFY_TOPIC }}

run: |

curl -X POST \

-H "Title: CodeRabbit Review Finished" \

-H "Priority: default" \

-d "CodeRabbit has finished reviewing Pull Request #${{ github.event.issue.number }}: ${{ github.event.issue.title }}" \

https://ntfy.sh/$NTFY_TOPIC

- name: Send ntfy notification

env:

NTFY_TOPIC: ${{ secrets.NTFY_TOPIC }}

PR_TITLE: ${{ github.event.issue.title }}

run: |

curl -X POST \

-H "Title: CodeRabbit Review Finished" \

-H "Priority: default" \

-d "CodeRabbit has finished reviewing Pull Request #${{ github.event.issue.number }}: $PR_TITLE" \

https://ntfy.sh/$NTFY_TOPIC

🧰 Tools

🪛 actionlint (1.7.4)

19-19: "github.event.issue.title" is potentially untrusted. avoid using it directly in inline scripts. instead, pass it through an environment variable. see https://docs.github.com/en/actions/security-guides/security-hardening-for-github-actions for more details

(expression)

…rity and change NTFY_TOPIC to focusfeed_alerts for better organization

coderabbitai

Actionable comments posted: 1

🔭 Outside diff range comments (2)

.coderabbit.yaml (2)
2-2: ⚠️ Potential issue

Critical: Unrecognized key "version"

The key version (line 2) is not defined in the CodeRabbit configuration schema. This unrecognized field will lead to a validation error. Please remove or refactor this key if it’s not needed or update the schema accordingly.
- version: 1
75-82: ⚠️ Potential issue

Critical: Unrecognized key "style"

The style block (lines 75–82) is not permitted by the current schema. This causes a validation error. Remove the style section or update the configuration schema to include it if it is required.
- style:
-   python:
-     formatter: "ruff"
-     max_line_length: 88
-     exclude:
-       - "tests/**"
-       - "docs/**"

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0de6bfb and 537e11c.

📒 Files selected for processing (1)

.coderabbit.yaml (2 hunks)

🧰 Additional context used

🪛 YAMLlint (1.35.1)

.coderabbit.yaml

[error] 90-90: no new line character at the end of file

(new-line-at-end-of-file)

[error] 90-90: trailing spaces

(trailing-spaces)

coderabbitai · 2025-02-26T16:08:18Z

.coderabbit.yaml

+post_review_actions:
+  - name: "Send ntfy notification"
+    run: |
+      curl -H "Title: Code Review Complete" \
+           -H "Priority: default" \
+           -d "CodeRabbit has completed reviewing PR #${PR_NUMBER}" \
+           ntfy.sh/focusfeed_alerts
+


⚠️ Potential issue

Critical: Unrecognized key "post_review_actions" and Formatting Issues

The post_review_actions section (lines 83–90) is not defined in the configuration schema and triggers a validation error. Additionally, YAMLlint has flagged trailing spaces at the end of line 90 and a missing newline at the end of the file. Please remove this section (or update the schema accordingly) and ensure proper file formatting (i.e. no trailing spaces and a newline at the file's end).

- post_review_actions: - - name: "Send ntfy notification" - run: | - curl -H "Title: Code Review Complete" \ - -H "Priority: default" \ - -d "CodeRabbit has completed reviewing PR #${PR_NUMBER}" \ - ntfy.sh/focusfeed_alerts

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

post_review_actions:

- name: "Send ntfy notification"

run: |

curl -H "Title: Code Review Complete" \

-H "Priority: default" \

-d "CodeRabbit has completed reviewing PR #${PR_NUMBER}" \

ntfy.sh/focusfeed_alerts

🧰 Tools

🪛 YAMLlint (1.35.1)

[error] 90-90: no new line character at the end of file

(new-line-at-end-of-file)

[error] 90-90: trailing spaces

(trailing-spaces)

style(youtube_downloader.py): apply consistent quotation style for st…

d6875dd

…rings to improve code readability and maintainability

coderabbitai bot reviewed Feb 21, 2025

View reviewed changes

chore(coderabbit.yaml): remove mention of SOLID principles from tone …

2e52538

…instructions to streamline guidance for code reviews

coderabbitai bot reviewed Feb 21, 2025

View reviewed changes

coderabbitai bot reviewed Feb 23, 2025

View reviewed changes

domabyte added 2 commits February 23, 2025 13:37

feat(coderabbit.yaml): add post review action to send ntfy notificati…

b0e4696

…on for completed reviews to enhance communication

feat(workflow): add GitHub Actions workflow to notify when CodeRabbit…

0de6bfb

… review is finished to streamline communication and updates on pull requests

coderabbitai bot reviewed Feb 23, 2025

View reviewed changes

domabyte self-assigned this Feb 26, 2025

chore(coderabbit.yaml): update notification title and message for cla…

537e11c

…rity and change NTFY_TOPIC to focusfeed_alerts for better organization

coderabbitai bot reviewed Feb 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(youtube_downloader.py): Implement YoutubeDownloader class functionality #21

feat(youtube_downloader.py): Implement YoutubeDownloader class functionality #21

domabyte commented Feb 21, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 21, 2025 •

edited

Loading

`.coderabbit.yaml` has a parsing error

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

coderabbitai bot left a comment

coderabbitai bot Feb 21, 2025

coderabbitai bot Feb 21, 2025

coderabbitai bot Feb 21, 2025

coderabbitai bot Feb 21, 2025

coderabbitai bot Feb 21, 2025

coderabbitai bot Feb 21, 2025

coderabbitai bot Feb 21, 2025

coderabbitai bot left a comment

coderabbitai bot Feb 21, 2025

coderabbitai bot left a comment

coderabbitai bot Feb 23, 2025

coderabbitai bot Feb 23, 2025

coderabbitai bot Feb 23, 2025

coderabbitai bot left a comment

coderabbitai bot Feb 23, 2025

coderabbitai bot Feb 23, 2025

coderabbitai bot left a comment

coderabbitai bot Feb 23, 2025

coderabbitai bot Feb 23, 2025

coderabbitai bot left a comment

coderabbitai bot Feb 23, 2025

coderabbitai bot Feb 23, 2025

coderabbitai bot left a comment

coderabbitai bot Feb 26, 2025

-def logs_dir():
-    current_dir = os.getcwd()
-    parent = os.path.join(os.path.join(current_dir, os.pardir), os.pardir)
-    return os.path.abspath(parent)
+def logs_dir() -> str:
+    """
+    Resolve and create logs directory.
+    Returns:
+        Absolute path to logs directory
+    Raises:
+        OSError: If directory creation fails
+    """
+    try:
+        # Use a more reliable method to get the project root
+        project_root = os.path.dirname(os.path.dirname(os.path.dirname(__file__)))
+        logs_path = os.path.join(project_root, "logs")
+        # Create logs directory if it doesn't exist
+        os.makedirs(logs_path, exist_ok=True)
+        return logs_path
+    except OSError as e:
+        raise OSError(f"Failed to create logs directory: {str(e)}")

-    def __run_yt_dlp_command(self, flag):
-        command = ["yt-dlp", flag, self.url]
-        result = subprocess.run(command, capture_output=True, text=True)
-        return flag, result.stdout.strip()
+    def __run_yt_dlp_command(self, flag: str, timeout: int = 30) -> tuple[str, str]:
+        """
+        Execute yt-dlp command with the given flag.
+        Args:
+            flag: yt-dlp command flag
+            timeout: Command timeout in seconds
+        Returns:
+            Tuple of (flag, command output)
+        Raises:
+            ValueError: If command execution fails
+            TimeoutError: If command execution times out
+        """
+        if not isinstance(self.url, str) or not self.url.startswith(('http://', 'https://')):
+            raise ValueError(f"Invalid URL: {self.url}")
+        command = ["yt-dlp", flag, self.url]
+        try:
+            result = subprocess.run(
+                command,
+                capture_output=True,
+                text=True,
+                timeout=timeout,
+                check=True
+            )
+            return flag, result.stdout.strip()
+        except subprocess.TimeoutExpired:
+            self.logger.error(f"Command timed out after {timeout} seconds: {command}")
+            raise TimeoutError(f"Command timed out: {command}")
+        except subprocess.CalledProcessError as e:
+            self.logger.error(f"Command failed with exit code {e.returncode}: {command}")
+            self.logger.error(f"Error output: {e.stderr}")
+            raise ValueError(f"Command failed: {e.stderr}")

-def main():
-    directory = logs_dir()
-    logname = os.path.join(directory, "logs", "youtube_downloader.log")
-    downloader = YoutubeDownloader(
-        "https://youtube.com/shorts/GT2m7AwLg9o?si=iuViBBk4DXG92BNR", logname, "output"
-    )
-    downloader.channel_info()
-if __name__ == "__main__":
-    main()
+def main() -> None:
+    """
+    Main entry point for the YouTube downloader.
+    Environment variables required:
+    - CHANNEL_ID: YouTube channel ID
+    - GOOGLE_API_KEY: Google API key for YouTube Data API
+    """
+    try:
+        # Load environment variables
+        load_dotenv()
+        directory = logs_dir()
+        logname = os.path.join(directory, "logs", "youtube_downloader.log")
+        # Configure logging for the main function
+        logging.basicConfig(
+            level=logging.INFO,
+            format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'
+        )
+        logger = logging.getLogger(__name__)
+        # Validate environment variables
+        if not all([os.getenv("CHANNEL_ID"), os.getenv("GOOGLE_API_KEY")]):
+            raise EnvironmentError("Missing required environment variables")
+        downloader = YoutubeDownloader(
+            url=os.getenv("YOUTUBE_URL", "https://youtube.com/shorts/GT2m7AwLg9o"),
+            logname=logname,
+            output_path=os.path.join(directory, "downloads"),
+            channel_ids=[os.getenv("CHANNEL_ID")]
+        )
+        # Get channel information
+        video_info, duration = downloader.channel_info()
+        logger.info("Successfully retrieved video information in %.2f seconds", duration)
+    except Exception as e:
+        logging.error("Failed to execute main: %s", str(e))
+        raise
+if __name__ == "__main__":
+    try:
+        main()
+    except Exception as e:
+        logging.error("Application failed: %s", str(e))
+        sys.exit(1)

		if not all([os.getenv("GOOGLE_API_KEY")]):
		raise OSError("Missing required environment variables")

		except Exception as exc:
		print(f"{flag} generated an exception: {exc}")

		LINT_ERRORS=$(cat lint_results.txt \| head -n 5)
		curl -H "Title: Lint Failed" -d "Linting failed for PR #${{ github.event.pull_request.number }}. First few errors: $LINT_ERRORS" ntfy.sh/$NTFY_TOPIC

feat(youtube_downloader.py): Implement YoutubeDownloader class functionality #21

Are you sure you want to change the base?

feat(youtube_downloader.py): Implement YoutubeDownloader class functionality #21

Conversation

domabyte commented Feb 21, 2025 • edited by coderabbitai bot Loading

Summary by CodeRabbit

coderabbitai bot commented Feb 21, 2025 • edited Loading

.coderabbit.yaml has a parsing error

Walkthrough

Changes

Sequence Diagram(s)

Poem

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

Footnotes

coderabbitai bot Feb 21, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot Feb 23, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Feb 26, 2025

Choose a reason for hiding this comment

domabyte commented Feb 21, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 21, 2025 •

edited

Loading

`.coderabbit.yaml` has a parsing error