Added data extraction script with CSV and Excel export#25
Open
Emmanuel10701 wants to merge 1 commit into4GeeksAcademy:mainfrom
Open
Added data extraction script with CSV and Excel export#25Emmanuel10701 wants to merge 1 commit into4GeeksAcademy:mainfrom
Emmanuel10701 wants to merge 1 commit into4GeeksAcademy:mainfrom
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Issue: Improve Data Extraction & Handling
Description
Currently, the data extraction script fetches revenue data from a webpage and stores it in CSV and Excel files. However, some improvements are needed:
✅ Enhancements Required:
🔍 Better Error Handling: Handle missing or inconsistent data to prevent script failures.
📊 Data Formatting: Convert revenue values into a standard numerical format (e.g., remove $ and B to store as billions).
🚀 Optimize Performance: Improve parsing logic for efficiency.
🛠 Add Logging: Provide meaningful error messages and logs for debugging.
🎨 Code Cleanup: Refactor code for readability and maintainability.
💡 Suggestions:
Use try-except to handle errors gracefully.
Implement regular expressions (re) for better data parsing.
Add logging (logging module) to track errors.
✨ Proposed Solution
A PR will be created to address these improvements and ensure the extracted data is accurate, formatted correctly, and efficiently stored.
🛠 Steps to Reproduce
1️⃣ Run the current script.
2️⃣ Check for any missing values or incorrectly formatted data.
3️⃣ Observe any performance issues or parsing errors.
🎯 Expected Outcome
✔ The script should extract clean, structured, and correctly formatted data without errors.
✔ The revenue should be stored as numerical values for analysis.
✔ The script should handle missing values smoothly.
📌 Priority: High 🔥
🎯 Goal: Improve reliability & efficiency of the script.
Would you like me to add a Pull Request (PR) template as well? 🚀😃