17 Commits

Author SHA1 Message Date
81ea656332 Add unit price and unit parsing for items
This update enhances the `Item` structure to include `UnitPrice` and `Unit` fields. Additional logic is implemented to extract and parse unit pricing details from the HTML, improving data accuracy and granularity.
2025-01-21 19:42:25 -05:00
6de455b1bd Add price extraction and validate URL structure in parsers
Added price field to Item struct in AisleGopher and implemented logic to extract price data. Updated Wegmans parser to validate URL structure by ensuring the second segment is "product". These changes improve data accuracy and error handling.
2025-01-20 13:00:59 -05:00
f37e60dddc Add Wegmans module to fetch item details and prices
Introduce functionality to retrieve item details, including name and price, from Wegmans using a browser-based scraper. This includes a CLI tool to execute searches and robust error handling for URL validation and browser interactions.
2025-01-20 12:28:29 -05:00
654976de82 Add AisleGopher integration for data extraction
Introduced a new package and command for extracting data from aislegopher.com, including URL parsing and item retrieval. Updated dependencies in go.mod to support the new functionality. Additionally, refined import structure in the DuckDuckGo integration.
2025-01-20 02:16:32 -05:00
e8de488d2b Update CSS selector for extracting titles in DuckDuckGo parser
Replaced the overly complex CSS selector with a simplified "h2" selector for extracting titles. This change improves maintainability and ensures accurate title extraction from the updated DOM structure.
2025-01-16 21:37:38 -05:00
67a3552747 Add DuckDuckGo integration for search functionality
Implemented a DuckDuckGo search module with configurable SafeSearch and regional settings. Added a CLI tool to perform searches via DuckDuckGo using browser automation, supporting flags for customization.
2025-01-16 20:45:37 -05:00
eec94ec708 Reorder imports in main.go for better organization.
Moved the local package import to align with standard Go import grouping conventions. This improves code readability and maintains a consistent structure.
2025-01-16 20:45:23 -05:00
691ae400d1 Add Google search integration with CLI support
Introduce a Google search integration, including a Go package for performing searches with configurable parameters (e.g., language, region) and a CLI tool for executing search queries. Refactor archive CLI import ordering for consistency.
2025-01-16 16:56:05 -05:00
2ca2bb0742 close playwright instance on browser close 2025-01-01 22:48:12 -05:00
8ad5a34f2d Added global screenshot shortcut 2024-12-26 22:20:07 -05:00
36707dec17 added useragents to go-extractor 2024-12-24 12:15:48 -05:00
276b7ad38d removed some unused files 2024-12-23 18:24:14 -05:00
567a9f9212 added archive, megamillions, and powerball site logic 2024-12-23 03:18:50 -05:00
5e924eb3f9 changed browser api to return pages that can be acted on, not strictly contents 2024-12-17 23:16:13 -05:00
23334991b1 make screenshot update cookies 2024-12-14 00:50:25 -05:00
a51f8200ea added screenshots 2024-12-09 13:51:00 -05:00
cbd6682257 initial commit 2024-12-07 03:53:46 -05:00