The ONLY ACTUALLY ZERO CODE n8n LLM SCRAPING SYSTEM
Try our SEO tool: https://harborseo.ai/
Work with us: https://calendly.com/incomestreamsurf...

Welcome to this in-depth tutorial, where I show you how to build a custom web scraping system using no-code tools like n8n and Jina. In this video, I walk you through the entire process of scraping any webpage without relying on expensive APIs like Perplexity, Serp API, or Crawford AI. Learn how to set up your workflow from scratch, generate targeted search terms, scrape Google results and sitemaps, extract relevant data using an Information Extractor node (powered by OpenAI o3 Mini), and summarize the results into useful JSON outputs.

How to create a no-code scraping workflow with n8n and Jina.
Setting up keyword inputs to generate multiple search terms.
Scraping the top 5 Google results for any query and converting them into LLM-readable text.
Extracting data from complex sitemaps and handling large datasets using chunking techniques.
Configuring HTTP Request nodes, importing cURL commands, and managing content formats (cleaned markdown vs. plain text).
Customizing your extraction schema to capture statistics, sentiment, source URLs, and more.
Tips and best practices for improving your prompt and extraction accuracy.

Chapters:
00:00 – Introduction & Overview
Welcome and brief overview of the no-code scraping project.
01:00 – Why Build a No-Code Scraping System?
Discussion on avoiding expensive tools (Perplexity, Serp API, Crawford AI) and the benefits of a custom solution.
03:00 – Introducing Jina: The No-Code Scraping Tool
Overview of Jina and why it's perfect for this project.
04:00 – Workflow Setup: Keyword Input & Generating Search Terms
How a new keyword is added to a Google Sheet and used to generate five search terms.
05:00 – Scraping Google Results with Jina Requests
Demonstration of how each generated search term is used to scrape the top 5 Google results.
06:00 – Using the Information Extractor Node (o3 Mini)
Explanation of extracting and summarizing data from the scraped results using JSON mode.
07:00 – Types of Jina Scrapes: s.jina.ai vs. r.jina.ai
Differentiating between scraping search results and scraping specific URLs (sitemap pages).
08:00 – Scraping a Sitemap Index: Process & Challenges
How to handle a sitemap index that contains multiple sitemap URLs and why chunking might be needed.
09:00 – Extracting Data from Large Sitemaps
Techniques for splitting and processing large sitemap outputs for useful URL extraction.
10:00 – Feeding Sitemap Data to the Second Jina Scraper
Using the extracted sitemap URLs to scrape individual pages for images, context, and additional data.
11:00 – Configuration Tips: API Keys, Content Format, & Exclude Selectors
Best practices for setting up your nodes: leaving defaults, adding headers/footers, and saving tokens.
12:00 – Setting Up HTTP Request Nodes & Importing cURL
How to import cURL commands into n8n and wire up your HTTP Request nodes.
13:00 – Wiring Up Variable Jina Scraping URLs & Testing Outputs
Demonstrating the connection of dynamic scraping URLs and testing the sitemap extraction process.
14:00 – Switching Output Formats: Cleaned Markdown vs. Plain Text
How to adjust the output for a lighter LLM context window.
15:00 – Extracting Relevant URLs Using the Information Extractor Node
Setting up the node to filter and list URLs relevant to your keyword.
16:00 – Integrating OpenAI o3 Mini for JSON Output Processing
Using OpenAI's o3 Mini to process and format the extracted data as JSON.
17:00 – Setting Up the Second Jina Scraper for Detailed Extraction
How to scrape each individual page for images, context, and business details.
18:00 – Customizing Your JSON Schema for Data Extraction
Defining your schema to include statistics, sentiment, and source URLs for each scraped page.
19:00 – Reviewing and Summarizing Scraped Data
A look at the summarized output and how it consolidates key information.
20:00 – Finalizing the Scraping Workflow & Article Generation
How to integrate all nodes to write a complete article from the scraped data.
21:00 – Recap: Building Your Own No-Code Web Scraping System
A summary of the entire process and its benefits for targeted data extraction.
22:00 – Troubleshooting & Best Practices
Tips for refining your workflow, improving prompt accuracy, and ensuring reliable data extraction.
23:00 – Advanced Customizations & Prompt Improvements
Ideas for further tweaking your system to scrape images, sentiment, pricing, and more.
24:00 – Optimizing for Your Specific Niche
How to adapt the system for any niche by changing schemas and extraction parameters.
25:00 – Final System Overview & Next Steps
A complete recap of your custom scraping setup and potential future enhancements.
26:00 – Q&A, Final Testing, & Adjustments
Reviewing the final outputs and discussing common issues and fixes.
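
The video itself is no-code, but the two kinds of Jina scrape (search results vs. a specific URL) and the chunking of large sitemap output can be sketched in a few lines of Python for anyone curious what the HTTP Request nodes are doing. This is only a rough sketch under assumptions: the `https://s.jina.ai/` and `https://r.jina.ai/` prefixes reflect Jina's public Reader endpoints as I understand them, the function names and the 8000-character limit are hypothetical, and nothing here is taken from the workflow shown in the video. It only builds URLs and splits text; it makes no network requests.

```python
from urllib.parse import quote

# Assumed endpoint prefixes (check Jina's current docs before relying on them).
SEARCH_BASE = "https://s.jina.ai/"  # search the web for a query
READER_BASE = "https://r.jina.ai/"  # read one specific URL as LLM-friendly text


def search_url(query: str) -> str:
    """Build a search-style request URL for a generated search term."""
    return SEARCH_BASE + quote(query, safe="")


def reader_url(page_url: str) -> str:
    """Build a read-style request URL for one page (for example a sitemap)."""
    return READER_BASE + page_url


def chunk_text(text: str, max_chars: int = 8000) -> list[str]:
    """Split text on line boundaries so that no chunk exceeds max_chars.

    A single line longer than max_chars is hard-split. Joining the chunks
    gives back the original text unchanged.
    """
    chunks: list[str] = []
    current: list[str] = []
    size = 0
    for line in text.splitlines(keepends=True):
        # Hard-split any oversized line into max_chars-sized pieces.
        while len(line) > max_chars:
            if current:
                chunks.append("".join(current))
                current, size = [], 0
            chunks.append(line[:max_chars])
            line = line[max_chars:]
        # Start a new chunk if this line would overflow the current one.
        if current and size + len(line) > max_chars:
            chunks.append("".join(current))
            current, size = [], 0
        current.append(line)
        size += len(line)
    if current:
        chunks.append("".join(current))
    return chunks
```

Each chunk could then be passed to the extraction step separately, which keeps a large sitemap from overflowing the model's context window.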
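
For the extraction schema covering statistics, sentiment, and source URLs, a possible shape is sketched below as a JSON-Schema-style Python dict with a small dependency-free check. The field names (`statistics`, `sentiment`, `source_url`), the three sentiment labels, and the `check_record` helper are all hypothetical illustrations, not the schema used in the video.

```python
# Hypothetical schema: field names and allowed values are assumptions.
EXTRACTION_SCHEMA = {
    "type": "object",
    "properties": {
        "statistics": {"type": "array", "items": {"type": "string"}},
        "sentiment": {
            "type": "string",
            "enum": ["positive", "neutral", "negative"],
        },
        "source_url": {"type": "string"},
    },
    "required": ["statistics", "sentiment", "source_url"],
}


def check_record(record: dict) -> list[str]:
    """Return a list of problems; an empty list means the record fits the sketch schema."""
    problems: list[str] = []

    # Every required field must be present.
    for key in EXTRACTION_SCHEMA["required"]:
        if key not in record:
            problems.append(f"missing field: {key}")

    # statistics: a list of strings.
    if "statistics" in record:
        stats = record["statistics"]
        if not (isinstance(stats, list) and all(isinstance(s, str) for s in stats)):
            problems.append("statistics must be a list of strings")

    # sentiment: one of the allowed labels.
    allowed = EXTRACTION_SCHEMA["properties"]["sentiment"]["enum"]
    if "sentiment" in record and record["sentiment"] not in allowed:
        problems.append("sentiment must be one of: " + ", ".join(allowed))

    # source_url: a plain string.
    if "source_url" in record and not isinstance(record["source_url"], str):
        problems.append("source_url must be a string")

    return problems
```

Adapting the system to another niche would then mostly mean changing the properties in the schema, for example adding a pricing or image field.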