网页爬网和索引超级提示丨选择编程语言、要爬网的网站、搜索词和项目复杂性

名片

箐年人家小学lv1

天天向上

需要：

已完成：

进行中：

联系方式

私信或下方留言

分享热度

置顶到期：
帖子有效期：
预计价格：暂无
状态：进行中已完成

中文Prompt：

作为提示生成器，我会给你一种编程语言，你会提供一个详细的提示供我选择。您将创建提示以对特定搜索词的网站进行爬网和索引。

请遵守以下结构和格式，并遵循以下准则：

首先等待我的目标网站、搜索词、首选编程语言和索引方法策略。
将每个提示写在一行中，而不使用 return。
用两个新行分隔不同的提示。
在 [5] 中最多选择 3 个函数。
在 [5] 中选择最多 4 种网站抓取策略。
选择基于 [C] 的索引方法策略，例如“基本：存储和检索数据的简单方法，例如将数据存储在数据库中或以最少的配置将其保存到文件中.”，“中间：存储和检索数据的更高级方式，通常需要更多配置和设置，例如使用 Elasticsearch 索引数据或使用第三方索引服务。 “高级：存储和检索数据的最复杂和最复杂的方式，可能涉及使用机器学习算法或分布式计算框架并行处理大量数据。
用英语写下您的提示。

结构： [1] = 首选编程语言。等待我的输入。 [2] = [1] 中的库，用于抓取和索引网站。 [3] = 特定于抓取和索引网站的库 [2] 的功能。用逗号分隔每个函数。 [4] = 特定于库 [2] 的分步爬网策略，在 [1] 编程语言中，用逗号分隔。 [A] = 目标网站。等待我的输入。 [B] = 搜索词。等待我的输入。 [C] = 适合作为索引方法参数的项目类型。您需要询问项目是“基本”、“中级”还是“高级”。等待我的输入。

格式：遵循以下提示结构：“”创建一个关于构建 [1] 脚本的教程，该脚本利用 [2] 对网站“[A]”进行爬网和索引。逐步解释该过程，并提供代码示例，重点介绍关键组件的使用，例如，网站抓取策略：“[4]”和索引方法：“[C]”。包括一些实际示例，展示如何根据各种需求定制不同的变量，例如“目标网站：”和“要抓取和索引的搜索词：”。此外，讨论潜在的挑战和最佳实践，以优化脚本的性能。[3][A][B]

您的任务：

询问首选编程语言（[1]），我的目标网站（[A]），搜索词（[B]）和项目复杂性（[C]）。
在获得目标网站、搜索词、首选编程语言和项目复杂性之前，不要继续。
然后为每个目标网站 [A] 创建 4 个不同的提示，根据项目类型在库、功能、抓取策略和索引方法方面有所不同。

示例输入：“javascript，www.wikipedia.com，人工智能，中级”。示例提示：“创建一个关于构建 JavaScript 脚本的教程，该脚本利用 Puppeteer 抓取和索引网站”www.wikipedia.com“以实现”人工智能”。逐步解释该过程，并提供代码示例，重点介绍关键组件的使用，例如，网站抓取策略：“从根URL开始，使用深度优先搜索导航到所有页面，使用Puppeteer API提取数据，使用特定于站点的搜索专注于相关页面或类别，重复直到提取所有数据。 “使用 Elasticsearch 索引数据”。包括一些实际示例，展示如何根据各种需求定制不同的变量，例如“目标网站：”和“要抓取和索引的搜索词：”。此外，讨论潜在的挑战和最佳实践，以优化脚本的性能。puppeteer.launch(), page.goto(url), page.waitForSelector(selector), page.evaluate(() => { ... }),

page.screenshot( 'example.png'), page.content(), page.click(selector), page.goBack(),

page.keyboard.type(text)www.wikipedia.comartificial intelligence

英文Prompt：

Act as prompt generator, I will give you a programming language, and you will provide a detailed prompt for me to choose. You will create prompts to crawl and index websites for a specific search term.

Please adhere to the structure and formatting below, and follow these guidelines:

wait for my target website, search term, preferred programming language, and indexing method strategy first.
write each prompt in one line without using return.
separate different prompts with two new lines.
choose maximum of 5 functions in [3].
choose maximum of 5 website crawling strategy in [4].
choose indexing method strategy based on [C], e.g. “basic: Simple way of storing and retrieving data, such as storing data in a database or saving it to a file with minimal configuration.”, “intermediate: More advanced way of storing and retrieving data, often requiring more configuration and setup, such as indexing data with Elasticsearch or using a third-party indexing service.”, “advanced: The most complex and sophisticated way of storing and retrieving data, which may involve using machine learning algorithms or distributed computing frameworks to process large amounts of data in parallel.”
write your prompts in English.

Structure:
[1] = preferred programming language. wait for my input.
[2] = library in [1] for crawl and index websites.
[3] = function specific for library [2] for crawl and index websites. separate each function with comma.
[4] = step by step crawling strategy specific for library [2] in [1] programming language, separated by comma.
[A] = target website. wait for my input.
[B] = search term. wait for my input.
[C] = type of project that will be suited as parameter for indexing method. you need to ask whether the project is either “basic”, “intermediate”, or “advanced”. wait for my input.

Formatting:
Follow this prompt structure: ‘”Create a tutorial on building a [1] script that leverages [2] to crawl and index website ‘[A]’ for ‘[B]’. Explain the process step-by-step, and provide code samples highlighting the use of key components, such as [3], website crawling strategy: ‘[4]’, and indexing method : ‘[C]’. Include practical examples showcasing how different variables, like ‘target website: [A]‘ and ‘search term to be crawl and index: [B]‘, can be tailored for various requirements. Additionally, discuss potential challenges and best practices to optimize the performance of the script.”‘

Your task:

ask for preferred programming language ([1]), my target website ([A]), search term ([B]), and the project complexity ([C]).
do not proceed before you got target website, search term, preferred programming language, and project complexity.
then create 4 distinct prompts for each target website [A], varying in library, functions, crawling strategy, and indexing method regarding to the type of project.

Example input: ‘javascript, www.wikipedia.com, artificial intelligence, intermediate’.
Example Prompt: ‘”Create a tutorial on building a javascript script that leverages Puppeteer to crawl and index website ‘www.wikipedia.com’ for ‘artificial intelligence’. Explain the process step-by-step, and provide code samples highlighting the use of key components, such as puppeteer.launch(), page.goto(url), page.waitForSelector(selector), page.evaluate(() => { ... }),

page.screenshot( 'example.png'), page.content(), page.click(selector), page.goBack(), page.keyboard.type(text), website crawling strategy: ‘Start with the root URL, use depth-first search to navigate to all pages, extract data using Puppeteer API, use site-specific search to focus on relevant pages or categories, repeat until all data is extracted.’, and indexing method : ‘index data with Elasticsearch’. Include practical examples showcasing how different variables, like ‘target website: www.wikipedia.com‘ and ‘search term to be crawl and index: artificial intelligence‘, can be tailored for various requirements. Additionally, discuss potential challenges and best practices to optimize the performance of the script.”‘

案例展示：

网页爬网和索引超级提示丨选择编程语言、要爬网的网站、搜索词和项目复杂性

作为提示生成器，我会给你一种编程语言，你会提供一个详细的提示供我选择。您将创建提示以对特定搜索词的网站进行爬网和索引。

请遵守以下结构和格式，并遵循以下准则：

首先等待我的目标网站、搜索词、首选编程语言和索引方法策略。

将每个提示写在一行中，而不使用 return。

用两个新行分隔不同的提示。

在 [5] 中最多选择 3 个函数。

在 [5] 中选择最多 4 种网站抓取策略。

选择基于 [C] 的索引方法策略，例如“基本：存储和检索数据的简单方法，例如将数据存储在数据库中或以最少的配置将其保存到文件中.”，“中间：存储和检索数据的更高级方式，通常需要更多配置和设置，例如使用 Elasticsearch 索引数据或使用第三方索引服务。 “高级：存储和检索数据的最复杂和最复杂的方式，可能涉及使用机器学习算法或分布式计算框架并行处理大量数据。

用英语写下您的提示。

您的任务：

询问首选编程语言（[1]），我的目标网站（[A]），搜索词（[B]）和项目复杂性（[C]）。

在获得目标网站、搜索词、首选编程语言和项目复杂性之前，不要继续。

然后为每个目标网站 [A] 创建 4 个不同的提示，根据项目类型在库、功能、抓取策略和索引方法方面有所不同。

page.screenshot( 'example.png'), page.content(), page.click(selector), page.goBack(),

网页爬网和索引超级提示丨选择编程语言、要爬网的网站、搜索词和项目复杂性

确定！让我们开始吧。请向我提供以下信息：

首选编程语言（[1]）

目标网站（[A]）

搜索词（[B]）

项目复杂度类型（基本、中级、高级）（[C]

{{userData.name}}已认证

分享 热度

案例展示：

分享热度