DP2 Beginner’s Guide
Welcome to DP2 Beginner's Guide!
DP2 is a web data collection tool that helps you efficiently extract required data from websites.
This guide will teach you the complete process of configuring and using DP2 for web data collection.
1. Introduction
2. System Architecture
3. User Interface
4. Steps and Extraction Guides
- Step Configuration ① - Category Step
- Jexter Configuration - Extract Category Information
- Step Configuration ② - Total Page Step
- Jexter Configuration - Get Total Pages
- Step Configuration ③ - List Step
- Jexter Configuration - Extract List Page Information
- Step Configuration ④ - Detail Step
- Extracting Drug Information in
detail_step - Step Configuration ⑤ - Attachment Step
5. API Configuration
6. Post Configuration
7. Data Management
8. Monitoring and Logging
9. Real-world Applications
10. Simplifying Data Extraction with Jexter
- Simplifying Data Extraction with Jexter I
- Simplifying Data Extraction with Jexter:
Parent- Common Forms of XPath Expressions Include:
- Simplifying XPath Using
ParentConfiguration - Example 1: Nested Structure of Medicine Information
- Example 2: Columnar Display of Medicine Components and Effects
- Example 3: Medicine Reviews and User Ratings Combination
- Example 4: Medicine Instructions and Warning Information
- Example 5: Multi-Functional Medicine Page
- Conclusion
- Simplifying Data Extraction with Jexter III
- Total Row: Defining the Total Number of Drugs to Extract
- Parent: Streamlining the XPath Configuration
- Elements: Specifying Drug Information to Extract
- Combining the Three Aspects for Efficient Drug Data Extraction
prefix,postfix, anddefault- Execution Order of Extraction with the Jexter in DP2
- Comprehensive Example
- Conclusion
11. Querying Techniques
12. Tips and Troubleshooting