|
![]() |
![]() |
Software That Makes Your Browsing Time Easier, A Lot More Productive, And More Efficient! |
|
Downloads |
|
Register |
|
Screenshots |
|
|
|
FAQ |
|
Affiliates |
|
Contacts |
|
||
![]() |
||||||||||||||||
![]() |
||||||||||||||||
![]() |
![]() |
![]() |
||||||||||||||
| Online Manual | |||||||||||||||||||||||||||||||||||
How to Use FiltersYou can use filters to limit the paths and types of files the spider will explore. For instance, you can tell the spider not to download graphics embedded on webpages, to download only from a particular website, not to download from a certain directory on a website, or to exclude particular websites. The scope of filters can be project-wide or be limited to a single task. Project-wide filters are set in the Project Properties dialog box. To launch Project Properties window, click on the Properties button on the Project Toolbar. Task filters are set in the Task Properties dialog box. To launch Task Properties window, select a task and click on the Properties button on the Task Toolbar or click on the New Task button to create and set properties of a new task. Task filters are set in the Task Properties dialog box. Task filters are combined with current project filters using AND logical operation every time the program needs to decide whether to retrieve a URL or not. Thus, project filters affect every task in a project. There are four types of filters:
You can also filter files by size. You can specify that the program should retrieve pages or files that are no more than a certain maximum size and/or no less than a certain minimum size. Type in the maximum size or a range of sizes in the Max Size or Min Size - Max Size text box for each type of filter. This is an optional parameter. If you do not provide the sizes, the program will download files regardless of their sizes. Pattern Matching Filters:This is a logical expression that is evaluated for each URL of a task to determine whether the program should retrieve the URL. Logical expression can evaluate either to true or to false. If a filter applied to a URL evaluates to true, the URL is accepted and put into the queue to be downloaded. If a filter evaluates to false, a URL is not accepted and not retrieved. The operands of the logical expression are URLs or URL patterns. The operators are AND, OR, and NOT (operators can be written in lowercase or uppercase). Expressions can include subexpressions enclosed in parenthesis. URL patterns can include two kinds of wildcard characters: an asterisk (*) and a question mark (?).
Note that a filter is not a list of URLs that must be included or excluded. This is a logical expression that is evaluated for each URL. Each URL must match the whole filter to be accepted. This is why you must use OR to join different URLs. If you join two URL patterns with AND, it will mean that, to be accepted, a URL must match both patterns. Joining two fully qualified URLs (not containing wildcard characters) with AND will have no sense at all. This is a logical expression that is evaluated for each URL to decide whether to download it or not. You can check if a URL is accepted or rejected by a task filter on the Test Filters page. To view the Test Filters page, click on a task and select Test Filters from the Task Menu. Type in a URL, select a type of filter, and press the Test button. The page will reload showing you the result of the test (accepted or not) as well as the detailed explanation of why the URL was accepted or rejected by the filter. Every pattern in both the task filter and the project filter will be painted green or red depending on whether a pattern accepts or rejects the URL. Examples for link filters:
Examples for image filters:
|
|
|||
| © Copyright 2002-2005 WebArchiver.net Software, Inc. |