fix(scan): first endpoint detection for IP #251

psyray · 2025-01-17T20:44:11Z

Fix first endpoint detection by IPs

Scan by hostname
Scan by IP

Summary by Sourcery

Improve handling of IP address scans by limiting the available tasks and prioritizing the creation of endpoints for the IP itself, even when rDNS is available.

New Features:

Display the number of subdomains associated with each IP and port in their respective badges.

Bug Fixes:

Fix an issue where IP addresses were not being properly scanned.

Enhancements:

Limit the tasks performed during an IP address scan to only essential ones.
Prioritize creating an endpoint for the IP address itself during scans.
Improve port information gathering during scans by including service details and uncommon web ports.
Update port service information with nmap service detection results.
Standardize port information by ensuring only one entry exists per port number.

- Added support for explicitly setting HTTP status during endpoint creation - Improved error handling in endpoint creation from Nmap data - Added fallback mechanisms for creating endpoints when initial crawling fails - Enhanced logging for endpoint and subdomain metadata creation

- Added a safer check for domain existence in nmap data - Added a check to ensure the scheme is present before creating a fallback endpoint - Prevents potential errors when creating endpoints from incomplete nmap data

This change enhances the scanner's ability to handle IP addresses as scan targets. Specifically, it limits the tasks performed when scanning an IP, ensures IP addresses are correctly handled in various parts of the code. It also adds the HTTP status code to the endpoint data.

sourcery-ai · 2025-01-17T20:44:15Z

Reviewer's Guide by Sourcery

This pull request enhances the handling of IP address scans by limiting the tasks and correctly creating the first endpoint.

Sequence diagram for IP scan endpoint detection

sequenceDiagram
    participant Scanner
    participant TaskManager
    participant EndpointManager
    participant Database

    Scanner->>TaskManager: initiate_scan(domain)
    TaskManager->>TaskManager: Check if IP scan
    alt is IP scan
        TaskManager->>TaskManager: Filter allowed tasks
        Note over TaskManager: Only allow: port_scan, fetch_url,<br/>dir_file_fuzz, vulnerability_scan,<br/>screenshot, waf_detection
    end

    TaskManager->>EndpointManager: create_first_endpoint_from_nmap_data()
    alt is IP scan
        EndpointManager->>EndpointManager: Create IP endpoint
        EndpointManager->>Database: Save IP endpoint
        opt rDNS hostname exists
            EndpointManager->>EndpointManager: Create rDNS endpoint
            EndpointManager->>Database: Save rDNS endpoint
        end
    else is domain scan
        EndpointManager->>EndpointManager: Create domain endpoint
        EndpointManager->>Database: Save domain endpoint
    end

Class diagram for endpoint and domain relationships

classDiagram
    class EndPoint {
        +scan_history: ScanHistory
        +target_domain: Domain
        +http_url: String
        +is_default: Boolean
        +discovered_date: DateTime
        +http_status: Integer
    }

    class Domain {
        +name: String
        +last_scan_date: DateTime
    }

    class Subdomain {
        +scan_history: ScanHistory
        +target_domain: Domain
        +name: String
        +discovered_date: DateTime
    }

    Domain "1" -- "*" EndPoint
    Domain "1" -- "*" Subdomain
    Subdomain "1" -- "*" EndPoint

    note for EndPoint "Added http_status field"
    note for Domain "Enhanced IP validation"

Flow diagram for endpoint creation logic

flowchart TD
    A[Start Endpoint Creation] --> B{Is IP Scan?}
    B -->|Yes| C[Filter Allowed Tasks]
    B -->|No| D[Regular Domain Tasks]

    C --> E{Process Hosts Data}
    E -->|IP Address| F[Create IP Endpoint]
    E -->|rDNS Exists| G[Create rDNS Endpoint]

    D --> H[Create Domain Endpoint]
    H --> I[Try Crawling]
    I -->|Success| J[Save with Crawl Data]
    I -->|Failure| K[Save without Crawling]

    F --> L[Save Metadata]
    G --> L
    J --> L
    K --> L
    L --> M[End]

File-Level Changes

Change	Details	Files
Limit tasks for IP scans	Added a check to identify if the scan target is an IP address. Filtered the list of tasks to be executed for IP scans, allowing only 'port_scan', 'fetch_url', 'dir_file_fuzz', 'vulnerability_scan', 'screenshot', and 'waf_detection'.	`web/reNgine/tasks.py`
Improve endpoint creation for IP scans	Modified the save_endpoint function to handle IP scans correctly, skipping domain validation for IP addresses. Added http_status parameter to save_endpoint function. Modified create_first_endpoint_from_nmap_data to create endpoints for both the IP and rDNS hostnames when scanning an IP address. Added logic to create an endpoint for the IP itself if it's not present in the Nmap data, using rDNS data if available. Modified save_subdomain to validate both domain and IP formats. Modified save_subdomain to not validate subdomain against domain for IP scans.	`web/reNgine/tasks.py`

Assessment against linked issues

Issue	Objective	Addressed	Explanation
#250	First endpoint should be correctly detected for hostname or IP	✅
#250	Fix scan by IP not working well	✅

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time. You can also use
this command to specify where the summary should be inserted.

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

- This change improves the accuracy and detail of port scanning and service detection during vulnerability scans. - It expands the range of ports scanned, leverages nmap's service detection capabilities, and stores more detailed service information in the database. - Additionally, it refactors the handling of scan results to improve efficiency and code clarity. - It also adds functionality to retrieve ports associated with a subdomain.

This change improves the handling of port information by storing service details specific to each IP address. Previously, port service information was global, which was inaccurate for cases where different IPs on the same domain expose different services on the same port. Now, each IP can have its own service information for a given port. This also affects how port information is displayed in the subdomain view.

This update includes database migrations, API and serializer updates, and adjustments to related functionalities.

web/reNgine/tasks.py

This commit refactors the port handling logic and its visualization within the application. The changes improve the backend data model for ports, optimize how port information is fetched and displayed, and enhance the user interface for viewing port details. Specifically, the way ports are associated with IP addresses is improved, and the frontend now dynamically counts and displays the number of IPs associated with each unique port. Additionally, the frontend code for handling port display is streamlined and made more robust.

web/static/custom/port_display.js

This change enhances the scan initiation process by correctly handling IP-based scans and improving the parsing of Nmap results. It also adds support for filtering URLs during the scan and fixes a bug where scans would fail if the target host didn't have common web ports open. The update ensures that valid endpoints are created even when dealing with uncommon ports and addresses edge cases in Nmap's output parsing to prevent failures due to unexpected data formats.

This change improves the discovery of web endpoints by expanding the range of ports scanned and refining the endpoint creation logic. The update includes a broader list of common and uncommon HTTP ports, handles various schemes (HTTP and HTTPS) more effectively, and streamlines endpoint creation for both IP addresses and domain names. It also removes an unused URL path filter.

This change simplifies the data model by removing the PortInfo model and establishing a direct many-to-many relationship between Port and IpAddress. Service information, previously stored in PortInfo, is now directly associated with the Port model. This simplification reduces database complexity and improves query efficiency. Additionally, the nmap task signature and command building have been updated to use args instead of cmd for better clarity and flexibility. The report generation logic has also been updated to reflect these changes. Finally, a check for current_subdomain has been added to prevent errors when no subdomain is found.

web/startScan/templates/startScan/history.html

This change fixes WAF detection by switching to JSON output from wafw00f and fixing the display of WAF information in the UI. The update also fixes a bug where multiple WAFs were not being correctly handled, ensuring only the detected WAF is associated with a subdomain.

web/static/custom/port_display.js

- Add new GetIpDetails API view for consolidated IP information - Update templates to use new endpoint - Refactor port_display.js to handle new API response format - Remove redundant API calls and simplify data flow - Improve error handling and loading states

AnonymousWP · 2025-02-09T12:18:49Z

Please solve all comments, then I'll review.

psyray · 2025-02-09T13:56:59Z

Please solve all comments, then I'll review.

Which ones ?
They are all solved for me

AnonymousWP · 2025-02-09T14:10:45Z

Please solve all comments, then I'll review.

Which ones ? They are all solved for me

Hmm, on phone it said they weren't resolved, perhaps a bug. On PC it does indeed show they're solved. Will review later.

- Add HTTP/HTTPS links for web ports (common and uncommon) - Refactor modal display code for better maintainability - Add subdomain names to IP tooltips - Improve table layouts with consistent columns - Add port-specific links for subdomains - Create reusable modal and table creation functions - Add UncommonWebPortsView API endpoint - Update IP serializer to include subdomain names and filtered counts This commit enhances the user experience by providing direct links to web services and improves code maintainability through shared components.

- Fix incorrect IP address storage where hostname was stored instead of IP - Parse IP address directly from nmap XML results - Add warning log when no IP address is found - Prioritize IPv4 over IPv6 addresses This fixes a bug where domain names were being stored in the IpAddress table instead of actual IP addresses.

- Fix JavaScript syntax error "Uncaught SyntaxError: expected expression, got ','" - Clean up URL parameters formatting - Ensure proper comma usage in function parameters

Always use the -Pn flag with nmap to treat all hosts as online, skipping host discovery.

0b3ud

I have finished testing and reviewing this PR and it works as expected
No bugs detected

psyray added 4 commits December 16, 2024 00:25

fix(tasks): improve endpoint creation from nmap data

06b3177

- Added a safer check for domain existence in nmap data - Added a check to ensure the scheme is present before creating a fallback endpoint - Prevents potential errors when creating endpoints from incomplete nmap data

Merge branch 'release/2.1.1' into fix/first-endpoint-detection

2acb92a

psyray self-assigned this Jan 17, 2025

psyray linked an issue Jan 17, 2025 that may be closed by this pull request

bug(scan): fix first endpoint detection #250

Closed

3 tasks

psyray added 3 commits January 18, 2025 01:56

refactor: Refactor Port model to use PortInfo

80dfd51

This update includes database migrations, API and serializer updates, and adjustments to related functionalities.

github-advanced-security bot found potential problems Jan 18, 2025

View reviewed changes

web/reNgine/tasks.py Fixed Show fixed Hide fixed

github-advanced-security bot found potential problems Jan 21, 2025

View reviewed changes

psyray added 5 commits January 22, 2025 22:55

Merge branch 'release/2.1.1' into fix/first-endpoint-detection

c98377a

Merge branch 'release/2.1.1' into fix/first-endpoint-detection

7988573

github-advanced-security bot found potential problems Feb 3, 2025

View reviewed changes

web/startScan/templates/startScan/history.html Dismissed Show resolved Hide resolved

psyray added 3 commits February 3, 2025 10:21

fix: update visualization to reflect db changes

437f79d

Merge branch 'release/2.1.1' into fix/first-endpoint-detection

5b2aa15

psyray mentioned this pull request Feb 8, 2025

bug(scope): Nuclei only scan the first endpoint (https://domain.com/) (HTTPS) and do not scan the next endpoints (http://domain.com/) (HTTP) #217

Closed

3 tasks

psyray requested review from 0b3ud and AnonymousWP February 8, 2025 12:40

psyray added the bug Something isn't working label Feb 8, 2025

psyray marked this pull request as ready for review February 8, 2025 12:40

psyray linked an issue Feb 8, 2025 that may be closed by this pull request

bug(scope): Nuclei only scan the first endpoint (https://domain.com/) (HTTPS) and do not scan the next endpoints (http://domain.com/) (HTTP) #217

Closed

3 tasks

fix(js): apply codeql JS recommendations

8bde09e

github-advanced-security bot found potential problems Feb 8, 2025

View reviewed changes

fix(js): apply codeql JS recommendations

905bae3

github-advanced-security bot found potential problems Feb 8, 2025

View reviewed changes

web/static/custom/port_display.js Fixed Show fixed Hide fixed

web/static/custom/port_display.js Fixed Show fixed Hide fixed

web/static/custom/port_display.js Fixed Show fixed Hide fixed

psyray added 2 commits February 8, 2025 20:39

fix(js): apply codeql JS recommendations

5d696a3

psyray added 4 commits February 10, 2025 04:58

fix: remove trailing comma causing JavaScript syntax error

9e63bc7

- Fix JavaScript syntax error "Uncaught SyntaxError: expected expression, got ','" - Clean up URL parameters formatting - Ensure proper comma usage in function parameters

feat: Add -Pn to nmap command

4fa8a80

Always use the -Pn flag with nmap to treat all hosts as online, skipping host discovery.

0b3ud approved these changes Feb 12, 2025

View reviewed changes

0b3ud merged commit 900b2d6 into release/2.1.1 Feb 12, 2025
5 checks passed

0b3ud deleted the fix/first-endpoint-detection branch February 12, 2025 11:49

sourcery-ai bot mentioned this pull request Feb 18, 2025

fix: resolve duplicate port & improve first endpoint detection #273

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(scan): first endpoint detection for IP #251

fix(scan): first endpoint detection for IP #251

psyray commented Jan 17, 2025 •

edited

Loading

sourcery-ai bot commented Jan 17, 2025 •

edited

Loading

Interacting with Sourcery

Customizing Your Experience

Getting Help

AnonymousWP commented Feb 9, 2025

psyray commented Feb 9, 2025

AnonymousWP commented Feb 9, 2025

0b3ud left a comment

fix(scan): first endpoint detection for IP #251

fix(scan): first endpoint detection for IP #251

Conversation

psyray commented Jan 17, 2025 • edited Loading

Summary by Sourcery

sourcery-ai bot commented Jan 17, 2025 • edited Loading

Reviewer's Guide by Sourcery

Sequence diagram for IP scan endpoint detection

Class diagram for endpoint and domain relationships

Flow diagram for endpoint creation logic

File-Level Changes

Assessment against linked issues

Interacting with Sourcery

Customizing Your Experience

Getting Help

AnonymousWP commented Feb 9, 2025

psyray commented Feb 9, 2025

AnonymousWP commented Feb 9, 2025

0b3ud left a comment

Choose a reason for hiding this comment

psyray commented Jan 17, 2025 •

edited

Loading

sourcery-ai bot commented Jan 17, 2025 •

edited

Loading