# Feature Extraction
This document covers the CICIDS2017 feature extraction system used in RL-IDS for converting network packets into standardized feature vectors.
## Overview

The feature extraction system (the `CICIDSFeatureExtractor` class in `network_monitor.py`) converts raw network packets into 78 standardized features compatible with the CICIDS2017 dataset. This lets the DQN models analyze real-time network traffic using the same feature space they were trained on.
## CICIDS2017 Feature Set

The system extracts 78 features organized into several categories:
### Flow Duration Features

- `flow_duration` - Duration of the network flow in seconds
### Packet Count Features

- `total_fwd_packets` - Total packets in the forward direction
- `total_bwd_packets` - Total packets in the backward direction
### Byte Count Features

- `total_length_fwd_packets` - Total bytes in the forward direction
- `total_length_bwd_packets` - Total bytes in the backward direction
### Packet Length Statistics

Forward direction:

- `fwd_packet_length_max` - Maximum packet length
- `fwd_packet_length_min` - Minimum packet length
- `fwd_packet_length_mean` - Mean packet length
- `fwd_packet_length_std` - Standard deviation of packet lengths

Backward direction:

- `bwd_packet_length_max` - Maximum packet length
- `bwd_packet_length_min` - Minimum packet length
- `bwd_packet_length_mean` - Mean packet length
- `bwd_packet_length_std` - Standard deviation of packet lengths
### Flow Rate Features

- `flow_bytes_per_sec` - Bytes per second in the flow
- `flow_packets_per_sec` - Packets per second in the flow
- `fwd_packets_per_sec` - Forward packets per second
- `bwd_packets_per_sec` - Backward packets per second
### Inter-Arrival Time Features

- `flow_iat_mean` - Mean inter-arrival time
- `flow_iat_std` - Standard deviation of inter-arrival times
- `flow_iat_max` - Maximum inter-arrival time
- `flow_iat_min` - Minimum inter-arrival time
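These statistics reduce to simple operations over the gaps between consecutive packet timestamps. A minimal NumPy illustration (the timestamps here are hypothetical):

```python
import numpy as np

# Hypothetical packet arrival times, in seconds since the flow started
timestamps = np.array([0.000, 0.012, 0.051, 0.052, 0.130])
iats = np.diff(timestamps)  # gaps between consecutive packets

flow_iat_mean = iats.mean()  # 0.0325
flow_iat_std = iats.std()    # population std, matching np.std's default
flow_iat_max = iats.max()    # 0.078
flow_iat_min = iats.min()    # 0.001
```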
### TCP Flag Features

- `fin_flag_count` - Count of FIN flags
- `syn_flag_count` - Count of SYN flags
- `rst_flag_count` - Count of RST flags
- `psh_flag_count` - Count of PSH flags
- `ack_flag_count` - Count of ACK flags
- `urg_flag_count` - Count of URG flags
### Additional Statistical Features

- `min_packet_length` - Minimum packet length across all packets
- `max_packet_length` - Maximum packet length across all packets
- `packet_length_mean` - Mean packet length
- `packet_length_std` - Standard deviation of packet lengths
- `packet_length_variance` - Variance of packet lengths
## CICIDSFeatureExtractor Implementation

### Class Structure
```python
# Requires scapy for packet dissection and NumPy for statistics
import numpy as np
from datetime import datetime
from scapy.all import IP, TCP, UDP

class CICIDSFeatureExtractor:
    """Extract CICIDS2017-compatible features from network packets"""

    def __init__(self):
        self.feature_names = [
            'flow_duration', 'total_fwd_packets', 'total_bwd_packets',
            'total_length_fwd_packets', 'total_length_bwd_packets',
            'fwd_packet_length_max', 'fwd_packet_length_min', 'fwd_packet_length_mean',
            'fwd_packet_length_std', 'bwd_packet_length_max', 'bwd_packet_length_min',
            'bwd_packet_length_mean', 'bwd_packet_length_std', 'flow_bytes_per_sec',
            'flow_packets_per_sec', 'flow_iat_mean', 'flow_iat_std', 'flow_iat_max',
            'flow_iat_min', 'fwd_iat_total', 'fwd_iat_mean', 'fwd_iat_std',
            'fwd_iat_max', 'fwd_iat_min', 'bwd_iat_total', 'bwd_iat_mean',
            'bwd_iat_std', 'bwd_iat_max', 'bwd_iat_min', 'fwd_psh_flags',
            'bwd_psh_flags', 'fwd_urg_flags', 'bwd_urg_flags', 'fwd_header_length',
            'bwd_header_length', 'fwd_packets_per_sec', 'bwd_packets_per_sec',
            'min_packet_length', 'max_packet_length', 'packet_length_mean',
            'packet_length_std', 'packet_length_variance', 'fin_flag_count',
            'syn_flag_count', 'rst_flag_count', 'psh_flag_count', 'ack_flag_count',
            'urg_flag_count', 'cwe_flag_count', 'ece_flag_count', 'down_up_ratio',
            'average_packet_size', 'avg_fwd_segment_size', 'avg_bwd_segment_size',
            'fwd_header_length_2', 'fwd_avg_bytes_per_bulk', 'fwd_avg_packets_per_bulk',
            'fwd_avg_bulk_rate', 'bwd_avg_bytes_per_bulk', 'bwd_avg_packets_per_bulk',
            'bwd_avg_bulk_rate', 'subflow_fwd_packets', 'subflow_fwd_bytes',
            'subflow_bwd_packets', 'subflow_bwd_bytes', 'init_win_bytes_forward',
            'init_win_bytes_backward', 'act_data_pkt_fwd', 'min_seg_size_forward',
            'active_mean', 'active_std', 'active_max', 'active_min', 'idle_mean',
            'idle_std', 'idle_max', 'idle_min'
        ]
```
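To sanity-check the layout, the names can be paired with their positional indices; this sketch assumes the class is importable from `network_monitor`:

```python
from network_monitor import CICIDSFeatureExtractor

extractor = CICIDSFeatureExtractor()
print(len(extractor.feature_names))  # should match the model's input dimension

# Map each feature name to the index used in extract_features below
for idx, name in enumerate(extractor.feature_names):
    print(f"{idx:2d}  {name}")
```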
### Feature Extraction Process
```python
def extract_features(self, packet, flow_data):
    """Extract 78 CICIDS2017 features from packet and flow data"""
    try:
        features = [0.0] * 78

        # packet may be None when features are recomputed from cached flow data
        if packet is not None and IP not in packet:
            return features

        current_time = datetime.now()
        packets = flow_data.get('packets', [])
        if not packets:
            return features

        # Basic flow statistics
        flow_start = flow_data.get('start_time', current_time)
        flow_duration = (current_time - flow_start).total_seconds()

        # Packet counts
        fwd_packets = flow_data.get('forward_packets', 0)
        bwd_packets = flow_data.get('backward_packets', 0)
        total_packets = fwd_packets + bwd_packets

        # Byte counts
        fwd_bytes = flow_data.get('forward_bytes', 0)
        bwd_bytes = flow_data.get('backward_bytes', 0)
        total_bytes = fwd_bytes + bwd_bytes

        # Extract packet sizes by direction
        fwd_sizes = [p['size'] for p in packets if p.get('direction') == 'forward']
        bwd_sizes = [p['size'] for p in packets if p.get('direction') == 'backward']
        all_sizes = [p['size'] for p in packets]

        # Duration and counters
        features[0] = flow_duration
        features[1] = float(fwd_packets)
        features[2] = float(bwd_packets)
        features[3] = float(fwd_bytes)
        features[4] = float(bwd_bytes)

        # Forward packet statistics
        if fwd_sizes:
            features[5] = float(max(fwd_sizes))
            features[6] = float(min(fwd_sizes))
            features[7] = float(np.mean(fwd_sizes))
            features[8] = float(np.std(fwd_sizes))

        # Backward packet statistics
        if bwd_sizes:
            features[9] = float(max(bwd_sizes))
            features[10] = float(min(bwd_sizes))
            features[11] = float(np.mean(bwd_sizes))
            features[12] = float(np.std(bwd_sizes))

        # Flow rates (guard against division by a zero duration)
        if flow_duration > 0:
            features[13] = float(total_bytes / flow_duration)
            features[14] = float(total_packets / flow_duration)
            features[35] = float(fwd_packets / flow_duration)
            features[36] = float(bwd_packets / flow_duration)

        # Inter-arrival times
        if len(packets) > 1:
            iats = []
            for i in range(1, len(packets)):
                iat = (packets[i]['timestamp'] - packets[i - 1]['timestamp']).total_seconds()
                iats.append(iat)
            if iats:
                features[15] = float(np.mean(iats))
                features[16] = float(np.std(iats))
                features[17] = float(max(iats))
                features[18] = float(min(iats))

        # Packet length statistics
        if all_sizes:
            features[37] = float(min(all_sizes))
            features[38] = float(max(all_sizes))
            features[39] = float(np.mean(all_sizes))
            features[40] = float(np.std(all_sizes))
            features[41] = float(np.var(all_sizes))
            features[49] = float(np.mean(all_sizes))

        # TCP flags analysis (flags are accumulated per flow, so the
        # current packet does not itself need to be a TCP packet)
        tcp_flags = flow_data.get('tcp_flags', [])
        if tcp_flags:
            features[42] = float(sum(1 for f in tcp_flags if int(f) & 0x01))  # FIN
            features[43] = float(sum(1 for f in tcp_flags if int(f) & 0x02))  # SYN
            features[44] = float(sum(1 for f in tcp_flags if int(f) & 0x04))  # RST
            features[45] = float(sum(1 for f in tcp_flags if int(f) & 0x08))  # PSH
            features[46] = float(sum(1 for f in tcp_flags if int(f) & 0x10))  # ACK
            features[47] = float(sum(1 for f in tcp_flags if int(f) & 0x20))  # URG

        # Additional features
        features[50] = float(np.mean(fwd_sizes)) if fwd_sizes else 0.0
        features[51] = float(np.mean(bwd_sizes)) if bwd_sizes else 0.0

        # Clip extreme values, then zero out any NaN/inf
        features = [min(max(f, -1e6), 1e6) for f in features]
        features = [0.0 if np.isnan(f) or np.isinf(f) else f for f in features]

        return features
    except Exception:
        # On any parsing error, fall back to an all-zero vector
        return [0.0] * 78
```
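A minimal smoke test, assuming scapy is installed and using a hand-built one-packet flow (all values here are synthetic):

```python
from datetime import datetime, timedelta
from scapy.all import IP, TCP

now = datetime.now()
pkt = IP(src="10.0.0.1", dst="10.0.0.2") / TCP(sport=12345, dport=80, flags="S")

# Flow state shaped like the FlowTracker output described below
flow_data = {
    'start_time': now - timedelta(seconds=1),
    'packets': [{'timestamp': now, 'size': len(pkt), 'direction': 'forward'}],
    'forward_packets': 1, 'backward_packets': 0,
    'forward_bytes': len(pkt), 'backward_bytes': 0,
    'tcp_flags': [pkt[TCP].flags],
}

features = CICIDSFeatureExtractor().extract_features(pkt, flow_data)
assert len(features) == 78
print(features[0], features[43])  # flow_duration ~1.0, syn_flag_count 1.0
```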
## Flow Tracking

### Flow Identification

Network flows are identified using a 5-tuple (source IP, source port, destination IP, destination port, protocol), normalized so that both directions of a conversation map to the same key:
```python
def get_flow_key(self, packet):
    """Generate a unique, direction-independent flow key from a packet"""
    if IP not in packet:
        return None

    src_ip = packet[IP].src
    dst_ip = packet[IP].dst

    src_port = 0
    dst_port = 0
    if TCP in packet:
        src_port = packet[TCP].sport
        dst_port = packet[TCP].dport
        protocol_name = "TCP"
    elif UDP in packet:
        src_port = packet[UDP].sport
        dst_port = packet[UDP].dport
        protocol_name = "UDP"
    else:
        protocol_name = "OTHER"

    # Create bidirectional flow key (order the endpoints canonically so
    # that A->B and B->A packets share one key)
    if (src_ip, src_port) < (dst_ip, dst_port):
        flow_key = f"{src_ip}:{src_port}-{dst_ip}:{dst_port}-{protocol_name}"
    else:
        flow_key = f"{dst_ip}:{dst_port}-{src_ip}:{src_port}-{protocol_name}"
    return flow_key
```
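Because the endpoints are ordered canonically, packets from both directions of the same conversation map to one key. For instance (hypothetical addresses, assuming `monitor` is an instance of the class that defines `get_flow_key`):

```python
from scapy.all import IP, TCP

a_to_b = IP(src="10.0.0.1", dst="10.0.0.2") / TCP(sport=40000, dport=443)
b_to_a = IP(src="10.0.0.2", dst="10.0.0.1") / TCP(sport=443, dport=40000)

# Both directions normalize to the same bidirectional key
assert monitor.get_flow_key(a_to_b) == monitor.get_flow_key(b_to_a)
```

Note that the ordering is lexicographic on `(ip, port)` tuples, so "forward" denotes a canonical direction rather than the connection initiator.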
### Flow State Management
```python
class FlowTracker:
    def __init__(self):
        self.flows = {}
        self.flow_timeout = 120  # seconds

    def update_flow(self, packet, flow_key):
        """Update flow statistics with a new packet"""
        current_time = datetime.now()

        if flow_key not in self.flows:
            self.flows[flow_key] = {
                'start_time': current_time,
                'last_seen': current_time,
                'packets': [],
                'forward_packets': 0,
                'backward_packets': 0,
                'forward_bytes': 0,
                'backward_bytes': 0,
                'tcp_flags': []
            }

        flow = self.flows[flow_key]
        flow['last_seen'] = current_time

        # Determine packet direction relative to the normalized flow key
        direction = self.get_packet_direction(packet, flow_key)

        # Record per-packet metadata for later statistics
        packet_size = len(packet)
        packet_info = {
            'timestamp': current_time,
            'size': packet_size,
            'direction': direction
        }
        flow['packets'].append(packet_info)

        # Update directional counters
        if direction == 'forward':
            flow['forward_packets'] += 1
            flow['forward_bytes'] += packet_size
        else:
            flow['backward_packets'] += 1
            flow['backward_bytes'] += packet_size

        # Accumulate TCP flags if present
        if TCP in packet:
            flow['tcp_flags'].append(packet[TCP].flags)

        return flow

    def cleanup_expired_flows(self):
        """Remove expired flows to prevent unbounded memory growth"""
        current_time = datetime.now()
        expired_flows = []
        for flow_key, flow in self.flows.items():
            if (current_time - flow['last_seen']).total_seconds() > self.flow_timeout:
                expired_flows.append(flow_key)
        for flow_key in expired_flows:
            del self.flows[flow_key]
        return len(expired_flows)
```
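The `get_packet_direction` helper called by `update_flow` is not shown above. A minimal sketch consistent with the normalized flow key would label a packet "forward" when its source matches the key's first endpoint (again, a canonical direction, not necessarily the initiator):

```python
def get_packet_direction(self, packet, flow_key):
    """Label a packet 'forward' if its source is the flow key's first endpoint"""
    src_port = 0
    if TCP in packet:
        src_port = packet[TCP].sport
    elif UDP in packet:
        src_port = packet[UDP].sport
    src_endpoint = f"{packet[IP].src}:{src_port}"
    # The key has the form "ip:port-ip:port-PROTO"
    return 'forward' if flow_key.startswith(src_endpoint + "-") else 'backward'
```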
## Data Preprocessing

### CICIDS2017 Dataset Processing

The `rl_ids/make_dataset.py` module handles preprocessing of the original CICIDS2017 CSV files:
```python
# Requires pandas, tqdm, and the project's RAW_DATA_DIR / logger
class DataGenerator:
    """Handles loading and initial preprocessing of raw CSV data files"""

    def __init__(self):
        self.label_encoder = None
        self.processed_data = None

    def load_and_preprocess_data(self, data_dir: Path = RAW_DATA_DIR) -> pd.DataFrame:
        """Load and preprocess CSV data files from the specified directory."""
        # Find CSV files
        csv_files = [f for f in os.listdir(data_dir) if f.endswith(".csv")]

        # Load CSV files with progress tracking
        data_frames = []
        for csv_file in tqdm(csv_files, desc="Loading CSV files"):
            file_path = data_dir / csv_file
            try:
                df = pd.read_csv(file_path)
                data_frames.append(df)
            except Exception as e:
                logger.error(f"Failed to load {csv_file}: {e}")

        # Combine all dataframes into one
        combined_df = pd.concat(data_frames, ignore_index=True)
        return combined_df
```
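A typical invocation might look like this (the path is illustrative; `RAW_DATA_DIR` normally comes from the project's configuration):

```python
from pathlib import Path

generator = DataGenerator()
combined_df = generator.load_and_preprocess_data(Path("data/raw"))
print(combined_df.shape)
```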
### Feature Normalization
```python
def normalize_features(self, df: pd.DataFrame, scaler_type: str = "standard") -> tuple:
    """Normalize feature columns using the specified scaler.

    Returns the normalized DataFrame together with the fitted scaler, so the
    same transform can later be applied to live traffic.
    """
    feature_columns = [col for col in df.columns if col not in ["Label", "Label_Original"]]

    if scaler_type == "standard":
        scaler = StandardScaler()
    elif scaler_type == "minmax":
        scaler = MinMaxScaler()
    elif scaler_type == "robust":
        scaler = RobustScaler()
    else:
        raise ValueError(f"Unknown scaler type: {scaler_type}")

    # Fit and transform features
    df_normalized = df.copy()
    df_normalized[feature_columns] = scaler.fit_transform(df[feature_columns])
    return df_normalized, scaler
```
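Since live traffic must be scaled exactly like the training data, the fitted scaler should be persisted and reloaded in the real-time pipeline. A sketch using `joblib` (the file path is illustrative):

```python
import joblib

df_normalized, scaler = generator.normalize_features(df, scaler_type="standard")
joblib.dump(scaler, "models/feature_scaler.joblib")

# Later, in the real-time pipeline:
scaler = joblib.load("models/feature_scaler.joblib")
scaled = scaler.transform([features])  # features: one 78-element vector
```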
### Label Encoding
```python
def encode_labels(self, df: pd.DataFrame) -> pd.DataFrame:
    """Encode string labels to numeric values"""
    # Keep the original string labels alongside the numeric ones
    df['Label_Original'] = df['Label'].copy()

    # Encode labels to numeric values
    self.label_encoder = LabelEncoder()
    df['Label'] = self.label_encoder.fit_transform(df['Label'])

    # Log the label mapping for later reference
    label_mapping = dict(zip(self.label_encoder.classes_,
                             self.label_encoder.transform(self.label_encoder.classes_)))
    logger.info(f"Label mapping: {label_mapping}")

    return df
```
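The fitted encoder can also translate model predictions back into attack names:

```python
pred_class = 3  # hypothetical model output
label = generator.label_encoder.inverse_transform([pred_class])[0]
print(label)  # the original CICIDS2017 label string for class 3
```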
## Real-time Feature Pipeline

### Integration with Network Monitor

The feature extractor plugs directly into the monitor's packet-processing loop:
```python
class RealTimeNetworkMonitor:
    def __init__(self, interface="wlan0", api_url="http://localhost:8000"):
        self.interface = interface
        self.api_url = api_url
        self.feature_extractor = CICIDSFeatureExtractor()
        self.flow_tracker = FlowTracker()

    def process_packet(self, packet):
        """Process a captured packet for threat detection"""
        try:
            # Identify the flow this packet belongs to
            flow_key = self.get_flow_key(packet)
            if not flow_key:
                return

            # Update flow tracking
            flow_data = self.flow_tracker.update_flow(packet, flow_key)

            # Extract features
            features = self.feature_extractor.extract_features(packet, flow_data)

            # Send to the API for analysis (requires a running asyncio event loop)
            asyncio.create_task(self.analyze_features(features, flow_key))
        except Exception as e:
            logger.error(f"Error processing packet: {e}")
```
### Performance Optimizations
```python
import time

# Efficient feature caching with LRU eviction
class FeatureCache:
    def __init__(self, feature_extractor, max_size=1000):
        self.feature_extractor = feature_extractor
        self.cache = {}
        self.max_size = max_size
        self.access_times = {}

    def compute_cache_key(self, flow_data):
        # One possible key (an approximation): flows with identical aggregate
        # counters are treated as equivalent; timing-based features may differ
        return (flow_data.get('forward_packets', 0),
                flow_data.get('backward_packets', 0),
                flow_data.get('forward_bytes', 0),
                flow_data.get('backward_bytes', 0))

    def evict_lru(self):
        # Drop the least recently used entry
        lru_key = min(self.access_times, key=self.access_times.get)
        del self.cache[lru_key]
        del self.access_times[lru_key]

    def get_features(self, flow_key, flow_data):
        """Get cached features or compute new ones"""
        cache_key = self.compute_cache_key(flow_data)
        if cache_key in self.cache:
            self.access_times[cache_key] = time.time()
            return self.cache[cache_key]

        # Compute new features (no single packet needed; flow data suffices)
        features = self.feature_extractor.extract_features(None, flow_data)

        # Cache with LRU eviction
        if len(self.cache) >= self.max_size:
            self.evict_lru()
        self.cache[cache_key] = features
        self.access_times[cache_key] = time.time()
        return features
```
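Wiring the cache in is a one-liner, and the LRU policy keeps memory bounded under sustained traffic:

```python
cache = FeatureCache(CICIDSFeatureExtractor(), max_size=5000)
features = cache.get_features(flow_key, flow_data)
```

The trade-off of the counter-based key sketched above is that flows with identical packet and byte counts share a cached vector, so timing-sensitive features may be slightly stale; this is acceptable when throughput matters more than per-packet precision.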
## Feature Quality and Validation

### Feature Validation
```python
def validate_features(self, features):
    """Validate extracted features for quality and consistency"""
    # Check feature count
    if len(features) != 78:
        raise ValueError(f"Expected 78 features, got {len(features)}")

    for i, feature in enumerate(features):
        # Zero out invalid values
        if np.isnan(feature) or np.isinf(feature):
            logger.warning(f"Invalid value in feature {i}: {feature}")
            features[i] = 0.0
        # Clip extreme (but finite) values to a sane range
        elif abs(feature) > 1e6:
            logger.warning(f"Extreme value in feature {i}: {feature}")
            features[i] = np.clip(feature, -1e6, 1e6)

    return features
```
### Feature Statistics
```python
def compute_feature_statistics(self, feature_batches):
    """Compute per-feature statistics across a batch of feature vectors"""
    features_array = np.array(feature_batches)

    stats = {
        'mean': np.mean(features_array, axis=0),
        'std': np.std(features_array, axis=0),
        'min': np.min(features_array, axis=0),
        'max': np.max(features_array, axis=0),
        # Fraction of samples in which each feature is exactly zero
        'zero_ratio': np.mean(features_array == 0, axis=0)
    }
    return stats
```
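These statistics are useful for spotting dead or saturated features during a capture session, for example (assuming the method lives on the monitor object):

```python
stats = monitor.compute_feature_statistics(collected_feature_batches)

# Features that are zero in almost every sample carry no signal
dead = [i for i, ratio in enumerate(stats['zero_ratio']) if ratio > 0.99]
print(f"{len(dead)} features are zero in >99% of samples: {dead}")
```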