Files
f2b/fail2ban/logs.go
Ismo Vuorinen 70d1cb70fd feat!: Go rewrite (#9)
* Go rewrite

* chore(cr): apply suggestions

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Signed-off-by: Ismo Vuorinen <ismo@ivuorinen.net>

* 📝 CodeRabbit Chat: Add NoOpClient to fail2ban and initialize when skip flag is true

* 📝 CodeRabbit Chat: Fix malformed if-else structure and add no-op client for skip-only commands

* 📝 CodeRabbit Chat: Fix malformed if-else structure and add no-op client for skip-only commands

* fix(main): correct no-op branch syntax (#10)

* chore(gitignore): ignore env and binary files (#11)

* chore(config): remove indent_size for go files (#12)

* feat(cli): inject version via ldflags (#13)

* fix(security): validate filter parameter to prevent path traversal (#15)

* chore(repo): anchor ignore for build artifacts (#16)

* chore(ci): use golangci-lint action (#17)

* feat(fail2ban): expose GetLogDir (#19)

* test(cmd): improve IP mock validation (#20)

* chore(ci): update golanglint

* fix(ci): golanglint

* fix(ci): correct args indentation in pr-lint workflow (#21)

* fix(ci): avoid duplicate releases (#22)

* refactor(fail2ban): remove test check from OSRunner (#23)

* refactor(fail2ban): make log and filter dirs configurable (#24)

* fix(ci): create single release per tag (#14)

Signed-off-by: Ismo Vuorinen <ismo@ivuorinen.net>

* chore(dev): add codex setup script (#27)

* chore(lint): enable staticcheck (#26)

* chore(ci): verify golangci config (#28)

* refactor(cmd): centralize env config (#29)

* chore(dev): add pre-commit config (#30)

* fix(ci): disable cgo in cross compile (#31)

* fix(ci): fail on formatting issues (#32)

* feat(cmd): add context to logs watch (#33)

* chore: fixes, roadmap, claude.md, linting

* chore: fixes, linting

* fix(ci): gh actions update, fixes and tweaks

* chore: use reviewdog actionlint

* chore: use wow-rp-addons/actions-editorconfig-check

* chore: combine agent instructions, add comments, fixes

* chore: linting, fixes, go revive

* chore(deps): update pre-commit hooks

* chore: bump go to 1.21, pin workflows

* fix: install tools in lint.yml

* fix: sudo timeout

* fix: service command injection

* fix: memory exhaustion with large logs

* fix: enhanced path traversal and file security vulns

* fix: race conditions

* fix: context support

* chore: simplify fail2ban/ code

* feat: major refactoring with GoReleaser integration and code consolidation

- Add GoReleaser configuration for automated multi-platform releases
  - Support for Linux, macOS, Windows, and BSD builds
  - Docker images, Homebrew tap, and Linux packages (.deb, .rpm, .apk)
  - GitHub Actions workflow for release automation

- Consolidate duplicate code and improve architecture
  - Extract common command helpers to cmd/helpers.go (~230 lines)
  - Remove duplicate MockClient implementation from tests (~250 lines)
  - Create context wrapper helpers in fail2ban/context_helpers.go
  - Standardize error messages in fail2ban/errors.go

- Enhance validation and security
  - Add proper IP address validation with fail2ban.ValidateIP
  - Fix path traversal and command injection vulnerabilities
  - Improve thread-safety in MockClient with consistent ordering

- Optimize documentation
  - Reduce CLAUDE.md from 190 to 81 lines (57% reduction)
  - Reduce TODO.md from 633 to 93 lines (85% reduction)
  - Move README.md to root directory with installation instructions

- Improve test reliability
  - Fix race conditions and test flakiness
  - Add sorting to ensure deterministic test output
  - Enhance MockClient with configurable behavior

* feat: comprehensive code quality improvements and documentation reorganization

This commit represents a major overhaul of code quality, documentation
structure, and development tooling:

**Documentation & Structure:**

- Move CODE_OF_CONDUCT.md from .github to root directory
- Reorganize documentation with dedicated docs/ directory
- Create comprehensive architecture, security, and testing documentation
- Update all references and cross-links for new documentation structure

**Code Quality & Linting:**

- Add 120-character line length limit across all files via EditorConfig
- Enable comprehensive linting with golines, lll, usetesting, gosec, and revive
- Fix all 86 revive linter issues (unused parameters, missing export comments)
- Resolve security issues (file permissions 0644 → 0600, gosec warnings)
- Replace deprecated os.Setenv with t.Setenv in all tests
- Configure golangci-lint with auto-fix capabilities and formatter integration

**Development Tooling:**

- Enhance pre-commit configuration with additional hooks and formatters
- Update GoReleaser configuration with improved YAML formatting
- Improve GitHub workflows and issue templates for CLI-specific context
- Add comprehensive Makefile with proper dependency checking

**Testing & Security:**

- Standardize mock patterns and context wrapper implementations
- Enhance error handling with centralized error constants
- Improve concurrent access testing for thread safety

* perf: implement major performance optimizations with comprehensive test coverage

This commit introduces three significant performance improvements along with
complete linting compliance and robust test coverage:

**Performance Optimizations:**
1. **Time Parsing Cache (8.6x improvement)**
    - Add TimeParsingCache with sync.Map for caching parsed times
    - Implement object pooling for string builders to reduce allocations
    - Create optimized BanRecordParser with pooled string slices

2. **Gzip Detection Consolidation (55x improvement)**
    - Consolidate ~100 lines of duplicate gzip detection logic
    - Fast-path extension checking before magic byte detection
    - Unified GzipDetector with comprehensive file handling utilities

3. **Parallel Processing (2.5-5.0x improvement)**
    - Generic WorkerPool implementation for concurrent operations
    - Smart fallback to sequential processing for single operations
    - Context-aware cancellation support for long-running tasks
    - Applied to ban/unban operations across multiple jails

**New Files Added:**
- fail2ban/time_parser.go: Cached time parsing with global instances
- fail2ban/ban_record_parser.go: Optimized ban record parsing
- fail2ban/gzip_detection.go: Unified gzip handling utilities
- fail2ban/parallel_processing.go: Generic parallel processing framework
- cmd/parallel_operations.go: Command-level parallel operation support

**Code Quality & Linting:**
- Resolve all golangci-lint issues (0 remaining)
- Add proper #nosec annotations for legitimate file operations
- Implement sentinel errors replacing nil/nil anti-pattern
- Fix context parameter handling and error checking

**Comprehensive Test Coverage:**
- 500+ lines of new tests with benchmarks validating all improvements
- Concurrent access testing for thread safety
- Edge case handling and error condition testing
- Performance benchmarks demonstrating measured improvements

**Modified Files:**
- fail2ban/fail2ban.go: Integration with new optimized parsers
- fail2ban/logs.go: Use consolidated gzip detection (-91 lines)
- cmd/ban.go & cmd/unban.go: Add conditional parallel processing

* test: comprehensive test infrastructure overhaul with real test data

Major improvements to test code quality and organization:

• Added comprehensive test data infrastructure with 6 anonymized log files
• Extracted common test helpers reducing ~200 lines to ~50 reusable functions
• Enhanced ban record parser tests with real production log patterns
• Improved gzip detection tests with actual compressed test data
• Added integration tests for full log processing and concurrent operations
• Updated .gitignore to allow testdata log files while excluding others
• Updated TODO.md to reflect completed test infrastructure improvements

* fix: comprehensive security hardening and critical bug fixes

Security Enhancements:
- Add command injection protection with allowlist validation for all external
  commands
- Add security documentation to gzip functions warning about path traversal risks
- Complete TODO.md security audit - all critical vulnerabilities addressed

Bug Fixes:
- Fix negative index access vulnerability in parallel operations (prevent panic)
- Fix parsing inconsistency between BannedIn and BannedInWithContext functions
- Fix nil error handling in concurrent log reading tests
- Fix benchmark error simulation to measure actual performance vs error paths

Implementation Details:
- Add ValidateCommand() with allowlist for fail2ban-client, fail2ban-regex,
  service, systemctl, sudo
- Integrate command validation into all OSRunner methods before execution
- Replace manual string parsing with ParseBracketedList() for consistency
- Add bounds checking (index >= 0) to prevent negative array access
- Replace nil error with descriptive error message in concurrent error channels
- Update banFunc in benchmark to return success instead of permanent errors

Test Coverage:
- Add comprehensive security validation tests with injection attempt patterns
- Add parallel operations safety tests with index validation
- Add parsing consistency tests between context/non-context functions
- Add error handling demonstration tests for concurrent operations
- Add gzip function security requirement documentation tests

* perf: implement ultra-optimized log and ban record parsing with significant performance gains

Major performance improvements to core fail2ban processing with
 comprehensive benchmarking:

Performance Achievements:
• Ban record parsing: 15% faster, 39% less memory, 45% fewer allocations
• Log processing: 27% faster, 64% less memory, 32% fewer allocations
• Cache performance: 624x faster cache hits with zero allocations
• String pooling: 4.7x improvement with zero memory allocations

Core Optimizations:
• Object pooling (sync.Pool) for string slices, scanner buffers, and line buffers
• Comprehensive caching (sync.Map) for gzip detection, file info, and path patterns
• Fast path optimizations with extension-based gzip detection
• Byte-level operations to reduce string allocations in filtering
• Ultra-optimized parsers with smart field parsing and efficient time handling

New Files:
• fail2ban/ban_record_parser_optimized.go - High-performance ban record parser
• fail2ban/log_performance_optimized.go - Ultra-optimized log processor with caching
• fail2ban/ban_record_parser_benchmark_test.go - Ban record parsing benchmarks
• fail2ban/log_performance_benchmark_test.go - Log performance benchmarks
• fail2ban/ban_record_parser_compatibility_test.go - Compatibility verification tests

Updated:
• fail2ban/fail2ban.go - Integration with ultra-optimized parsers
• TODO.md - Marked performance optimization tasks as completed

* fix(ci): install dev dependencies for pre-commit

* refactor: streamline pre-commit config and extract test helpers

- Replace local hooks with upstream pre-commit repositories for better maintainability
- Add new hooks: shellcheck, shfmt, checkov for enhanced code quality
- Extract common test helpers into dedicated test_helpers.go to reduce duplication
- Add warning logs for unreadable log files in fail2ban and logs packages
- Remove hard-coded GID checks in sudo.go for better cross-platform portability
- Update golangci-lint installation method in Makefile

* fix(security): path traversal, log file validation

* feat: complete pre-release modernization with comprehensive testing

- Remove all deprecated legacy functions and dead code paths
- Add security hardening with sanitized error messages
- Implement comprehensive performance benchmarks and security audit tests
- Mark all pre-release modernization tasks as completed (10/10)
- Update project documentation to reflect full completion status

* fix(ci): linting, and update gosec install source

* feat: implement comprehensive test framework with 60-70% code reduction

Major test infrastructure modernization:

- Create fluent CommandTestBuilder framework for streamlined test creation
- Add MockClientBuilder pattern for advanced mock configuration
- Standardize table test field naming (expectedOut→wantOutput, expectError→wantError)
- Consolidate test code: 3,796 insertions, 3,104 deletions (net +692 lines with enhanced functionality)

Framework achievements:
- 168+ tests passing with zero regressions
- 5 cmd test files fully migrated to new framework
- 63 field name standardizations applied
- Advanced mock patterns with fluent interface

File organization improvements:
- Rename all test files with consistent prefixes (cmd_*, fail2ban_*, main_*)
- Split monolithic test files into focused, maintainable modules
- Eliminate cmd_test.go (622 lines) and main_test.go (825 lines)
- Create specialized test files for better organization

Documentation enhancements:
- Update docs/testing.md with complete framework documentation
- Optimize TODO.md from 231→72 lines (69% token reduction)
- Add comprehensive migration guides and best practices

Test framework components:
- command_test_framework.go: Core fluent interface implementation
- MockClientBuilder: Advanced mock configuration with builder pattern
- table_test_standards.go: Standardized field naming conventions
- Enhanced test helpers with error checking consolidation

* chore: fixes, .go-version, linting

* fix(ci) editorconfig in .pre-commit-config.yaml

* fix: too broad gitignore

* chore: update fail2ban/fail2ban_path_security_test.go

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Signed-off-by: Ismo Vuorinen <ismo@ivuorinen.net>

* chore: code review fixes

* chore: code review fixes

* fix: more code review fixes

* fix: more code review fixes

* feat: cleanup, fixes, testing

* chore: minor config file updates

- Add quotes to F2B_TIMEOUT value in .env.example for clarity
- Remove testdata log exception from .gitignore (simplified)

* feat: implement comprehensive monitoring with structured logging and metrics

- Add structured logging with context propagation throughout codebase
  - Implement ContextualLogger with request tracking and operation timing
  - Add context values for operation, IP, jail, command, and request ID
  - Integrate with existing logrus logging infrastructure

- Add request/response timing metrics collection
  - Create comprehensive Metrics system with atomic counters
  - Track command executions, ban/unban operations, and client operations
  - Implement latency distribution buckets for performance analysis
  - Add validation cache hit/miss tracking

- Enhance ban/unban commands with structured logging
  - Add LogOperation wrapper for automatic timing and context
  - Log individual jail operations with success/failure status
  - Integrate metrics recording with ban/unban operations

- Add new 'metrics' command to expose collected metrics
  - Support both plain text and JSON output formats
  - Display system metrics (uptime, memory, goroutines)
  - Show operation counts, failures, and average latencies
  - Include latency distribution histograms

- Update test infrastructure
  - Add tests for metrics command
  - Fix test helper to support persistent flags
  - Ensure all tests pass with new logging

This completes the high-priority performance monitoring and structured
logging requirements from TODO.md, providing comprehensive operational
visibility into the f2b application.

* docs: update TODO.md to reflect completed monitoring work

- Mark structured logging and timing metrics as completed
- Update test coverage stats (cmd/ improved from 66.4% to 76.8%)
- Add completed infrastructure section for today's work
- Update current status date and add monitoring to health indicators

* feat: complete TODO.md technical debt cleanup

Complete all remaining TODO.md tasks with comprehensive implementation:

## 🎯 Validation Caching Implementation
- Thread-safe validation cache with sync.RWMutex protection
- MetricsRecorder interface to avoid circular dependencies
- Cached validation for IP, jail, filter, and command validation
- Integration with existing metrics system for cache hit/miss tracking
- 100% test coverage for caching functionality

## 🔧 Constants Extraction
- Fail2Ban status codes: Fail2BanStatusSuccess, Fail2BanStatusAlreadyProcessed
- Command constants: Fail2BanClientCommand, Fail2BanRegexCommand, Fail2BanServerCommand
- File permissions: DefaultFilePermissions (0600), DefaultDirectoryPermissions (0750)
- Timeout limits: MaxCommandTimeout, MaxFileTimeout, MaxParallelTimeout
- Updated all references throughout codebase to use named constants

## 📊 Test Coverage Improvement
- Increased fail2ban package coverage from 62.0% to 70.3% (target: 70%+)
- Added 6 new comprehensive test files with 200+ additional test cases
- Coverage improvements across all major components:
  - Context helpers, validation cache, mock clients, OS runner methods
  - Error constructors, timing operations, cache statistics
  - Thread safety and concurrency testing

## 🛠️ Code Quality & Fixes
- Fixed all linting issues (golangci-lint, revive, errcheck)
- Resolved unused parameter warnings and error handling
- Fixed timing-dependent test failures in worker pool cancellation
- Enhanced thread safety in validation caching

## 📈 Final Metrics
- Overall test coverage: 72.4% (up from ~65%)
- fail2ban package: 70.3% (exceeds 70% target)
- cmd package: 76.9%
- Zero TODO/FIXME/HACK comments in production code
- 100% linting compliance

* fix: resolve test framework issues and update documentation

- Remove unnecessary defer/recover block in comprehensive_framework_test.go
- Fix compilation error in command_test_framework.go variable redeclaration
- Update TODO.md to reflect all 12 completed code quality fixes
- Clean up dead code and improve test maintainability
- Fix linting issues: error handling, code complexity, security warnings
- Break down complex test function to reduce cyclomatic complexity

* fix: replace dangerous test commands with safe placeholders

Replaces actual dangerous commands in test cases with safe placeholder patterns to prevent accidental execution while maintaining comprehensive security testing.

- Replace 'rm -rf /', 'cat /etc/passwd' with 'DANGEROUS_RM_COMMAND', 'DANGEROUS_SYSTEM_CALL'
- Update GetDangerousCommandPatterns() to recognize both old and new patterns
- Enhance filter validation with command injection protection (semicolons, pipes, backticks, dollar signs)
- Add package documentation comments for all packages (main, cmd, fail2ban)
- Fix GoReleaser static linking configuration for cross-platform builds
- Remove Docker platform restriction to enable multi-arch support
- Apply code formatting and linting fixes

All security validation tests continue to pass with the safe placeholders.

* fix: resolve TestMixedConcurrentOperations race condition and command key mismatches

The concurrency test was failing due to several issues:

1. **Command Key Mismatch**: Test setup used "sudo test arg" key but MockRunner
   looked for "test arg" because "test" command doesn't require sudo
2. **Invalid Commands**: Using "test" and "echo" commands that aren't in the
   fail2ban command allowlist, causing validation failures
3. **Race Conditions**: Multiple goroutines setting different MockRunners
   simultaneously, overwriting responses

**Solution:**
- Replace invalid test commands ("test", "echo") with valid fail2ban commands
  ("fail2ban-client status", "fail2ban-client -V")
- Pre-configure shared MockRunner with all required response keys for both
  sudo and non-sudo execution paths
- Improve test structure to reduce race conditions between setup and execution

All tests now pass reliably, resolving the CI failure.

* fix: address code quality issues and improve test coverage

- Replace unsafe type assertion with comma-ok idiom in logging
- Fix TestTestFilter to use created filter instead of nonexistent
- Add warning logs for invalid log level configurations
- Update TestVersionCommand to use consistent test framework pattern
- Remove unused LoggerContextKey constant
- Add version command support to test framework
- Fix trailing whitespace in test files

* feat: add timeout handling and multi-architecture Docker support

* test: enhance path traversal security test coverage

* chore: comprehensive documentation update and linting fixes

Updated all documentation to reflect current capabilities including context-aware operations, multi-architecture Docker support, advanced security features, and performance monitoring. Removed unused functions and fixed all linting issues.

* fix(lint): .goreleaser.yaml

* feat: add markdown link checker and fix all linting issues

- Add markdown-link-check to pre-commit hooks with comprehensive configuration
- Fix GitHub workflow structure (sync-labels.yml) with proper job setup
- Add JSON schemas to all configuration files for better IDE support
- Update tool installation in Makefile for markdown-link-check dependency
- Fix all revive linting issues (Boolean literals, defer in loop, if-else simplification, method naming)
- Resolve broken relative link in CONTRIBUTING.md
- Configure rate limiting and ignore patterns for GitHub URLs
- Enhance CLAUDE.md with link checking documentation

* fix(ci): sync-labels permissions

* docs: comprehensive documentation update reflecting current project status

- Updated TODO.md to show production-ready status with 21 commands
- Enhanced README.md with enterprise-grade features and capabilities
- Added performance monitoring and timeout configuration to FAQ
- Updated CLAUDE.md with accurate project architecture overview
- Fixed all line length issues to meet EditorConfig requirements
- Added .mega-linter.yml configuration for enhanced linting

* fix: address CodeRabbitAI review feedback

- Split .goreleaser.yaml builds for static/dynamic linking by architecture
- Update docs to accurately reflect 7 path traversal patterns (not 17)
- Fix containsPathTraversal to allow valid absolute paths
- Replace runnerCombinedRunWithSudoContext with RunnerCombinedOutputWithSudoContext
- Fix ldflags to use uppercase Version variable name
- Remove duplicate test coverage metrics in TODO.md
- Fix .markdown-link-check.json schema violations
- Add v8r JSON validator to pre-commit hooks

* chore(ci): update workflows, switch v8r to check-jsonschema

* fix: restrict static linking to amd64 only in .goreleaser.yaml

- Move arm64 from static to dynamic build configuration
- Static linking now only applies to linux/amd64
- Prevents build failures due to missing static libc on ARM64
- All architectures remain supported with appropriate linking

* fix(ci): caching

* fix(ci): python caching with pip, node with npm

* fix(ci): no caching for node then

* fix(ci): no requirements.txt, no cache

* refactor: address code review feedback

- Pin Alpine base image to v3.20 for reproducible builds
- Remove redundant --platform flags in GoReleaser Docker configs
- Fix unused parameters in concurrency test goroutines
- Simplify string search helper using strings.Contains()
- Remove redundant error checking logic in security tests

---------

Signed-off-by: Ismo Vuorinen <ismo@ivuorinen.net>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2025-08-07 01:49:45 +03:00

558 lines
15 KiB
Go

package fail2ban
import (
"bufio"
"context"
"fmt"
"io"
"net/url"
"os"
"path/filepath"
"sort"
"strconv"
"strings"
)
/*
Package fail2ban provides log reading and filtering utilities for Fail2Ban logs.
This file contains logic for reading, parsing, and filtering Fail2Ban log files,
including support for rotated and compressed logs.
*/
// GetLogLines reads Fail2Ban log files (current and rotated) and filters lines by jail and/or IP.
//
// jailFilter: jail name to filter by (empty or "all" for all jails)
// ipFilter: IP address to filter by (empty or "all" for all IPs)
//
// Returns a slice of matching log lines, or an error.
// This function uses streaming to limit memory usage.
func GetLogLines(jailFilter string, ipFilter string) ([]string, error) {
return GetLogLinesWithLimit(jailFilter, ipFilter, 1000) // Default limit for safety
}
// GetLogLinesWithLimit returns log lines with configurable limits for memory management.
func GetLogLinesWithLimit(jailFilter string, ipFilter string, maxLines int) ([]string, error) {
// Handle zero limit case - return empty slice immediately
if maxLines == 0 {
return []string{}, nil
}
pattern := filepath.Join(GetLogDir(), "fail2ban.log*")
files, err := filepath.Glob(pattern)
if err != nil {
return nil, fmt.Errorf("error listing log files: %w", err)
}
if len(files) == 0 {
return []string{}, nil
}
currentLog, rotated := parseLogFiles(files)
// Use streaming approach with memory limits
config := LogReadConfig{
MaxLines: maxLines,
MaxFileSize: 100 * 1024 * 1024, // 100MB file size limit
JailFilter: jailFilter,
IPFilter: ipFilter,
ReverseOrder: false,
}
var allLines []string
totalLines := 0
// Read rotated logs first (oldest to newest) - maintains original ordering
for _, rotatedFile := range rotated {
if config.MaxLines > 0 && totalLines >= config.MaxLines {
break
}
// Adjust remaining lines limit (skip limit check for negative MaxLines)
fileConfig := config
if config.MaxLines > 0 {
remainingLines := config.MaxLines - totalLines
if remainingLines <= 0 {
break
}
fileConfig.MaxLines = remainingLines
}
lines, err := streamLogFile(rotatedFile.path, fileConfig)
if err != nil {
getLogger().WithError(err).WithField("file", rotatedFile.path).Error("Failed to read rotated log file")
continue
}
allLines = append(allLines, lines...)
totalLines += len(lines)
}
// Read current log last (most recent) - maintains original ordering
if currentLog != "" && (config.MaxLines <= 0 || totalLines < config.MaxLines) {
fileConfig := config
if config.MaxLines > 0 {
remainingLines := config.MaxLines - totalLines
if remainingLines <= 0 {
return allLines, nil
}
fileConfig.MaxLines = remainingLines
}
lines, err := streamLogFile(currentLog, fileConfig)
if err != nil {
getLogger().WithError(err).WithField("file", currentLog).Error("Failed to read current log file")
} else {
allLines = append(allLines, lines...)
}
}
return allLines, nil
}
// parseLogFiles parses log file names and returns the current log and a slice of rotated logs
// (sorted oldest to newest).
func parseLogFiles(files []string) (string, []rotatedLog) {
var currentLog string
var rotated []rotatedLog
for _, path := range files {
base := filepath.Base(path)
if base == "fail2ban.log" {
currentLog = path
} else if strings.HasPrefix(base, "fail2ban.log.") {
if num := extractLogNumber(base); num >= 0 {
rotated = append(rotated, rotatedLog{num: num, path: path})
}
}
}
// Sort rotated logs by number descending (highest number = oldest log)
sort.Slice(rotated, func(i, j int) bool {
return rotated[i].num > rotated[j].num
})
return currentLog, rotated
}
// extractLogNumber extracts the rotation number from a log file name (e.g., "fail2ban.log.2.gz" -> 2).
func extractLogNumber(base string) int {
numPart := strings.TrimPrefix(base, "fail2ban.log.")
numPart = strings.TrimSuffix(numPart, ".gz")
if n, err := strconv.Atoi(numPart); err == nil {
return n
}
return -1
}
// rotatedLog represents a rotated log file with its rotation number.
type rotatedLog struct {
num int
path string
}
// LogReadConfig holds configuration for streaming log reading
type LogReadConfig struct {
MaxLines int // Maximum number of lines to read (0 = unlimited)
MaxFileSize int64 // Maximum file size to process in bytes (0 = unlimited)
JailFilter string // Filter by jail name (empty = no filter)
IPFilter string // Filter by IP address (empty = no filter)
ReverseOrder bool // Read from end of file backwards (for recent logs)
}
// streamLogFile reads a log file line by line with memory limits and filtering
func streamLogFile(path string, config LogReadConfig) ([]string, error) {
cleanPath, err := validateLogPath(path)
if err != nil {
return nil, err
}
if shouldSkipFile(cleanPath, config.MaxFileSize) {
return []string{}, nil
}
scanner, cleanup, err := createLogScanner(cleanPath)
if err != nil {
return nil, err
}
defer cleanup()
return scanLogLines(scanner, config)
}
// streamLogFileWithContext reads a log file line by line with memory limits,
// filtering, and context support for timeouts
func streamLogFileWithContext(ctx context.Context, path string, config LogReadConfig) ([]string, error) {
// Check context before starting
select {
case <-ctx.Done():
return nil, ctx.Err()
default:
}
cleanPath, err := validateLogPath(path)
if err != nil {
return nil, err
}
if shouldSkipFile(cleanPath, config.MaxFileSize) {
return []string{}, nil
}
scanner, cleanup, err := createLogScanner(cleanPath)
if err != nil {
return nil, err
}
defer cleanup()
return scanLogLinesWithContext(ctx, scanner, config)
}
// PathSecurityConfig holds configuration for path security validation
type PathSecurityConfig struct {
AllowedBasePaths []string // List of allowed base directories
MaxPathLength int // Maximum allowed path length (0 = unlimited)
AllowSymlinks bool // Whether to allow symlinks
ResolveSymlinks bool // Whether to resolve symlinks before validation
}
// validateLogPath validates and sanitizes the log file path with comprehensive security checks
func validateLogPath(path string) (string, error) {
config := PathSecurityConfig{
AllowedBasePaths: []string{GetLogDir()}, // Use configured log directory
MaxPathLength: 4096, // Reasonable path length limit
AllowSymlinks: false, // Disable symlinks for security
ResolveSymlinks: true, // Resolve symlinks before validation
}
return validatePathWithSecurity(path, config)
}
// validatePathWithSecurity performs comprehensive path security validation
func validatePathWithSecurity(path string, config PathSecurityConfig) (string, error) {
if path == "" {
return "", fmt.Errorf("empty path not allowed")
}
// Check path length limits
if config.MaxPathLength > 0 && len(path) > config.MaxPathLength {
return "", fmt.Errorf("path too long: %d characters (max: %d)", len(path), config.MaxPathLength)
}
// Detect and prevent null byte injection
if strings.Contains(path, "\x00") {
return "", fmt.Errorf("path contains null byte")
}
// Decode URL-encoded path traversal attempts
if decodedPath, err := url.QueryUnescape(path); err == nil && decodedPath != path {
getLogger().WithField("original", path).WithField("decoded", decodedPath).
Warn("Detected URL-encoded path, using decoded version for validation")
path = decodedPath
}
// Normalize unicode characters to prevent bypass attempts
path = normalizeUnicode(path)
// Basic path traversal detection (before cleaning)
if hasPathTraversal(path) {
return "", fmt.Errorf("path contains path traversal patterns")
}
// Clean and resolve the path
cleanPath, err := filepath.Abs(filepath.Clean(path))
if err != nil {
return "", fmt.Errorf("invalid path: %w", err)
}
// Additional check after cleaning (double-check for sophisticated attacks)
if hasPathTraversal(cleanPath) {
return "", fmt.Errorf("path contains path traversal patterns after normalization")
}
// Handle symlinks according to configuration
finalPath, err := handleSymlinks(cleanPath, config)
if err != nil {
return "", err
}
// Validate against allowed base paths
if err := validateBasePath(finalPath, config.AllowedBasePaths); err != nil {
return "", err
}
// Check if path points to a device file or other dangerous file types
if err := validateFileType(finalPath); err != nil {
return "", err
}
return finalPath, nil
}
// hasPathTraversal detects various path traversal patterns
func hasPathTraversal(path string) bool {
// Check for various path traversal patterns
dangerousPatterns := []string{
"..",
"./",
".\\",
"//",
"\\\\",
"/../",
"\\..\\",
"%2e%2e", // URL encoded ..
"%2f", // URL encoded /
"%5c", // URL encoded \
"\u002e\u002e", // Unicode ..
"\u2024\u2024", // Unicode bullet points (can look like ..)
"\uff0e\uff0e", // Full-width Unicode ..
}
pathLower := strings.ToLower(path)
for _, pattern := range dangerousPatterns {
if strings.Contains(pathLower, strings.ToLower(pattern)) {
return true
}
}
return false
}
// normalizeUnicode normalizes unicode characters to prevent bypass attempts
func normalizeUnicode(path string) string {
// Replace various Unicode representations of dots and slashes
replacements := map[string]string{
"\u002e": ".", // Unicode dot
"\u2024": ".", // Unicode bullet (one dot leader)
"\uff0e": ".", // Full-width dot
"\u002f": "/", // Unicode slash
"\u2044": "/", // Unicode fraction slash
"\uff0f": "/", // Full-width slash
"\u005c": "\\", // Unicode backslash
"\uff3c": "\\", // Full-width backslash
}
result := path
for unicode, ascii := range replacements {
result = strings.ReplaceAll(result, unicode, ascii)
}
return result
}
// handleSymlinks resolves or validates symlinks according to configuration
func handleSymlinks(path string, config PathSecurityConfig) (string, error) {
// Check if the path is a symlink
if info, err := os.Lstat(path); err == nil {
if info.Mode()&os.ModeSymlink != 0 {
if !config.AllowSymlinks {
return "", fmt.Errorf("symlinks not allowed: %s", path)
}
if config.ResolveSymlinks {
resolved, err := filepath.EvalSymlinks(path)
if err != nil {
return "", fmt.Errorf("failed to resolve symlink: %w", err)
}
return resolved, nil
}
}
} else if !os.IsNotExist(err) {
return "", fmt.Errorf("failed to check file info: %w", err)
}
return path, nil
}
// validateBasePath ensures the path is within allowed base directories
func validateBasePath(path string, allowedBasePaths []string) error {
if len(allowedBasePaths) == 0 {
return nil // No restrictions if no base paths configured
}
for _, basePath := range allowedBasePaths {
cleanBasePath, err := filepath.Abs(filepath.Clean(basePath))
if err != nil {
continue
}
// Check if path starts with allowed base path
if strings.HasPrefix(path, cleanBasePath+string(filepath.Separator)) ||
path == cleanBasePath {
return nil
}
}
return fmt.Errorf("path outside allowed directories: %s", path)
}
// validateFileType checks for dangerous file types (devices, named pipes, etc.)
func validateFileType(path string) error {
// Check if file exists
info, err := os.Stat(path)
if os.IsNotExist(err) {
return nil // File doesn't exist yet, allow it
}
if err != nil {
return fmt.Errorf("failed to stat file: %w", err)
}
mode := info.Mode()
// Block device files
if mode&os.ModeDevice != 0 {
return fmt.Errorf("device files not allowed: %s", path)
}
// Block named pipes (FIFOs)
if mode&os.ModeNamedPipe != 0 {
return fmt.Errorf("named pipes not allowed: %s", path)
}
// Block socket files
if mode&os.ModeSocket != 0 {
return fmt.Errorf("socket files not allowed: %s", path)
}
// Block irregular files (anything that's not a regular file or directory)
if !mode.IsRegular() && !mode.IsDir() {
return fmt.Errorf("irregular file type not allowed: %s", path)
}
return nil
}
// shouldSkipFile checks if a file should be skipped due to size limits
func shouldSkipFile(path string, maxFileSize int64) bool {
if maxFileSize <= 0 {
return false
}
if info, err := os.Stat(path); err == nil {
if info.Size() > maxFileSize {
getLogger().WithField("file", path).WithField("size", info.Size()).
Warn("Skipping large log file due to size limit")
return true
}
}
return false
}
// createLogScanner creates a scanner for the log file, handling gzip compression
func createLogScanner(path string) (*bufio.Scanner, func(), error) {
// #nosec G304 - Path is validated and sanitized above
const maxLineSize = 64 * 1024 // 64KB per line
return CreateGzipAwareScannerWithBuffer(path, maxLineSize)
}
// scanLogLines scans lines from the scanner with filtering and limits
func scanLogLines(scanner *bufio.Scanner, config LogReadConfig) ([]string, error) {
var lines []string
lineCount := 0
for scanner.Scan() {
line := strings.TrimSpace(scanner.Text())
if line == "" {
continue
}
if !passesFilters(line, config) {
continue
}
lines = append(lines, line)
lineCount++
if config.MaxLines > 0 && lineCount >= config.MaxLines {
break
}
}
if err := scanner.Err(); err != nil {
return nil, fmt.Errorf("error scanning log file: %w", err)
}
return lines, nil
}
// scanLogLinesWithContext scans log lines with context support for timeout handling
func scanLogLinesWithContext(ctx context.Context, scanner *bufio.Scanner, config LogReadConfig) ([]string, error) {
var lines []string
lineCount := 0
linesProcessed := 0
for scanner.Scan() {
// Check context periodically (every 100 lines to avoid excessive overhead)
if linesProcessed%100 == 0 {
select {
case <-ctx.Done():
return nil, ctx.Err()
default:
}
}
linesProcessed++
line := strings.TrimSpace(scanner.Text())
if line == "" {
continue
}
if !passesFilters(line, config) {
continue
}
lines = append(lines, line)
lineCount++
if config.MaxLines > 0 && lineCount >= config.MaxLines {
break
}
}
if err := scanner.Err(); err != nil {
return nil, fmt.Errorf("error scanning log file: %w", err)
}
return lines, nil
}
// passesFilters checks if a log line passes the configured filters
func passesFilters(line string, config LogReadConfig) bool {
if config.JailFilter != "" && config.JailFilter != AllFilter {
jailPattern := fmt.Sprintf("[%s]", config.JailFilter)
if !strings.Contains(line, jailPattern) {
return false
}
}
if config.IPFilter != "" && config.IPFilter != AllFilter {
if !strings.Contains(line, config.IPFilter) {
return false
}
}
return true
}
// readLogFile reads the contents of a log file, handling gzip compression if necessary.
// DEPRECATED: Use streamLogFile instead for better memory efficiency.
func readLogFile(path string) ([]byte, error) {
// Validate path for security using comprehensive validation
cleanPath, err := validateLogPath(path)
if err != nil {
return nil, err
}
// Use consolidated gzip detection utility
reader, err := OpenGzipAwareReader(cleanPath)
if err != nil {
return nil, err
}
defer func() {
if cerr := reader.Close(); cerr != nil {
getLogger().WithError(cerr).Error("failed to close log file")
}
}()
return io.ReadAll(reader)
}