WTRL TTT Scraper

Project Overview

The WTRL TTT Scraper is a Python-based project designed to scrape, process, and visualize time trial results from the WTRL website. It provides detailed data analysis for multiple teams and riders, generates dashboards with sortable tables and charts, and outputs static HTML files.

The results are found here (requires a WTRL account):
🔗 WTRL TTT Results

Features

Fetches and parses race data from the WTRL website.
Supports multiple clubs with multiple teams and aliases.
Outputs sortable and filterable DataTables with visualizations.
Generates static HTML dashboards for each team and an index for each club.
Files can be hosted on Netlify or similar static hosting platforms.

Setup and Configuration

1. Install Dependencies

Ensure you have Python 3.8 or higher installed.
Install required packages using:
```
pip install -r requirements.txt
```

2. Configuration

A configuration file named config.secret.json is required in the project root.
Start by copying the template:
```
cp config.json config.secret.json
```

Configuration Format (config.secret.json):

{
  "netlify_auth_token": "your_global_netlify_auth_token",
  "wtrl_sid": "your_wtrl_sid",
  "wtrl_ouid": "your_wtrl_ouid",
  "ctoken": "your_ctoken",
  "clubs": [
    {
      "club_name": "Waterloo Cycling Club",
      "site_id": "your_wcc_site_id",
      "teams": [
        {
          "team_name": "Waterloo Ones",
          "aliases": ["WCC Butter Tarts"]
        },
        {
          "team_name": "Waterloo Twos",
          "aliases": ["WCC Canucks"]
        }
      ]
    },
    {
      "club_name": "SISU Racing",
      "site_id": "your_sisu_site_id",
      "teams": [
        {
          "team_name": "SISU Racing Grit",
          "aliases": []
        },
        {
          "team_name": "SISU Racing Mokki",
          "aliases": []
        }
      ]
    }
  ]
}

3. Extract Required Tokens

Running main.py will open a browser for you to log in to WTRL.
The tokens will automatically be extracted and inserted into config.secret.json.
[Optional] Manually extract tokens using Developer Tools:
- wtrl_sid and wtrl_ouid: Found under Application > Cookies > https://www.wtrl.racing.
- ctoken: Found under Application > Session Storage > https://www.wtrl.racing (refresh after logout).

File Structure

WTRL-TTT-Scraper/
├── cache/                   # Cached race results in JSON format
├── results/                 # Generated HTML files for clubs and teams
├── wtrl_ttt_scraper/        # Core package for the scraper
│   ├── __init__.py          # Package init
│   ├── calculate.py         # Percentile calculations
│   ├── deploy.py            # Automates deployment to Netlify
│   ├── format.py            # Formatting utilities
│   ├── models.py            # Data models
│   ├── parse.py             # Parsing utilities
│   ├── render.py            # HTML rendering
│   ├── scrape.py            # Scraping logic
├── .gitignore               # Ignore sensitive files
├── config.json              # Default configuration
├── config.secret.json       # User-specific configuration (ignored in Git)
├── LICENSE.md               # License file
├── main.py                  # Entry point for scraping
├── README.md                # Documentation
├── requirements.txt         # Python dependencies
└── scripts.sh               # Utility scripts for automation

Usage

1. Run the Scraper

Execute the main script to scrape data and generate results:
```
python main.py
```
The script will automatically:
- Load config.secret.json (or fallback to config.json).
- Authenticate with WTRL.
- Scrape all configured clubs and teams.
- Save results in results/[club_name]/.

2. Access Results

Generated HTML files will be saved in the results/ directory, structured per club:
- results/[club_name]/index.html → Central index for the club.
- results/[club_name]/[team_name].html → Individual dashboards.

3. Publish Results

Host the results/ directory on a static hosting service:
- Netlify deployment is automated via deploy.py
- [optional] You can also manually drag and drop results/[club] into Netlify’s UI.

4. Format Code

Ensure Python code is formatted using Black
```
black .
```

Notes

Make sure your config.secret.json file is excluded from version control (.gitignore).
The cache/ directory stores JSON-formatted race results for caching.
Keep your extracted tokens (wtrl_sid, wtrl_ouid, ctoken) secure, as they are tied to your account.

Contributing

Feel free to submit issues or pull requests to improve the project. Contributions are always welcome! 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WTRL TTT Scraper

Project Overview

Features

Setup and Configuration

1. Install Dependencies

2. Configuration

3. Extract Required Tokens

File Structure

Usage

1. Run the Scraper

2. Access Results

3. Publish Results

4. Format Code

Notes

Contributing

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
cache		cache
examples		examples
wtrl_ttt_scraper		wtrl_ttt_scraper
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
config.json		config.json
config.py		config.py
main.py		main.py
requirements.txt		requirements.txt
scripts.sh		scripts.sh

License

ekalvi/wtrl-ttt-scraper

Folders and files

Latest commit

History

Repository files navigation

WTRL TTT Scraper

Project Overview

Features

Setup and Configuration

1. Install Dependencies

2. Configuration

3. Extract Required Tokens

File Structure

Usage

1. Run the Scraper

2. Access Results

3. Publish Results

4. Format Code

Notes

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages