Unstructured Meshes in MARS

Welcome to the MARS Unstructured Meshes Documentation. This section covers the unstructured mesh handling capabilities in the MARS (Multilevel Adaptive Refinement Solver) framework.

Overview

MARS provides comprehensive support for unstructured mesh processing, including:

ElementDomain: Core class for managing mesh elements and their properties
Mesh Reading: Support for various mesh formats with automatic partitioning
SFC Mapping: Space-filling curve mapping for load balancing
Adjacency Structures: Efficient neighbor finding and connectivity
Halo Management: Ghost cell handling for parallel computations (element halo + per-node halo)
Coordinate Caching: Optimized coordinate storage and access
Characteristic Sizes: Mesh quality metrics and sizing functions
Connectivity Management: Element-to-element relationships
GPU Acceleration: CUDA-based parallel processing
Multi-Rank Support: MPI-based distributed computing
AMR: GPU-native multi-rank adaptive mesh refinement with solution transfer

Key Components

Core Infrastructure

ElementDomain Overview - Main class for unstructured mesh management
Mesh Reading and Partitioning - Input handling and domain decomposition
SFC Mapping - Load balancing through space-filling curves

Advanced Features

Adjacency Structures - Neighbor relationships and connectivity
Halo Management - Ghost cell management for parallel processing
Node Halo Topology - Per-node halo via direct CUDA-aware MPI
Coordinate Caching - Efficient coordinate storage
Characteristic Sizes - Mesh quality and sizing
Connectivity Management - Element relationships

AMR Module - Refinement pipeline (mark → refine → rebuild → transfer)
Solution Transfer - Field interpolation across AMR levels (warm-start CG)

Performance & Parallelism

GPU Acceleration - CUDA implementation details
Multi-Rank Support - MPI parallel processing

Quick Start

#include <mars.hpp>

// Hex8 mesh, double precision, uint64_t SFC keys, GPU
using Domain = mars::ElementDomain<mars::HexTag, double, uint64_t, cstone::GpuTag>;

// Read + partition + build cstone domain
Domain domain(meshFile, rank, numRanks);

// Lazy-built data: triggers HaloData (ownership) + NodeHaloTopology
const auto& d_nodeOwnership = domain.getNodeOwnershipMap();   // size = nodeCount
const auto& d_conn          = domain.getElementToNodeConnectivity();  // local node IDs

// Cache decoded node coordinates
domain.cacheNodeCoordinates();
const auto& d_x = domain.getNodeX();
const auto& d_y = domain.getNodeY();
const auto& d_z = domain.getNodeZ();

// Distributed CG halo callback
auto haloExchange = [&domain, dofMap = d_node_to_dof.data()]
    (cstone::DeviceVector<double>& p) {
        domain.exchangeNodeHalo(p, dofMap);
    };

For an end-to-end distributed AMR + Poisson example, see examples/distributed/unstructured/mars_amr_cvfem_graph.cu and the AMR Module walkthrough.

Architecture

The unstructured mesh system in MARS is built on Cornerstone octree and uses a lazy composition pattern where GPU components are initialized on-demand to minimize VRAM usage and startup time. Key architectural decisions include:

GPU-Native Design: All data structures live in device memory (Cornerstone DeviceVector)
Template-based Design: Type-safe mesh element handling via AcceleratorTag
Lazy Initialization: GPU components (adjacency, halo, coordinates) built only when needed
SFC-Centric: Elements identified by space-filling curve keys, not integer indices
Thrust Algorithms: CSR building, sorting, and reductions use Thrust primitives
MPI Partitioning: Multi-rank support via Cornerstone domain decomposition

Contributing

When contributing to the unstructured mesh system:

Follow the existing template patterns for new mesh element types
Implement lazy initialization for new components
Add comprehensive tests for parallel functionality
Update this documentation for new features