2504.12534v2
Model: gemini-2.0-flash
# A functional limit theorem for a dynamical system with an observable maximised on a Cantor set
**Authors**: Raquel Couto, Ana Cristina Moreira Freitas, Jorge Milhazes Freitas, Mike Todd
> Raquel CoutoCentro de Matemática & Faculdade de Ciências da Universidade do PortoRua do
Campo Alegre 6874169-007 PortoPortugalup201106786@fc.up.pt
> Ana Cristina Moreira FreitasCentro de Matemática &
Faculdade de Economia da Universidade do PortoRua Dr. Roberto Frias4200-464 PortoPortugalamoreira@fep.up.pthttp://www.fep.up.pt/docentes/amoreira/
> Jorge Milhazes FreitasCentro de Matemática & Faculdade de Ciências da Universidade do PortoRua do
Campo Alegre 6874169-007 PortoPortugaljmfreita@fc.up.pthttp://www.fc.up.pt/pessoas/jmfreita/
> Mike ToddMathematical InstituteUniversity of St AndrewsNorth HaughSt AndrewsKY16 9SSScotlandm.todd@st-andrews.ac.ukhttps://mtoddm.github.io/
(Date: January 17, 2026)
Abstract.
We consider heavy-tailed observables maximised on a dynamically defined Cantor set and prove convergence of the associated point processes as well as functional limit theorems. The Cantor structure, and its connection to the dynamics, causes clustering of large observations: this is captured in the ‘decorations’ on our point processes and functional limits, an application of the theory developed in a paper by the latter three authors.
Key words and phrases: Functional limit theorems, point processes, Lévy processes with decorations, extremal processes, clustering, Cantor sets, hitting times
2020 Mathematics Subject Classification: 37A25, 37A50, 37B20, 60F17, 60G55, 60G70
All authors were partially financed by Portuguese public funds through FCT – Fundação para a Ciência e a Tecnologia, I.P., in the framework of the projects PTDC/MAT-PUR/4048/2021, 2022.07167.PTDC and UID/00144 - Centro de Matemática da Universidade do Porto. RC was supported by an FCT scholarship with Reference Number PD/BD/150456/2019.
1. Introduction
Understanding the statistical properties of extreme events in dynamical systems is crucial across various scientific disciplines, notably in climate dynamics where accurate predictions of rare but impactful phenomena are paramount. This paper extends the study of extreme value theory and sums of heavy tailed observables which are maximised on fractal sets, paying particular attention to the clustering profiles of extreme observations.
The motivations to consider fractal maximal sets stem, in particular, from meteorology where the appearance of critical regions with a complex multifractal structure is common, such as in [FMAY17], where the anomalies for the precipitation frequency data are consistent with an underlying fractal or, in [FAMR19], where it was observed that the enhancement of greenhouse gases carries the attractor towards some sort of lower dimensional fractal structure. As for the relevance of understanding clustering, we mention its crucial role for assessing compound risks, such in the case of the 2022 European drought addressed in [FPB23].
The study of rare events for observables with fractal maximal sets is quite recent in the dynamical systems framework ([MP16, FFRS20, FFS21]). In these papers, the main results prove the existence of distributional limits for the partial maxima of stochastic processes arising from uniformly expanding systems with these observables, which, of course, is related to the elapsed time the orbits take to enter small vicinities of the maximal fractal sets (see [FFT10, FFT11]).
Here, we generalise these results and prove the convergence of decorated point processes of rare events, which provide a more refined approach, aimed at better characterising how extreme events cluster and how their cumulative impact can be quantified. In particular, using the tools developed in [FFT25], we will also prove the existence of enriched functional limits for sums of heavy tailed observables with such fractal maximal sets. We note that the theory in [FFT25] is related to ideas in stochastic processes where richer spaces for the limiting process to converge in were developed, such as in [BPS18] (see [FFT25] for a fuller historical account).
We recall that the study sums of heavy tailed stochastic processes is closely intertwined with the extremal properties of the process because the behaviour of the sum is dominated by that of the extreme observations among the summands ([LWZ81]). In particular, in order to prove convergence to a stable law or to a corresponding Lévy process, the convergence of point processes is usually at the core of the approach (see for example [T10, JPZ20, FFT25, CKM24], in the dynamical setting, or earlier works [D83, R87, DH95] in the more classical statistical setting).
Dynamically generated heavy tailed stochastic processes can occur either because the observable is unbounded with a polynomial type of spike on the maximal set, or because the dynamics has some some source of non-hyperbolicity, like an indifferent periodic point or a cusp, where the dynamics is severely decelerated so that, after inducing, there is a piling of observations that aggregate making the induced observable heavy tailed. To our knowledge, the sources of heavy tailed behaviour in the literature have been taken either finite or at most countable, which is one of the reasons for the interest in considering fractal sets.
In order to be able to give concrete formulae and describe the point processes, particularly their decorations, we will consider uniformly hyperbolic systems (Markov full branched linear maps) and observables maximised on dynamically generated Cantor sets associated to these maps. We remark that although we work with simple models, they capture the essence of the limiting behaviour and we believe that the spirit of our findings should prevail in more general settings, where closed formulas would be impossible to obtain.
One of the highlights of this paper lies in the construction of an observable function maximised on Cantor sets that still has a continuous distribution function. This is achieved by considering a carefully designed symbolic distance to the maximal set, which is necessary to apply the theory developed in [FFT25] to obtain enriched functional limit theorems and the convergence of decorated point process of rare events. This contrasts with the Cantor ladder type of observables considered in [MP16, FFRS20, FFS21], which give rise to discrete distributions.
The formal statements of the main results involve introducing objects and notations that would overload this introduction and for this reason we defer it to Section 4.1. Nonetheless, in the following subsections we will give an informal version of these results.
1.1. Enriched functional limit theorems
We will consider a finite full-branched interval map and a corresponding dynamically defined Cantor set. Then, defining an observable maximised on this Cantor set, we obtain a sequence of stationary random variables $X_{0},X_{1},...$ with an $\alpha$ -heavy tail and prove a suitable functional limit theorem, namely find limits of
$$
S_{n}(t)=\sum_{i=0}^{\lfloor nt\rfloor-1}\dfrac{1}{n^{\frac{1}{\alpha}}}X_{i},\ t\in[0,1].
$$
We will see that the limit, which will live in the space $F^{\prime}$ of decorated cà dlà g functions, introduced in [FFT25], can be seen as an $\alpha$ stable Lévy process, denoted by $V$ , with decorations at each jump, which capture the excursion performed by the finite time process $S_{n}$ , when a cluster of abnormally high observations occurs (note that these correspond to visits of the orbit to close vicinities of the maximal (Cantor) set). In this case, the compatibility between the systems considered and the structure of the maximal (Cantor) set imply that the decoration corresponding to the jumping time, $s$ , can be described by a cà dlà g function of the form:
$$
e_{V}^{s}(t)=V(s^{-})+U^{-\frac{1}{\alpha}}\displaystyle\sum_{0\leq j\leq\lfloor\tan(\pi t/2)\rfloor}(1-\theta)^{\frac{j}{\alpha}},\ t\in[0,1]
$$
where $0<\theta<1$ is the extremal index, which gives the reciprocal of the average number of abnormal observations within a cluster and, in this case, is related to the decay rate of the measure of the iterative approximations of the maximal (Cantor) set, during its construction. In the case of the ternary Cantor set, $\theta=2/3$ .
1.2. Point processes with decorations
The convergence of enriched functional limit theorems is derived from the convergence of point processes with decorations. Point processes are a very powerful tool to study rare events and, in particular, sums of heavy tailed observations. An illustration of their convenience is the fact that the Lévy processes obtained here can be written as an integral with respect to a Poisson random measure that underlies the limiting point process.
The construction of the point processes is based on splitting $n$ observations into $k_{n}$ blocks of size $r_{n}$ , where both $(k_{n})_{n},(r_{n})_{n}$ diverge. Then we consider a point process with a time component keeping record of each block and a decoration consisting of a vector of increasing length with all block observations conveniently normalised:
$$
N_{n}=\sum_{i=1}^{\infty}\delta_{\left(i/k_{n},\mathbb{X}_{n,i}\right)},
$$
where $\mathbb{X}_{n,i}=\left(n(X_{(i-1)r_{n}})^{-\alpha},n(X_{(i-1)r_{n}+1})^{-\alpha},...,n(X_{r_{n}-1})^{-\alpha}\right)$ .
The main result establishes that these point processes after embedded in a proper space converge to a limiting process that can be written as a bidimensional Poisson process with Lebesgue intensity measure decorated with a bi-infinite sequence, which in this case is equal to $(1-\theta)^{j}$ for all $j≥ 0$ and $∞$ for all $j≤-1$ .
1.3. Structure of the paper
In Section 2 we outline our class of interval maps and the observables we will use, which are maximised on a dynamically defined Cantor set. In Section 3 we explain the theory of functional limit theorems for heavy tailed systems with clustering. This involved defining sequence spaces $l_{∞}^{+},\tilde{l}_{∞}^{+}$ aligned to a ‘block structure’, defining the anchored tail process, defining the dependence conditions $\D_{q_{n}}^{*}$ and $\D_{q_{n}}^{{}^{\prime}*}$ , defining Rare Events Point Processes and explaining what the enriched functional limit theorem in [FFT25] is. In Section 4 we apply this theory to our class of examples. This long section is broken down into the statement of the main theorems, defining our constants and normalisation sequences, the extremal index, checking the dependence requirements, showing the anchored tail process is well defined and finally dealing with the small jumps condition in the case $1<\alpha<2$ , using ideas from [CNT25].
2. A Fractal Maximal Set
We divide $[0,1]$ into interleaved domains $I_{i}$ and $J_{i}$ , on each of which we define linear bijections to $[0,1]$ : the former intervals will define our dynamical attractor.
Let $0≤ a_{0}<a_{1}<·s<a_{2k_{I}-1}≤ 1$ and $I_{i}=[a_{2(i-1)},a_{2i-1}]$ for $i=1,...,k_{I}$ . Next define $J_{1},...,J_{k_{J}}$ be the complementary (open) intervals in order where $k_{J}∈\{k_{I}-1,k_{I},k_{I}+1\}$ . So $k_{J}=k_{I}-1$ if $a_{0}=0$ and $a_{2k_{I}-1}=1$ ; $k_{J}=k_{I}+1$ if $a_{0}>0$ and $a_{2k_{I}-1}<1$ ; and $k_{J}=k_{I}$ in the other two cases. Let $\hat{k}:=k_{I}+k_{J}$ . Writing the $I_{i}$ and $J_{i}$ s together in order, given $1≤ k≤\hat{k}$ it will sometimes be convenient to let $(I,J)_{k}$ be the $k$ -th interval in this list, so either some $I_{i}$ or some $J_{i}$ . Moreover, let $\mathcal{I}_{I}$ be the indices $k$ where $(I,J)_{k}=I_{i}$ for some $i$ and $\mathcal{I}_{J}$ be the indices $k$ where $(I,J)_{k}=J_{i}$ for some $i$ (note $\{1,...,\hat{k}\}=\mathcal{I}_{I}\cup\mathcal{I}_{J}$ ).
Let $g_{i}:I_{i}→[0,1]$ and $h_{j}:J_{j}→(0,1)$ be linear bijections (making the obvious change if $J_{1}$ contains 0 and/or $J_{k_{J}}$ contains 1). We also assume that $|I_{i}|,|J_{i}|≤ 1/2$ so that the derivatives of $g_{i},h_{i}$ are greater than or equal to two. Let $m_{i}$ denote the absolute value of the derivative of $g_{i}$ and let $G:[0,1]→[0,1]$ be such that the restriction of $G$ to $I_{i}$ is $g_{i}$ and to $J_{i}$ is $h_{j}$ .
The collection of the intervals $I_{i}$ and $J_{j}$ , with $i=1,...,k_{I}$ and $j=1,...,k_{J}$ forms a measurable partition of $[0,1]$ that we denote by $\mathcal{P}$ . Then we define by recursion the sequence of partitions $\mathcal{P}^{n}=\bigvee_{j=0}^{n-1}G^{-j}(\mathcal{P})$ , given by the $n$ -th join of $\mathcal{P}$ . The elements of $\mathcal{P}^{n}$ are called $n$ -cylinders and $n$ will be referred to as the depth of the cylinder $\omega∈\mathcal{P}^{n}$ . Note that, since $G$ is linearly expanding and full-branched, if $\omega_{n}(x)$ denotes the element of $\mathcal{P}^{n}$ that contains $x$ then $G^{n}(\omega_{n}(x))=[0,1]$ and $\cap_{n∈\mathbb{N}}\omega_{n}(x)=\{x\}$ .
Let $\Lambda_{0}=[0,1]$ and, for all $n∈\mathbb{N}$ , define
$$
\Lambda_{n}:=\bigcup_{i=1}^{k_{I}}g_{i}^{-1}(\Lambda_{n-1})
$$
and
$$
\Lambda:=\displaystyle\bigcap_{n\in\mathbb{N}}\Lambda_{n}.
$$
So $\Lambda$ can be thought of as a dynamically defined Cantor set.
A natural example of such a system, is for $k=2$ , $I_{1}=\left[0,\dfrac{1}{3}\right]$ , $J_{1}=\left(\dfrac{1}{3},\dfrac{2}{3}\right)$ , $I_{2}=\left[\dfrac{2}{3},1\right]$ and $g_{1}(x)=3x$ , $h_{1}(x)=3x-1$ and $g_{2}(x)=3x-2$ . Then $\Lambda$ here is the ternary/middle- $\frac{1}{3}$ Cantor set.
It is always the case that $([0,1],\mathcal{B}_{[0,1]},\mathfrak{m},G)$ , where $\mathfrak{m}$ denotes the Lebesgue measure on $[0,1]$ , is a probability preserving dynamical system. We will define an observable on this system so that the theory from [FFT25] applies, and whose maximal set is precisely $\Lambda$ . For that, we define a distance on $[0,1]$ as follows.
Let $\Sigma=\Sigma_{\hat{k}}=\{1,...,\hat{k}\}^{\mathbb{N}}$ and let $\sigma:\Sigma→\Sigma$ be the left-shift map. For $x∈[0,1]$ , let $\underline{x}=(x_{1},x_{2}...)∈\Sigma$ be defined by $x_{i}=k∈\{1,...,\hat{k}\}$ if $G^{i-1}(x)∈(I,J)_{k}$ . Observe that $x∈\Lambda\iff\underline{x}=(x_{1},x_{2},...)$ where $x_{i}∈\mathcal{I}_{I}$ for all $i∈\mathbb{N}$ .
**Definition 2.1**
*Let $\underline{x}=(x_{1},x_{2},...)∈\Sigma$ . Let $\mathcal{I}_{x}=\{i∈\mathbb{N}:x_{i}∈\mathcal{I}_{J}\}$ . Let
$$
\psi(x)=\sum_{i\in\mathcal{I}_{x}}2^{-i}.
$$*
$\psi$ provides us with a notion of distance to $\Lambda$ such that the earlier a point exits the dynamical construction of $\Lambda$ (in the sense of (2.1)) the bigger the corresponding distance to $\Lambda$ . We will also view $\psi$ is a $[0,1]$ -valued random variable defined on $([0,1],\mathcal{B}[0,1],\mathfrak{m})$ .
<details>
<summary>x1.png Details</summary>

### Visual Description
## Sierpinski Triangle Plot
### Overview
The image depicts a Sierpinski triangle, a fractal pattern, plotted on a coordinate plane. The triangle is formed by recursively dividing an equilateral triangle into smaller equilateral triangles. The plot shows the fractal structure within a unit square.
### Components/Axes
* **X-axis:** Horizontal axis ranging from 0.0 to 1.0, with tick marks at intervals of 0.2.
* **Y-axis:** Vertical axis ranging from 0.0 to 1.0, with tick marks at intervals of 0.2.
* **Fractal Structure:** The Sierpinski triangle is composed of smaller, self-similar triangles arranged in a hierarchical pattern. The overall structure is a large triangle, with an inverted triangle removed from its center, leaving three smaller triangles. Each of these smaller triangles is further subdivided in the same manner.
### Detailed Analysis
The Sierpinski triangle is plotted within the unit square defined by the x and y axes. The vertices of the largest triangle are approximately at the following coordinates:
* Bottom-left: (0.0, 0.0)
* Top-center: (0.5, ~0.87)
* Bottom-right: (1.0, 0.0)
The first level of subdivision creates three smaller triangles, with vertices approximately at:
* Left Triangle: Centered around (0.25, 0.43)
* Top Triangle: Centered around (0.5, 0.87)
* Right Triangle: Centered around (0.75, 0.43)
Each of these smaller triangles is further subdivided, creating a more intricate fractal pattern.
### Key Observations
* **Self-Similarity:** The Sierpinski triangle exhibits self-similarity, meaning that the same pattern is repeated at different scales.
* **Fractal Dimension:** The Sierpinski triangle has a fractal dimension, which is a non-integer value that describes its space-filling properties.
* **Recursive Construction:** The triangle is constructed through a recursive process of subdivision and removal.
### Interpretation
The plot visually represents the mathematical concept of a Sierpinski triangle, demonstrating its fractal nature and recursive construction. The arrangement of triangles within the unit square highlights the self-similar properties of the fractal. The plot serves as a visual aid for understanding the geometric properties of this mathematical object.
</details>
Figure 1. Graph of the function $\psi$ for the ternary Cantor set.
**Lemma 2.2**
*1. $\psi(x)=0\iff x∈\Lambda$ ;
1. $\psi(x)<2^{-n}\iff x∈\Lambda_{n}\setminus\{p_{1},...,p_{k_{I}^{n}}\}$ , where
$$
\{p_{1},\dots,p_{k_{I}^{n}}\}=\left\{x\in[0,1]:x_{i}\in\mathcal{I}_{I}\text{ for all }i=1,\ldots n,\text{ and }x_{i}\in\mathcal{I}_{J}\text{ for }i\geq n+1\right\};
$$
1. for $F:[0,1]→[0,1]$ the distribution function of $\psi$ ,
1. $F$ is strictly increasing on $[0,1]$ ;
1. $F(2^{-n})=|\Lambda_{n}|=\lambda^{n}$ for $\lambda:=\left|\cup_{i=1}^{\mathcal{I}_{I}}I_{i}\right|=\sum_{i=1}^{\mathcal{I}_{I}}\frac{1}{m_{i}}$ ;
1. we can view $F\circ\psi$ as a uniformly distributed random variable.*
* Proof*
The proofs of (a) and (b) are immediate. For (c) the strict increasing property follows from the definition of $\psi$ and (ii) follows from (b). (iii) is a standard fact in probability, which results from observing that $\mathbb{P}\left(F\circ\psi≤ z\right)=\mathbb{P}\left(\psi≤ F^{-1}(z)\right)=F(F^{-1}(z))=z.$ ∎
<details>
<summary>x2.png Details</summary>

### Visual Description
## Line Chart: Cantor Function
### Overview
The image is a line chart depicting the Cantor function, also known as the Devil's Staircase. The chart shows a non-decreasing function that increases from 0 to 1, but remains constant on intervals whose total length is 1. The x-axis ranges from 0 to 1, and the y-axis also ranges from 0 to 1. The function is represented by a blue line.
### Components/Axes
* **X-axis:** Ranges from 0 to 1, with markers at 0.2, 0.4, 0.6, 0.8, and 1.0.
* **Y-axis:** Ranges from 0 to 1, with markers at 0.2, 0.4, 0.6, 0.8, and 1.0.
* **Data Series:** A single data series represented by a blue line.
### Detailed Analysis
The blue line representing the Cantor function starts at (0, 0) and ends at (1, 1). The function increases in steps, with horizontal segments in between.
* From x = 0 to approximately x = 0.33, the function increases from 0 to approximately 0.33.
* From x = 0.33 to approximately x = 0.66, the function remains constant at approximately 0.33.
* From x = 0.66 to approximately x = 1, the function increases from approximately 0.33 to 1.
More specifically, the function exhibits a self-similar pattern. It increases from 0 to 1/3, remains constant at 1/3 for a certain interval, then increases from 1/3 to 2/3, remains constant at 2/3 for a certain interval, and finally increases from 2/3 to 1.
### Key Observations
* The function is continuous everywhere but not absolutely continuous.
* The function is constant almost everywhere.
* The function is self-similar.
### Interpretation
The Cantor function is a classic example of a function that is continuous but not absolutely continuous. It demonstrates that a function can increase from 0 to 1 without having a derivative that is positive on a set of positive measure. The function is also an example of a fractal, as it exhibits self-similarity at different scales. The "flat" sections of the curve represent intervals where the function's derivative is zero, contributing to its peculiar properties.
</details>
Figure 2. Graph of the the distribution function of $\psi$ for the ternary Cantor set.
We may now define our observable.
**Definition 2.3**
*For $\alpha∈(0,2)$ , let
$$
\varphi_{\alpha}(x):=(F\circ\psi(x))^{-\frac{1}{\alpha}}.
$$
where $F:[0,1]→[0,1]$ is the distribution function of $\psi$ . Moreover, define
$$
X_{n}:=\varphi_{\alpha}\circ G^{n}
$$
for all $n∈\mathbb{N}_{0}$ .*
Note that since $G$ is $\mathfrak{m}$ -invariant, $X_{n}$ is a stationary process.
**Lemma 2.4**
*1. $\phi_{\alpha}(x)=∞$ iff $x∈\Lambda$ ;
1. if $x∈\Lambda_{n}\setminus\{p_{1},...,p_{k_{I}^{n}}\}$ and $y∈\Lambda_{n+1}$ then $\phi_{\alpha}(x)<\phi_{\alpha}(y)$ ;
1. $\{X_{n}>u\}=\{\psi\circ G^{n}<F^{-1}(u^{-\alpha})\}$ ;
1. if $u^{\prime}>u$ then $\mathfrak{m}\{X_{n}>u^{\prime}\}<\mathfrak{m}\{X_{n}>u\}$ .*
* Proof*
Parts (a), (b) and (c) are immediate from the definition. Part (d) then follows from part (b) along with Lemma 2.2 (c). ∎
3. General theory of functional limit theorems
In this section we present a simplified version of the theory in [FFT25] which, to fit with our application outlined above, deals with real non-negative random variables $X_{0},X_{1},...$ distributed according to $\mathbb{P}$ , defined via a dynamical system $f:[0,1]→[0,1]$ with invariant probability measure $\mu$ and an observable $\phi:[0,1]→[0,∞]$ with $X_{n}=\phi\circ f^{n-1}$ (hence $\mathbb{P}(X>u)=\mu(\phi>u)$ , but we will also write this as $\mu(X>u)$ ).
We define sequences $(k_{n})_{n∈\mathbb{N}}$ , $(r_{n})_{n∈\mathbb{N}}$ , $(t_{n})_{n∈\mathbb{N}}$ , $(q_{n})_{n}$ , which we assume to be such that
$$
k_{n},r_{n},t_{n}\xrightarrow[n\to\infty]{}\infty\quad\text{ where }k_{n}\cdot t_{n}=o(n),k_{n}\cdot r_{n}\sim n\text{ and }q_{n}=o(r_{n}).
$$
As in [FFT25, (3.2), (3.3)], these relate to a blocking argument with $k_{n}$ blocks of size $\lfloor n/k_{n}\rfloor$ , where $t_{n}$ is a time gap inserted to achieve asymptotic independence of the blocks, while $q_{n}$ is the clustering run length, i.e., all abnormal observations occurring within a time difference of at most $q_{n}$ units between each other belong to the same cluster. In most dynamical applications $q_{n}=q$ for all $n∈\mathbb{N}$ , such as when the observable is maximised at a single periodic point of period $p$ , in which case $q_{n}=p$ (see [FFT12]). In fact, we will see that, in this paper, for our applications we only require $q_{n}=1$ . We assume for each $\tau≥ 0$ there is $(u_{n}(\tau))_{n}$ such that
$$
\lim_{n\to\infty}n\mathbb{P}(X_{0}>u_{n}(\tau))=\tau,\lim_{\tau_{1}\to 0,\tau_{2}\to\infty}\mathbb{P}\left(u_{n}(\tau_{1})<X_{0}<u_{n}(\tau_{2})\right)=1
$$
and that $u_{n}:[0,∞]→[0,∞]$ is bijective.
We say that $X_{0}$ has $\alpha$ -regularly varying tails if there is a sequence $(\mathfrak{a}_{n})_{n}$ such that for any $y>0$ ,
$$
\lim_{n\to\infty}n\mathbb{P}(X_{0}>y\mathfrak{a}_{n})=y^{-\alpha}.
$$
Define
$$
U_{n}(\tau):=\{X_{0}>u_{n}(\tau)\}.
$$
3.1. Sequence spaces and the anchored tail process
For $i<j∈\{0,...,n\}$ we will use the notation
$$
\mathbb{X}_{n}^{i,j}:=\left(u_{n}^{-1}(X_{i}),\ldots,u_{n}^{-1}(X_{j-1})\right),\quad\mathbb{X}_{n,i}:=\mathbb{X}_{n}^{(i-1)r_{n},ir_{n}}.
$$
Define
| | $\displaystyle l_{∞}^{+}$ | $\displaystyle:=\left\{\mathbf{x}=(x_{j})_{j}∈(0,∞]^{\mathbb{Z}}\colon\;\lim_{|j|→∞}x_{j}=∞\right\}.$ | |
| --- | --- | --- | --- |
Note that we can embed $\cup_{n∈\mathbb{N}}(0,∞]^{n}$ into $l_{∞}^{+}$ by adding a sequence of $∞$ before and after the $n$ entrances of any element of $(0,∞]^{n}$ For example, $\mathbb{X}_{n}^{i,j}$ can be seen as an element of $l_{∞}^{+}$ by identifying it with
$$
\left(\ldots,\infty,\infty,u_{n}^{-1}(X_{i}),\ldots,{u_{n}^{-1}(X_{j-1})},\infty,\infty,\ldots\right).
$$
We can define the left shift $\sigma$ on this space, which moves every element one place to the left. We define the quotient space $\tilde{l}_{∞}^{+}=l_{∞}^{+}/{\sim}$ where $\sim$ is the equivalence relation defined by $\mathbf{x}\sim\mathbf{y}$ if and only if there exists $k∈\mathbb{Z}$ such that $\sigma^{k}(\mathbf{x})=\mathbf{y}$ . Also let $\tilde{\pi}$ denote the natural projection from $l_{∞}^{+}$ to $\tilde{l}_{∞}^{+}$ , which assigns to each element $\mathbf{x}$ of $l_{∞}^{+}$ the corresponding equivalence class $\tilde{\pi}(\mathbf{x})=\tilde{\mathbf{x}}$ in $\tilde{l}_{∞}^{+}$ . Given any vector $v$ of $(0,∞]^{m}$ , for some $m∈\mathbb{N}$ , we write $\tilde{\pi}(v)$ for the projection of the natural embedding of $v$ into $l_{∞}^{+}$ to the quotient space $\tilde{l}_{∞}^{+}$ . Namely,
$$
\tilde{\pi}(\mathbb{X}_{n}^{i,j})=\tilde{\pi}\left(\left(\ldots,\infty,\infty,u_{n}^{-1}(X_{i}),\ldots,{u_{n}^{-1}(X_{j-1})},\infty,\infty,\ldots\right)\right).
$$
We let $\tilde{∞}∈\tilde{l}_{∞}^{+}$ be the sequence where all terms are $∞$ .
We will assume in the theory, and prove in practice, the existence of a process $(Y_{j})_{j∈\mathbb{Z}}∈ l_{∞}^{+}$ satisfying the following assumptions:
1. $\mathcal{L}\left(\frac{1}{\tau}\mathbb{X}_{n}^{r_{n}+s,r_{n}+t}\;\middle|\;X_{r_{n}}>u_{n}(\tau)\right)\xrightarrow[n→∞]{}\mathcal{L}\left((Y_{j})_{j=s,...,t}\right),$ for all $s<t∈\mathbb{Z}$ and all $\tau>0$ ;
1. the process $(\Theta_{j})_{j∈\mathbb{Z}}$ given by $\Theta_{j}=\frac{Y_{j}}{Y_{0}}$ is independent of $Y_{0}$ ;
1. $\lim_{|j|→∞}Y_{j}=∞$ a.s.;
1. $\mathbb{P}\left(∈f_{j≤-1}Y_{j}≥ 1\right)>0$ .
Here $(r_{n})_{n}$ is assumed to satisfy (3.1): in our applications it is the sequence appearing in $\D_{q_{n}}^{*}$ and $\D^{{}^{\prime}*}_{q_{n}}$ below. We call this the transformed tail process.
We can then define:
**Definition 3.1**
*Assuming the existence of a sequence $(Y_{j})_{j∈\mathbb{Z}}$ satisfying conditions (1)–(4), we define the transformed anchored tail process $(Z_{j})_{j∈\mathbb{Z}}$ as a sequence of random numbers satisfying
$$
\mathcal{L}\left((Z_{j})_{j\in\mathbb{Z}}\right)=\mathcal{L}\left((Y_{j})_{j\in\mathbb{Z}}\;\middle|\;\inf_{j\leq-1}Y_{j}\geq 1\right).
$$*
The anchor $Z_{0}$ was chosen so that it marks the beginning of a new cluster. We consider a polar decomposition of the transformed anchored tail process by defining the random variable $L_{Z}$ and the process $(Q_{j})_{j∈\mathbb{Z}}$ by
$$
L_{Z}:=\inf_{j\in\mathbb{Z}}Z_{j}\qquad Q_{j}:=\frac{Z_{j}}{L_{Z}}.
$$
3.2. The $\D_{q_{n}}^{*}$ and $\D_{q_{n}}^{{}^{\prime}*}$ conditions and the extremal index
For $B⊂[0,1]$ , $J$ a subset of $[0,∞)$ and $q∈\mathbb{N}$ define
$$
\mathscr{W}_{J}(B):=\bigcap_{i\in J\cap\mathbb{N}_{0}}f^{-i}(B^{c}),\ \mathscr{W}_{J}^{c}(B):=(\mathscr{W}_{J}(B))^{c}=\bigcup_{i\in J\cap\mathbb{N}_{0}}f^{-i}(B),
$$
and
$$
B^{(q)}:=B\cap\bigcap_{i=1}^{q}f^{-i}(B^{c}),\qquad B^{(0)}:=B. \tag{0}
$$
Now let $E$ be an element of the ring generated by the half open intervals of $[0,1]$ so that $E=\cup_{\ell=1}^{N}J_{\ell}=\cup_{\ell=1}^{N}[a_{\ell},b_{\ell})$ and for each $\ell=1,...,N$ , let $A_{\ell}$ be an element of the ring generated by the half open intervals of $[0,∞)$ so that $A_{\ell}=\cup_{s=1}^{n_{\ell}}[{\tau_{s}^{\prime\prime}},\tau^{\prime}_{s})$ for some $n_{\ell}∈\mathbb{N}$ and some $\tau_{1}^{\prime\prime}<\tau_{1}^{\prime}<\tau_{2}^{\prime\prime}<·s<\tau_{n_{\ell}}^{\prime\prime}<\tau_{n_{\ell}}^{\prime}$ .
Given $E$ and $A_{\ell}$ as above, define
$$
J_{n,\ell}:=k_{n}r_{n}J_{\ell}:=[k_{n}r_{n}a_{\ell},k_{n}r_{n}b_{\ell}),\quad A_{n,\ell}:=\bigcup_{s=1}^{n_{\ell}}\left\{u_{n}^{-1}(X_{0})\in[\tau^{\prime\prime}_{s},\tau^{\prime}_{s})\right\}=\bigcup_{s=1}^{n_{\ell}}U_{n}(\tau^{\prime}_{s})\setminus U_{n}(\tau_{s}^{\prime\prime})
$$
Condition $\D_{q_{n}}^{*}$ . We say that $\D_{q_{n}}^{*}$ holds for the sequence ${X}_{0},{X}_{1},...$ if there exist sequences $(k_{n})_{n∈\mathbb{N}},(r_{n})_{n∈\mathbb{N}},(t_{n})_{n∈\mathbb{N}}$ and $(q_{n})_{n∈\mathbb{N}}$ as defined in (3.1), such that for every $N,t,n∈\mathbb{N}$ and every $J_{\ell},A_{\ell}$ as above, for $\ell=1,...,N$ , we have
$$
\left\lvert\mathbb{P}\left(A_{n,\ell}^{(q_{n})}\cap\bigcap_{i=\ell}^{N}\mathscr{W}_{J_{n,i}}\left(A_{n,i}^{(q_{n})}\right)\right)-\mathbb{P}\left(A_{n,\ell}^{(q_{n})}\right)\mathbb{P}\left(\bigcap_{i=\ell}^{N}\mathscr{W}_{J_{n,i}}\left(A_{n,i}^{(q_{n})}\right)\right)\right\rvert\leq\gamma(n,t_{n})
$$
where $\min\{J_{n,\ell}\cap\mathbb{N}_{0}\}≥ t_{n}+q_{n}$ and $\lim_{n→∞}n\gamma(n,t_{n})=0$ .
Condition $\D_{q_{n}}^{{}^{\prime}*}$ . We say that $\D_{q_{n}}^{{}^{\prime}*}$ holds for the sequence ${X}_{0},{X}_{1},...$ if there exist sequences $(k_{n})_{n∈\mathbb{N}},(r_{n})_{n∈\mathbb{N}},(t_{n})_{n∈\mathbb{N}}$ and $(q_{n})_{n∈\mathbb{N}}$ as defined in (3.1), such that for every $A_{1}=\cup_{s=1}^{N_{1}}[{\tau^{\prime\prime}}_{s},\tau^{\prime}_{s})$ ,
$$
\lim_{n\to\infty}n\mathbb{P}\left(A_{n,1}^{(q_{n})}\cap\mathscr{W}^{c}_{[q_{n}+1,r_{n})}\left(A_{n,1}\right)\right)=0.
$$
**Remark 3.2**
*Note that if $(q_{n})_{n∈\mathbb{N}}$ and $(\tilde{q}_{n})_{n∈\mathbb{N}}$ are two sequences satisfying (3.1) and $q_{n}≤\tilde{q}_{n}$ , for all $n∈\mathbb{N}$ . Then $\D^{\prime}_{q_{n}}$ implies $\D^{\prime}_{\tilde{q}_{n}}$ . As observed in [AFF20, Remark 2.11], one should try first to find the smallest constant sequence $q_{n}=q$ that makes $\D_{q_{n}}^{{}^{\prime}*}$ hold, establishing, in this way, the run length. We will see that here we can actually take $q_{n}=1$ for all $n∈\mathbb{N}$ .*
If, for all $\tau>0$ , the following limit exists:
$$
\theta=\lim_{n\to\infty}\frac{\mathbb{P}(U_{n}^{(q_{n})}(\tau))}{\mathbb{P}(U_{n}(\tau))},
$$
then we call $\theta$ the extremal index.
3.3. Complete convergence of the Rare Events Point Processes
As mentioned above, at the core of all convergence results stated and mentioned in this paper is the convergence of the so-called Rare Events Point Processes (REPP), which are spatio-temporal clustering processes that keep record of the cluster profiles and their time occurrences:
$$
N_{n}=\sum_{i=1}^{\infty}\delta_{\left(i/k_{n},\tilde{\pi}(\mathbb{X}_{n,i})\right)}.
$$
The complete convergence of these REPP follows from [FFT25, Theorem 3.18], which assuming that the transformed anchored tail process exists and is well defined (see Definition 3.1) and that conditions $\D_{q_{n}}^{*}$ and $\D_{q_{n}}^{*^{\prime}}$ are satisfied, states that $N_{n}$ given in (3.7) converges weakly in the space of boundedly finite point measures on $\mathbb{R}_{0}^{+}×\tilde{l}_{∞}^{+}\setminus\{\tilde{∞}\}$ with weak # topology (see [FFT25, Appendix C] for details) to the Poisson process with decorations, which can be written as:
$$
N=\sum_{i=1}^{\infty}\delta_{(T_{i},U_{i}\mathbf{\tilde{Q}}_{i})},
$$
where $T_{i}$ and $U_{i}$ are independently (of $i$ and of each other) distributed according to $Leb$ and $\theta Leb$ respectively, for $\theta$ as in (3.6), and $(\mathbf{\tilde{Q}}_{i})_{i∈\mathbb{N}}$ is an i.i.d sequence such that each $\mathbf{\tilde{Q}}_{i}∈\tilde{l}_{∞}^{+}\setminus\{\tilde{∞}\}$ has the same distribution as $\tilde{\pi}((Q_{j})_{j∈\mathbb{Z}})$ , where the bi-infinite sequence $(Q_{j})_{j}$ is as in (3.4).
3.4. Enriched functional limit theorem
For processes $X_{0},X_{1},...$ with $\alpha$ -regularly varying tails, for which we can prove the convergence of the REPP defined above, applying [FFT25, Theorem 2.5], we obtain an enriched functional limit theorem (FLT) for the normalised sums:
$$
S_{n}(t)=\sum_{i=0}^{\lfloor nt\rfloor-1}\dfrac{1}{\mathfrak{a}_{n}}X_{i}-tc_{n},\ t\in[0,1],
$$
where the sequence $(c_{n})_{n∈\mathbb{N}}$ is such that $c_{n}=0$ if $0<\alpha<1$ and
$$
c_{n}=\frac{n}{\mathfrak{a}_{n}}\mathbb{E}\left(X_{0}{\mathbbm{1}}_{|X_{0}|\leq\mathfrak{a}_{n}}\right),\qquad\text{for }1<\alpha<2.
$$
In order to guarantee the existence of a limit for the cà dà g function $S_{n}(t)$ , which sometimes is not possible with any of the Skorohod metrics in $D([0,1])$ , the space of cà dà g functions on $[0,1]$ , and to keep the information regarding the excursions performed by finite time process during a cluster of abnormal observations that gets collapsed on the same discontinuity of the limit process, in [FFT25], the authors introduced the enriched space $F^{\prime}([0,1])$ as the space of triplets $(v,S^{v},\{e_{v}^{s}\}_{s∈ S^{v}})$ , where $v$ is the main cà dà g function, $S^{v}$ a set containing all its discontinuities and $e_{v}^{s}$ a decorating excursion associated to each element of $S^{v}$ . This excursion is an element of $\tilde{D}([0,1])$ , the equivalence space of cà dà g functions on $[0,1]$ , where two elements are identified if there is a strictly continuous time reparametrisation that sends one into another. We refer to [FFT25, Section 2.3.1] for the details about this space, the corresponding metric and notions of convergence there.
From [FFT25, Theorem 2.5], in the case $0<\alpha<1$ , we obtain that the cà dà g function $S_{n}(t)$ embedded in the space $F^{\prime}$ converges to $(V,\text{disc}(V),(e_{V}^{s})_{s∈\text{disc}(V)})$ , where $V$ is an $\alpha$ -stable Lévy process on $[0,1]$ , which can be written as:
$$
V(t)=\sum_{T_{i}\leq t}\sum_{j\in\mathbb{Z}}U_{i}^{-\frac{1}{\alpha}}\mathcal{Q}_{i,j}
$$
and the excursions are given by
$$
e_{V}^{T_{i}}(t)=V(T_{i}^{-})+U_{i}^{-\frac{1}{\alpha}}\displaystyle\sum_{j\leq\lfloor\tan(\pi(t-\frac{1}{2}))\rfloor}\mathcal{Q}_{i,j},\ t\in[0,1]
$$
where $\mathcal{Q}_{i,j}=\mathbf{\tilde{Q}}_{i,j}^{-\frac{1}{\alpha}}$ (zero if $\mathbf{\tilde{Q}}_{i,j}=∞$ ), with $\mathbf{\tilde{Q}}_{i}=\left(\mathbf{\tilde{Q}}_{i,j}\right)_{j}$ , $T_{i}$ and $U_{i}$ as in $N$ in (3.8).
In the case $1<\alpha<2$ , we need the next two extra conditions to hold. Namely, for all $\delta>0$
$$
\lim_{\varepsilon\to 0}\limsup_{n\to\infty}\mathbb{P}\left(\max_{1\leq k\leq n}\left\|\sum_{j=1}^{k}\left(X_{j}{\mathbbm{1}}_{|X_{j}|\leq\varepsilon a_{n}}\right)-\mathbb{E}\left(X_{j}{\mathbbm{1}}_{|X_{j}|\leq\varepsilon a_{n}}\right)\right\|\geq\delta a_{n}\right)=0
$$
and, moreover,
$$
\mathbb{E}\left(\left(\sum_{j\in\mathbb{Z}}\|\mathcal{Q}_{j})\|\right)^{\alpha}\right)<\infty.
$$
Then, under these two assumptions as well, $S_{n}(t)$ converges to $(V,\text{disc}(V),(e_{V}^{s})_{s∈\text{disc}(V)})$ in $F^{\prime}$ , where $V$ has the more complicated expression:
$$
\displaystyle V(t) \displaystyle=\lim_{\varepsilon\to 0}\Bigg(\sum_{T_{i}\leq t}\sum_{j\in\mathbb{Z}}U_{i}^{-\frac{1}{\alpha}}\mathcal{Q}_{i,j}{\mathbbm{1}}_{\{|U_{i}^{-\frac{1}{\alpha}}\mathcal{Q}_{i,j}|>\varepsilon\}} \displaystyle\hskip 128.0374pt-t\theta\int_{0}^{+\infty}\mathbb{E}\Bigg(y\sum_{j\in\mathbb{Z}}\mathcal{Q}_{j}{\mathbbm{1}}_{\{\varepsilon<y|\mathcal{Q}_{j}|\leq 1\}}\Bigg)d(-y^{-\alpha})\Bigg),
$$
while $e_{V}^{T_{i}}(t)$ is again as in (3.11).
4. Application of the theory to our examples
We apply the theory described above to the setting described in Section 2. In particular, we have now all the tools and notation introduced in order to state our main results.
4.1. Statement of main results
We begin with the convergence of the REPP.
**Theorem 4.1**
*Let $G$ be as defined in Section 2, let $\varphi_{\alpha}$ be as in Definition 2.3 and $X_{0},X_{1},...$ , as in (2.5). We consider the REPP
$$
N_{n}=\sum_{i=1}^{\infty}\delta_{\left(\frac{i}{k_{n}},\tilde{\pi}\left(\left(n(X_{(i-1)r_{n}})^{-\alpha},n(X_{(i-1)r_{n}+1})^{-\alpha},\ldots,n(X_{ir_{n}-1})^{-\alpha}\right)\right)\right)},
$$
where the sequences $(k_{n})_{n},(r_{n})_{n}$ are as described in (3.1). Then $N_{n}$ converges in the weak # topology to a process $N$ which can be written as in (3.8), where $Q_{j}=(1-\theta)^{-j}$ for all $j≥ 0$ , $Q_{j}=∞$ for all $j≤-1$ and
$$
\theta=1-\displaystyle\sum_{i=1}^{\mathcal{I}_{I}}\dfrac{1}{m_{i}}=1-\lambda.
$$*
From the convergence of the REPP and [FFT25, Theorem 2.5], we obtain the following enriched functional limit theorem.
**Theorem 4.2**
*Under the same assumptions of the previous theorem, we consider the heavy tailed sums:
$$
S_{n}(t)=\sum_{i=0}^{\lfloor nt\rfloor-1}\dfrac{1}{n^{\frac{1}{\alpha}}}X_{i}-tc_{n},\ t\in[0,1],
$$
where $c_{n}=0$ , when $0<\alpha<1$ and is given by (3.9), when $1<\alpha<2$ . Then, the process $S_{n}(t)$ converges in $F^{\prime}$ to $(V,\text{disc}(V),(e_{V}^{s})_{s∈\text{disc}(V)})$ , where $V$ is as described in (3.10), when $0<\alpha<1$ , or as in (3.14), when $1<\alpha<2$ , and the excursions can be written as
$$
e_{V}^{T_{i}}(t)=V(T_{i}^{-})+U_{i}^{-\frac{1}{\alpha}}\sum_{0\leq j\leq\lfloor\tan(\pi t/2)\rfloor}(1-\theta)^{\frac{j}{\alpha}},\ t\in[0,1].
$$*
**Remark 4.3**
*In light of [FFRS20], for $\mathcal{M}$ equal to the middle- $\frac{1}{3}$ Cantor set then $\theta<1$ if and only if, among the class of maps $mx\mod 1$ , $m≥ 2$ , we restrict to $m=3^{k}$ , $k∈\mathbb{N}$ . Hence, an extremal index strictly less than $1$ is obtained if and only if the dynamics is somehow compatible with the construction of the middle- $\frac{1}{3}$ Cantor set. We proceed with the computation of the extremal index in the greater generality of $\Lambda$ dynamically defined by $G$ and for our observable $\varphi_{\alpha}$ which is different from the one used in [FFRS20].*
The rest of this section is dedicated to the proof of Theorem 4.1, which essentially means checking that we have the right normalisations, that all conditions of [FFT25, Theorem 3.18], [FFT25, Theorem 2.5] described in Sections 3.3 and 3.4 hold, and computing the extremal index and the transformed anchored tail process.
4.2. Tuning constants and normalising sequences
**Lemma 4.4**
*$X_{n}$ have $\alpha$ -regularly varying tails.*
* Proof*
We wish to find $(\mathfrak{a}_{n})_{n∈\mathbb{N}}$ such that $\lim_{n→∞}n\mathbb{P}(X_{0}>y\mathfrak{a}_{n})=y^{-\alpha}.$ Since from Lemma 2.4,
$$
\{X_{0}>y\mathfrak{a}_{n}\}=\{F\circ\psi<y^{-\alpha}\mathfrak{a}_{n}^{-\alpha}\},
$$
it follows that
$$
\mathbb{P}(X_{0}>y\mathfrak{a}_{n})=\mathbb{P}(F\circ\psi<y^{-\alpha}\mathfrak{a}_{n}^{-\alpha})=y^{-\alpha}\mathfrak{a}_{n}^{-\alpha}.
$$
Hence we conclude that $\mathfrak{a}_{n}=n^{\frac{1}{\alpha}}$ for all $n∈\mathbb{N}$ . ∎
In particular, this lemma means for each $\tau>0$ we can define $(u_{n}(\tau))_{n}$ so that
$$
\lim_{n\to\infty}n\mathbb{P}(X_{0}>u_{n}(\tau))=\tau,\text{ so }u_{n}(\tau)=\left(\frac{n}{\tau}\right)^{\frac{1}{\alpha}}\quad\text{and}\quad u_{n}^{-1}(z)=nz^{-\alpha}.
$$
We next give a geometric description of this set. By Lemma 2.2 and (4.1) there are $a_{i}=a_{i}(n,\tau)∈\{0,1\}$ so that,
$$
F^{-1}((u_{n}(\tau))^{-\alpha})=F^{-1}\left(\frac{\tau}{n}\right)=\sum_{i\in\mathbb{N}}a_{i}2^{-i}=\sum_{j\in\mathcal{J}_{n,\tau}}2^{-j}=:u_{n,\tau}
$$
where $\mathcal{J}_{n,\tau}=\{i∈\mathbb{N}:a_{i}=1\}$ .
Let
$$
j_{n,\tau}=j_{n,\tau}^{1}:=\min\{j:j\in\mathcal{J}_{n,\tau}\}\;\text{and}\;j_{n,\tau}^{\ell+1}=\min\{j>j_{n,\tau}^{\ell}\colon j\in\mathcal{J}_{n,\tau}\},\;\text{for all}\;\ell\in\mathbb{N}.
$$
If, for some $\ell>1$ , we have $a_{i}=0$ for all $i≥ j_{n,r}^{\ell}$ , then we set $j_{n,r}^{s}=∞$ for all $s≥\ell$ . Note that Lemma 2.2 (c)(ii) implies that $j_{n,\tau}$ grows logarithmically in $n$ (i.e., , $\limsup_{n→∞}\frac{j_{n,\tau}}{\log n}<∞$ ). Moreover, by Lemma 2.4,
$$
\begin{split}U_{n}(\tau)=\{X_{0}>u_{n}(\tau)\}=\{\psi<u_{n,\tau}\}=\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}\end{split}
$$
where $H^{(j_{n,\tau})}$ denotes a subset of $\Lambda_{j_{n,\tau}-1}\setminus\Lambda_{j_{n,\tau}}$ .
**Remark 4.5**
*Recall Lemma 2.2 (b) to justify the appearance of $\Lambda_{j_{n,\tau}}$ . Observe that $u_{n,\tau}=2^{-j_{n,\tau}}+\delta$ with $\delta≤ 2^{-j_{n,\tau}}$ , so, in particular, $H^{(j_{n,\tau})}=\emptyset$ if $\delta=0$ (which means that $j_{n,\tau}^{\ell}=∞$ for all $\ell≥ 2$ ) and $H^{(j_{n,\tau})}=\Lambda_{j_{n,\tau}-1}\setminus(\Lambda_{j_{n,\tau}}\cup\{p_{1},...,p_{k_{I}^{j_{n,\tau}}}\})$ , if $\delta=2^{-j_{n,\tau}}$ and where the $p_{i}$ s are as in Lemma 2.2. Now, $H^{(j_{n,\tau})}$ can be further decomposed into
$$
H^{(j_{n,\tau})}=\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}}\cup H^{(j_{n,\tau}^{2})},
$$
where $\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}}$ is homothetic to $\Lambda_{j_{n,\tau}^{2}-j_{n,\tau}}$ , which was rescaled to fit the corresponding connected components of $\Lambda_{j_{n,\tau}-1}\setminus\Lambda_{j_{n,\tau}}$ , and $H^{(j_{n,\tau}^{2})}$ is a subset of $\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}}\setminus\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}-1}$ . Note that $\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}}$ can be written as a union of $j_{n,\tau}^{2}$ -cylinders. This procedure can be repeated to obtain approximations of $H^{(j_{n,\tau})}$ by unions of cylinders of greater and greater depth.*
4.3. Extremal Index
Recall that
$$
\begin{split}U_{n}^{(q)}(\tau)&=\{X_{0}(x)>u_{n}(\tau),X_{1}\leq u_{n}(\tau),\dots,X_{q}\leq u_{n}(\tau)\}\\
&=\{\psi<u_{n,\tau},\psi\circ G\geq u_{n,\tau},\dots,\psi\circ G^{q}\geq u_{n,\tau}\}.\end{split}
$$
Observe that for $n$ sufficiently large, we have
$$
\begin{split}U_{n}^{(1)}(\tau)&=\{\psi<u_{n,\tau},\psi\circ G\geq u_{n,\tau}\}\\
&=\left(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}\right)\cap G^{-1}([0,1]\setminus(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}))\\
&=H^{(j_{n,\tau})}\cup(\Lambda_{j_{n,\tau}}\cap G^{-1}((\Lambda_{j_{n,\tau}-1}\setminus\Lambda_{j_{n,\tau}})\setminus H^{(j_{n,\tau})}))\end{split} \tag{1}
$$
Now, observing that the points of $U_{n}^{(1)}(\tau)$ are sent by $G$ into the holes of smaller and smaller generation in the construction of $\Lambda$ , it follows that $U_{n}^{(1)}(\tau)=U_{n}^{(2)}(\tau)=U_{n}^{(3)}(\tau)=...$ . In light of Remark 3.2, this means that $q_{n}=1$ is then a natural choice to prove condition $\D_{q_{n}}^{{}^{\prime}*}$ , which is indeed established in Section 4.4.
The extremal index $\theta$ can be written as
$$
\theta=\lim_{n\to\infty}\dfrac{\mathfrak{m}(U_{n}^{(1)}(\tau))}{\mathfrak{m}(U_{n}(\tau))} \tag{1}
$$
provided the limit exists.
**Proposition 4.6**
*For $G$ as defined in Section 2 and observable $\varphi_{\alpha}$ as given in Definition 2.3, the extremal index is $\theta=1-\displaystyle\sum_{i=1}^{\mathcal{I}_{I}}\dfrac{1}{m_{i}}=1-\lambda$ .*
* Proof*
Let $u_{n,\tau}∈[2^{-j},2^{-j+1})$ . Then, $j_{n,\tau}=j$ and, by (4.3), $U_{n}(\tau)=\Lambda_{j}\cup H^{(j)}$ so that
$$
\mathfrak{m}(U_{n}(\tau))=\mathfrak{m}(\Lambda_{j})+\kappa\mathfrak{m}(\Lambda_{j-1}\setminus\Lambda_{j})
$$
where $\kappa∈[0,1]$ since $H^{(j)}$ is a fraction of $\Lambda_{j-1}\setminus\Lambda_{j}$ . Now, by (4.4),
$$
U_{n}^{(1)}(\tau)=H^{(j)}\cup(\Lambda_{j}\cap G^{-1}((\Lambda_{j-1}\setminus\Lambda_{j})\setminus H^{(j)})) \tag{1}
$$
and we have
$$
\mathfrak{m}(U_{n}^{(1)}(\tau))=\kappa\mathfrak{m}(\Lambda_{j-1}\setminus\Lambda_{j})+(1-\kappa)\mathfrak{m}(\Lambda_{j}\setminus\Lambda_{j+1}) \tag{1}
$$
where $\kappa$ is the same as in (4.5) (in particular, observe that the second summand reflects the fact that only a fraction of $\Lambda_{j}\setminus\Lambda_{j+1}$ , whose measure is $(1-\kappa)\mathfrak{m}(\Lambda_{j}\setminus\Lambda_{j+1})$ , is iterated in one time step to the complement of $H^{(j)}$ in $\Lambda_{j-1}\setminus\Lambda_{j}$ ). (4.5) can be rewritten
$$
\mathfrak{m}(U_{n}(\tau))=\left(\sum_{i=1}^{\mathcal{I}_{I}}\dfrac{1}{m_{i}}\right)\mathfrak{m}(\Lambda_{j-1})+\kappa\left(1-\sum_{i=1}^{\mathcal{I}_{I}}\dfrac{1}{m_{i}}\right)\mathfrak{m}(\Lambda_{j-1})=\lambda\mathfrak{m}(\Lambda_{j-1})+\kappa\left(1-\lambda\right)\mathfrak{m}(\Lambda_{j-1})
$$ and (4.6) can be rewritten
$$
\mathfrak{m}(U_{n}^{(1)}(\tau))=\kappa\left(1-\lambda\right)\mathfrak{m}(\Lambda_{j-1})+(1-\kappa)\left(1-\lambda\right)\lambda\mathfrak{m}(\Lambda_{j-1}) \tag{1}
$$ Finally,
$$
\begin{split}\theta_{n}=\dfrac{\mathfrak{m}(U_{n}^{(1)}(\tau))}{\mathfrak{m}(U_{n}(\tau))}&=\dfrac{\kappa\left(1-\displaystyle\lambda\right)+(1-\kappa)\left(1-\lambda\right)\lambda}{\lambda+\kappa\left(1-\lambda\right)}=\dfrac{\kappa\left(1-\lambda\right)\left(1-\lambda\right)+\left(1-\lambda\right)\lambda}{\lambda+\kappa\left(1-\lambda\right)}\\
&=\left(1-\lambda\right)\dfrac{\kappa\left(1-\lambda\right)+\lambda}{\lambda+\kappa\left(1-\displaystyle\lambda\right)}=1-\displaystyle\lambda\end{split} \tag{1}
$$ Therefore, $\theta=\displaystyle\lim_{n→∞}\theta_{n}=1-\displaystyle\lambda$ . ∎
**Remark 4.7**
*Observe that with $\Lambda$ equal to the middle- $\frac{1}{3}$ Cantor set as described in Section 2 we obtain $\theta=\dfrac{1}{3}$ .*
4.4. Dependence requirements
In order to prove that the conditions $\D_{q_{n}}$ and $\D^{\prime}_{q_{n}}$ from Section 3.2 hold in our setting we will approximate the sets $A_{n,\ell}^{(q_{n})}$ by a union of cylinders of controlled depth and then use the excellent mixing properties on cylinders in the spirit of [HP14, DHY21, BF24], for example.
Since $A_{n,\ell}^{(q_{n})}$ is itself a finite union of sets of the form $U_{n}(\tau^{\prime},\tau^{\prime\prime}):=U_{n}(\tau^{\prime})\setminus U_{n}(\tau^{\prime\prime})$ , for $\tau^{\prime\prime}<\tau^{\prime}$ , we restrict the analysis to these building blocks in order to simplify the already heavy notation.
Since
$$
\begin{split}U_{n}(\tau^{\prime},\tau^{\prime\prime})&=\left(\Lambda_{j_{n,\tau^{\prime}}}\cup H^{(j_{n,\tau^{\prime}})}\right)\setminus\left(\Lambda_{j_{n,\tau^{\prime\prime}}}\cup H^{(j_{n,\tau^{\prime\prime}})}\right)\\
&=H^{(j_{n,\tau^{\prime}})}\cup\left(\Lambda_{j_{n,\tau^{\prime}}}\setminus\left(\Lambda_{j_{n,\tau^{\prime\prime}}}\cup H^{(j_{n,\tau^{\prime\prime}})}\right)\right),\end{split}
$$
$U_{n}(\tau^{\prime},\tau^{\prime\prime})$ is made up of holes in the dynamical construction of $\Lambda$ (i.e. subsets of the form $\Lambda_{k}\setminus\Lambda_{k+1}$ where $k∈\mathbb{N}$ ), where $j_{n,\tau^{\prime}}$ is as in (4.2). Moreover, as observed earlier, for $n$ sufficiently large,
$$
\begin{split}U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(q)}&=U_{n}(\tau^{\prime},\tau^{\prime\prime})\cap G^{-1}(U_{n}(\tau^{\prime},\tau^{\prime\prime}))^{c}\cap\dots\cap G^{-q}(U_{n}(\tau^{\prime},\tau^{\prime\prime}))^{c}\\
&=U_{n}(\tau^{\prime},\tau^{\prime\prime})\cap G^{-1}(U_{n}(\tau^{\prime},\tau^{\prime\prime}))^{c}\end{split}
$$
so, in particular, taking $q_{n}=1$ for all $n∈\mathbb{N}$ is the natural choice to prove condition $\D_{q_{n}}^{{}^{\prime}*}$ , which is established in 4.4.
Combining (4.7) and (4.8), we obtain
$$
U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}=H^{(j_{n,\tau^{\prime}})}\ \bigcup\left\{\left((\Lambda_{j_{n,\tau^{\prime}}}\setminus\Lambda_{j_{n,\tau^{\prime\prime}}})\setminus H^{(j_{n,\tau^{\prime\prime}})}\right)\cap G^{-1}\left((\Lambda_{j_{n,\tau^{\prime}}-1}\setminus\Lambda_{j_{n,\tau^{\prime}}})\setminus H^{(j_{n,\tau^{\prime}})}\right)\right\}. \tag{1}
$$
For a Borel set $A⊂[0,1]$ and $j∈\mathbb{N}$ , we define
$$
\Upsilon^{+}(A,j):=\bigcup_{\omega\in\mathcal{P}_{j}\colon\omega\cap A\neq\emptyset}\omega\qquad\text{and}\qquad\Upsilon^{-}(A,j):=\bigcup_{\omega\in\mathcal{P}_{j}\colon\omega\subset A}\omega.
$$
They are the approximations of $A$ from above and below by $j$ -cylinders.
**Lemma 4.8**
*Let $U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}$ be given and let $j_{n,\tau^{\prime}}$ be associated to this set as in (4.2). Let $\Gamma_{n}^{+}=\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},2j_{n,\tau^{\prime}})$ and $\Gamma_{n}^{-}=\Upsilon^{-}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},2j_{n,\tau^{\prime}})$ be the respective outer and inner approximations by unions of $(2j_{n,\tau^{\prime}})$ -cylinders. Then there exists $(\rho_{n})_{n}$ such that
$$
\dfrac{\mathfrak{m}(\Gamma_{n}^{+}\setminus\Gamma_{n}^{-})}{\mathfrak{m}\left(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}\right)}\leq\rho_{n} \tag{1}
$$
where $\displaystyle\lim_{n→∞}\rho_{n}=0$ .*
* Proof*
We write
$$
U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}=H^{(j_{n,\tau^{\prime}})}\cup K \tag{1}
$$
where $K⊂\Lambda_{j_{n,\tau^{\prime}}}\setminus\Lambda_{j_{n,\tau^{\prime}}+1}$ . Therefore,
$$
\begin{split}\mathfrak{m}\left(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}\right)&=\beta\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}-1}\setminus\Lambda_{j_{n,\tau^{\prime}}})+\gamma\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}}\setminus\Lambda_{j_{n,\tau^{\prime}}+1})\\
&=\beta\left(1-\lambda\right)\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}-1})+\gamma\left(1-\lambda\right)\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}})\\
&=\beta\left(1-\lambda\right)\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}-1})+\gamma\left(1-\lambda\right)\lambda\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}-1})\\
&=\left(\beta\left(1-\lambda\right)+\gamma\left(1-\lambda\right)\lambda\right)\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}-1})\\
&=\left(\beta\theta+\gamma\theta(1-\theta)\right)\lambda^{j_{n,\tau^{\prime}}-1}\end{split} \tag{1}
$$
where $\beta,\gamma∈[0,1]$ and $\theta$ is as in Proposition 4.6 (as $H^{(j_{n,\tau^{\prime}})}$ is a fraction of $\Lambda_{j_{n,\tau^{\prime}}-1}\setminus\Lambda_{j_{n,\tau^{\prime}}}$ while $K$ is a fraction of $\Lambda_{j_{n,\tau^{\prime}}}\setminus\Lambda_{j_{n,\tau^{\prime}}+1}$ ). Observe that the larger the set $H^{(j_{n,\tau^{\prime}})}$ is, the smaller the set $K$ is and vice-versa. This means that, in particular, $\beta$ and $\gamma$ cannot be simultaneously equal to $0$ or to $1$ . On the other hand, taking into consideration the definition of the function $\psi$ and of the approximating cylinders, as well as the fact that $G$ is a full branched Markov linear map (which, in particular, means that there exist $\tilde{C}>0$ and $0<\eta<1$ such that $\mathfrak{m}(\omega)≤\tilde{C}\eta^{n}$ , for all $n∈\mathbb{N}$ and $\omega∈\mathcal{P}^{n}$ ), we claim that
$$
\mathfrak{m}(\Gamma_{n}^{+}\setminus\Gamma_{n}^{-})\leq C\eta^{2j_{n,\tau^{\prime}}},\quad\text{for some}\quad C>0.
$$
To see this, we start by noting that since $K$ is a preimage of the complement of $H^{(j_{n,\tau^{\prime}})}$ in $\Lambda_{j_{n,\tau^{\prime}}-1}\setminus\Lambda_{j_{n,\tau^{\prime}}}$ , we only need to check that $\mathfrak{m}\left(\Upsilon^{+}(H^{(j_{n,\tau^{\prime}})},2j_{n,\tau^{\prime}})\setminus\Upsilon^{-}(H^{(j_{n,\tau^{\prime}})},2j_{n,\tau^{\prime}})\right)≤ C\eta^{2j_{n,\tau^{\prime}}}$ for some $C>0$ . Recalling Remark 4.5, assume first that $j_{n,\tau^{\prime}}^{2}>2j_{n,\tau^{\prime}}$ . This means that $H^{(j_{n,\tau^{\prime}})}⊂\Lambda_{j_{n,\tau^{\prime}}}^{\prime}$ , where $\Lambda_{j_{n,\tau^{\prime}}}^{\prime}$ is homothetic to $\Lambda_{j_{n,\tau^{\prime}}}$ , which was scaled to fit in one the the holes of $\Lambda_{j_{n,\tau^{\prime}}}$ that comprise $\Lambda_{j_{n,\tau^{\prime}}-1}\setminus\Lambda_{j_{n,\tau^{\prime}}}$ in order to give rise to $\Lambda_{j_{n,\tau^{\prime}}}^{\prime}$ . Hence, since $\emptyset⊂\Upsilon^{-}(H^{(j_{n,\tau^{\prime}})}⊂\Upsilon^{+}(H^{(j_{n,\tau^{\prime}})}⊂\Lambda_{j_{n,\tau^{\prime}}}^{\prime}$ , we have that
$$
\mathfrak{m}\left(\Upsilon^{+}(H^{(j_{n,\tau^{\prime}})},2j_{n,\tau^{\prime}})\setminus\Upsilon^{-}(H^{(j_{n,\tau^{\prime}})},2j_{n,\tau^{\prime}})\right)\leq\mathfrak{m}(\Lambda_{j_{n,\tau^{\prime}}}^{\prime})\leq C\eta^{2j_{n,\tau^{\prime}}},
$$
for some $C>0$ . Now, if $j_{n,\tau^{\prime}}^{2}≤ 2j_{n,\tau^{\prime}}$ , we decompose the set $H^{(j_{n,\tau^{\prime}})}=\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}}\cup H^{(j_{n,\tau}^{2})}$ further, as in Remark 4.5. Then, observe that $\Lambda^{\prime}_{j_{n,\tau}^{2}-j_{n,\tau}}$ can clearly be written as a union of cylinders of $\mathcal{P}^{2j_{n,\tau^{\prime}}}$ , which means that we only need to estimate the measure of $\Upsilon^{+}(H^{(j_{n,\tau}^{2})},2j_{n,\tau^{\prime}})\setminus\Upsilon^{-}(H^{(j_{n,\tau}^{2})},2j_{n,\tau^{\prime}})$ . We now consider the case $j_{n,\tau^{\prime}}^{3}≤ 2j_{n,\tau^{\prime}}$ , which will lead us to the same conclusion, as before, and otherwise proceed to another decomposition of $H^{(j_{n,\tau}^{2})}$ until we end by exhaustion. The result now follows from gathering estimates (4.10) and (4.11). ∎
It is a direct consequence of Lemma 4.8 that the REPP counting the number of hits to $U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}$ converges if and only if the REPP counting the number of hits to either $\Gamma_{n}^{+}$ or $\Gamma_{n}^{-}$ converges, and the limits are the same (see [BF24, Lemma 3.13], for example).
As a result, we only need to check that the conditions $\D_{q_{n}}^{*}$ and $\D^{{}^{\prime}*}_{q_{n}}$ are verified for the cylinder approximating sets. To this end, we next define a strong form of mixing, which is a property of the maps we are considering here.
**Definition 4.9**
*Let $(\mathcal{X},\mathcal{B}_{\mathcal{X}},\mu,f)$ be a probability preserving system. Let $\mathcal{F}_{0,k}$ denote the $\sigma$ -algebra generated by $\mathcal{P}^{k}$ . $(\mathcal{X},\mathcal{B}_{\mathcal{X}},\mu,f)$ is exponentially $\phi$ -mixing if there exist $C,c>0$ such that for every $H∈\mathcal{F}_{0,k}$ and every $A∈\mathcal{B}_{\mathcal{X}}$
$$
\lvert\mu(H\cap f^{-(t+k)}(A))-\mu(H)\mu(A)\rvert\leq C{\rm e}^{-ct}\mu(A).
$$*
We remark that $G$ is exponentially $\phi$ -mixing, see for example [AN05, Theorem 1]. Recall that, as observed above, we only need to check the conditions $\D_{q_{n}}^{*}$ , $\D_{q_{n}}^{{}^{\prime}*}$ , where the set $A_{n,\ell}^{(q_{n})}$ is replaced by the approximations $\Gamma_{n}^{+}$ and $\Gamma_{n}^{-}$ . In what follows, we will only check such conditions for $\Gamma_{n}^{+}$ , but the argument applies with minor adjustments to $\Gamma_{n}^{-}$ .
* Proof of ConditionДqn∗\D_{q_{n}}^{*}*
We need to control:
$$
E_{n}:=\left\lvert\mathfrak{m}\left(\Gamma_{n}^{+}\cap\bigcap_{i=\ell}^{N}\mathscr{W}_{J_{n,i}}\left(\Gamma_{n}^{+}\right)\right)-\mathfrak{m}\left(\Gamma_{n}^{+}\right)\mathfrak{m}\left(\bigcap_{i=\ell}^{N}\mathscr{W}_{J_{n,i}}\left(\Gamma_{n}^{+}\right)\right)\right\rvert
$$ We start by noting that $E_{n}≤ 2\mathfrak{m}(\Gamma_{n}^{+})$ . Also, observe that $\Gamma_{n}^{+}∈\mathcal{F}_{0,2j_{n,\tau^{\prime}}}$ . Assuming that $t>2j_{n,\tau^{\prime}}$ and writing (4.12) with $H=\Gamma_{n}^{+}$ and $f^{-(t-2j_{n,\tau^{\prime}})+2j_{n,\tau^{\prime}}}(A)=\bigcap_{i=l}^{m}\mathscr{W}_{J_{n,i}}(\Gamma_{n}^{+})$ , we estimate $E_{n}≤ C{\rm e}^{-c(t-2j_{n,\tau^{\prime}})}$ . Hence, we can set $\gamma(n,t)=\max\left\{2,C{\rm e}^{-c(t-2j_{n,\tau^{\prime}})}\right\}$ . Since $j_{n,\tau^{\prime}}$ grows logarithmically in $n$ , we can choose $t_{n}=o(n)$ and satisfying conditions (3.1) so that $\lim_{n→∞}n\gamma(n,t_{n})=0$ . ∎
* Proof of ConditionДqn∗′\D_{q_{n}}^{{}^{\prime}*}*
We start by noting that, by construction, the points of $U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}$ are sent by $G$ into the holes that appeared in the $(j_{n,\tau^{\prime}}-1)$ -th and/or the $(j_{n,\tau^{\prime}}-2)$ -th step of the construction of $\Lambda$ and then they are sent successively by $G$ to lower and lower ranked holes in the construction until they reach the first holes of the construction and only then can they return to $U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)}$ . Using this fact and the exponential $\phi$ mixing property of $G$ , we have:
$$
\begin{split}&n\mathfrak{m}\left(\Gamma_{n}^{+}\cap\mathscr{W}^{c}_{[q_{n}+1,r_{n})}\left(A_{n,(\bar{\tau}_{i})_{i},N}\right)\right)\leq n\sum_{j=j_{n,\tau^{\prime}}-1}^{r_{n}-1}\mathfrak{m}\left(\Gamma_{n}^{+}\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)\\
&\leq n\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}\mathfrak{m}\left(\Gamma_{n}^{+}\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)+n\sum_{j=2j_{n,\tau^{\prime}}}^{r_{n}-1}\mathfrak{m}\left(\Gamma_{n}^{+}\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)\\
&\leq n\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}\mathfrak{m}\left(\Gamma_{n}^{+}\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)+n\sum_{j=2j_{n,\tau^{\prime}}}^{r_{n}-1}\mathfrak{m}(\Gamma_{n}^{+})\mathfrak{m}(\Gamma_{n}^{+})+n\mathfrak{m}(\Gamma_{n}^{+})\sum_{j=2j_{n,\tau^{\prime}}}^{r_{n}-1}C{\rm e}^{-cj}\\
&\leq n\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}\mathfrak{m}\left(\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)+\tilde{C}\frac{n^{2}\left(\mathfrak{m}(\Gamma_{n}^{+})\right)^{2}}{k_{n}}+\hat{C}n\mathfrak{m}(\Gamma_{n}^{+})\sum_{j=2j_{n,\tau^{\prime}}}^{\infty}C{\rm e}^{-cj},\end{split} \tag{1}
$$
for some $\tilde{C},\hat{C}>0$ . The second term goes to $0$ by (3.2), Lemma 4.8 and the fact that $k_{n}→∞$ . The third term goes to $0$ by (3.2) and the fact that $j_{n,\tau^{\prime}}→∞$ . Now, to handle the first term, we write $\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)=\bigcup_{\omega∈\mathcal{P}_{j}\colon\omega⊂\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)}\omega$ , where the union is disjoint. Since $G$ is a full branched linear Markovian map, we have
$$
\mathfrak{m}\left(\omega\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)\leq C\mathfrak{m}(\omega)\mathfrak{m}\left(\Gamma_{n}^{+}\right).
$$
It follows that
$$
\begin{split}n\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}&\mathfrak{m}\left(\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)\\
&\leq n\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}\sum_{\omega\in\mathcal{P}_{j}\colon\omega\subset\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)}\mathfrak{m}\left(\omega\cap f^{-j}\left(\Gamma_{n}^{+}\right)\right)\\
&\leq n\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}\sum_{\omega\in\mathcal{P}_{j}\colon\omega\subset\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)}C\mathfrak{m}(\omega)\mathfrak{m}\left(\Gamma_{n}^{+}\right)\\
&\leq Cn\sum_{j=j_{n,\tau^{\prime}}-1}^{2j_{n,\tau^{\prime}}}\mathfrak{m}\left(\Upsilon^{+}(U_{n}(\tau^{\prime},\tau^{\prime\prime})^{(1)},j)\right)\mathfrak{m}\left(\Gamma_{n}^{+}\right)\leq\hat{C}\frac{j_{n,\tau^{\prime}}}{n}\xrightarrow[n\to\infty]{}0\end{split} \tag{1}
$$
∎
4.5. Anchored tail process
We will see that for our $G$ and $\varphi_{\alpha}$ the Cantor set $\Lambda$ behaves like a fixed point.
**Proposition 4.10**
*Let $G$ be as defined in Section 2 and let $\varphi_{\alpha}$ be as in Definition 2.3. Then, the anchored tail process is (a.s.) the bi-infinite sequence $(Z_{j})_{j∈\mathbb{Z}}$ with entry $U.(1-\theta)^{-j}$ at $j∈\mathbb{N}_{0}$ and $∞$ otherwise, where $U$ is uniformly distributed on $[0,1]$ and $\theta$ is the extremal index (cf. Proposition 4.6).*
* Proof*
We check that the process $(Y_{j})_{j∈\mathbb{Z}}$ is (a.s.) the bi-infinite sequence with:
1. entry $U$ at $j=0$ ;
1. entries $U.(1-\theta)^{-j}$ for all positive indices $j$ ;
1. $∞$ for all negative indices $j$ except, possibly, $U.(1-\theta)^{-j}$ at $j≥-m$ for some $m∈\mathbb{N}$ ;
where $U$ is uniformly distributed on $[0,1]$ . Observe that $U.(1-\theta)^{-j}<1$ for all $j<0$ . Therefore, if $(Y_{j})_{j∈\mathbb{Z}}$ is as described by (i)-(iii) then $(Z_{j})_{j∈\mathbb{Z}}$ is as in the statement of the theorem. We must check that conditions (1)-(4) in Section 3.1 are satisfied for $(Y_{j})_{j∈\mathbb{Z}}$ as given by (i)-(iii). Conditions (2) and (3) are straightforward. As for condition (4), $Y_{j}≥ 1$ for all $j≤-1$ implies that the hit to $\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ at time $r_{n}$ is the first hit to that same neighbourhood of $\Lambda$ (i.e. there is no hit to $\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ before time $r_{n}$ given that there is a hit to $\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ at time $r_{n}$ - recall, from (4.3), that $\{X_{r_{n}}>u_{n}(\tau)\}=G^{-r_{n}}(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})})$ ). Since
$$
\dfrac{\mathfrak{m}(G^{-r_{n}}(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})})\cap G^{-(r_{n}-1)}([0,1]\setminus(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})})))}{\mathfrak{m}(G^{-r_{n}}(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}))}>0,
$$
condition (4) is satisfied. So, we check that condition (1) holds. Recall that $u_{n}^{-1}(X_{j})=\left(\dfrac{X_{j}}{\mathfrak{a}_{n}}\right)^{-\alpha}$ . Therefore,
$$
\left\{\dfrac{1}{\tau}u_{n}^{-1}(X_{r_{n}+j})\>\middle|\>X_{r_{n}}>u_{n}(\tau)\right\}=\left\{\dfrac{1}{\tau}\left(\dfrac{X_{r_{n}+j}}{\mathfrak{a}_{n}}\right)^{-\alpha}\>\middle|\>X_{r_{n}}>u_{n}(\tau)\right\}.
$$ Now, $\{x∈[0,1]:X_{r_{n}}(x)>u_{n}(\tau)\}=G^{-r_{n}}(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})})$ . First note that for $t∈[0,1]$ , $\mathbb{P}(Y_{0}≤ t)=\mathbb{P}(F\circ\psi≤ t)=t$ by Lemma 2.2 (c)(iii), so we write this uniform distribution by $U$ . For $j≥ 1$ , if $G^{r_{n}}(x)∈\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ then $G^{r_{n}+j}(x)∈\Lambda_{j_{n,\tau}-j}\cup H^{(j_{n,\tau}-j)}$ provided $j_{n,\tau}-j≥ 0$ , in which case $\psi(G^{r_{n}+j}(x))=2^{j}\psi(G^{r_{n}}(x))$ giving
$$
\begin{split}\dfrac{F(\psi(G^{r_{n}+j}(x)))}{F(\psi(G^{r_{n}}(x)))}=\dfrac{\mathbb{P}(\psi\leq\psi(G^{r_{n}+j}(x)))}{\mathbb{P}(\psi\leq\psi(G^{r_{n}}(x)))}=\dfrac{\mathfrak{m}(\Lambda_{j_{n,\tau}-j}\cup H^{(j_{n,\tau}-j)})}{\mathfrak{m}(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})})}&=\left(\displaystyle\sum_{i=1}^{k}\dfrac{1}{m_{i}}\right)^{-j}\\
&=(1-\theta)^{-j}\end{split}
$$ We have
$$
\lim_{n\to\infty}\mathbb{P}\left(\left\{\dfrac{u_{n}^{-1}(X_{r_{n}+j})}{u_{n}^{-1}(X_{r_{n}})}=(1-\theta)^{-j}\>\middle|\>X_{r_{n}}>u_{n}(\tau)\right\}\right)=1
$$
(since $j_{n,\tau}-j≥ 0$ when $n→∞$ ). We may write
$$
\begin{split}&\hskip-28.45274pt\lim_{n\to\infty}\mathbb{P}\left\{\dfrac{1}{\tau}u_{n}^{-1}(X_{r_{n}+j})=s\>\middle|\>X_{r_{n}}>u_{n}(\tau)\right\}\\
&=\lim_{n\to\infty}\mathbb{P}\left\{\dfrac{u_{n}^{-1}(X_{r_{n}+j})}{u_{n}^{-1}(X_{r_{n}})}\dfrac{u_{n}^{-1}(X_{r_{n}})}{\tau}=s\>\middle|\>X_{r_{n}}>u_{n}(\tau)\right\}\end{split}
$$
which, by (2) in in Section 3.1 holds, gives $\mathcal{L}(Y_{j})=\mathcal{L}(\Theta_{j}Y_{0})=(1-\theta)^{-j}.U$ (a.s.). Let $j<0$ . If $G^{r_{n}}(x)∈\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ then $G^{r_{n}+j}(x)∈\Lambda_{j_{n,\tau}-j}\cup H^{(j_{n,\tau}-j)}$ for an at most finite number of indices $j≥-m$ where $m∈\mathbb{N}$ . In words, a hit, at time $r_{n}$ , to the approximation (of $\Lambda$ ) $\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ can only be preceded by a finite number of hits to the finer approximations $\Lambda_{j_{n,\tau}-j}\cup H^{(j_{n,\tau}-j)}$ , as, otherwise, the point would be in $\Lambda$ . This leads to $Y_{j}=U.(1-\theta)^{-j}$ for an at most finite number of indices $j≥-m$ where $m∈\mathbb{N}$ . If $G^{r_{n}}(x)∈\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})}$ is such that $G^{r_{n}+k}(x)∈[0,1]\setminus\Lambda_{1}$ for some $k∈\mathcal{K}⊂eq\{1,...,-j\}$ , we have
$$
\dfrac{\psi(G^{r_{n}+j}(x))}{\psi(G^{r_{n}}(x))}=\dfrac{\sum_{k\in\mathcal{K}}2^{-k}+2^{j}\psi(G^{r_{n}}(x))}{\psi(G^{r_{n}}(x))}
$$
so that
$$
\dfrac{F(\psi(G^{r_{n}+j}(x)))}{F(\psi(G^{r_{n}}(x)))}=\dfrac{\mathfrak{m}([0,1]\setminus\Lambda_{1})}{\mathfrak{m}(\Lambda_{j_{n,\tau}}\cup H^{(j_{n,\tau})})}.
$$ This leads to
$$
\lim_{n\to\infty}\mathbb{P}\left\{\dfrac{u_{n}^{-1}(X_{r_{n}+j})}{u_{n}^{-1}(X_{r_{n}})}=\infty\>\middle|\>X_{r_{n}}>u_{n}(\tau)\right\}=1.
$$
So, $Y_{j}=∞$ . ∎
4.6. The $1<\alpha<2$ case
To prove our result in the case $1<\alpha<2$ , we are required to satisfy the small jumps condition (3.12) and we will also need to check that the sequence $(Q_{j})_{j∈\mathbb{Z}}$ , obtained from the spectral decomposition of the transformed anchored tail process, satisfies the assumption (3.13). Since we have geometric decay of $\mathcal{Q}_{j}$ (from geometric growth of $(\mathbf{\tilde{Q}}_{i,j})_{j}$ ), this latter condition holds in our case.
For the small jumps condition, we will now follow the proof strategy of [CNT25]. Note that they did not cover the case $\alpha=1$ , so we will also omit this. As is well known (see [D83]), to show (3.12) it is sufficient to show that for $Y_{j}=X_{j}{\bf 1}_{X_{j}≤\varepsilon\mathfrak{a}_{n}}-\mathbb{E}\left(X_{j}{\bf 1}_{X_{j}≤\varepsilon\mathfrak{a}_{n}}\right)$ , that
$$
\lim_{\varepsilon\to 0}\limsup_{n\to\infty}\frac{n}{\mathfrak{a}_{n}^{2}}\sum_{j=1}^{n}\max\left\{0,\mathbb{E}(Y_{1}Y_{j})\right\}=0.
$$
As in [CNT25], we define $\phi_{n,\varepsilon}=\phi_{\alpha}·{\bf 1}_{\phi_{\alpha}≤\varepsilon\mathfrak{a}_{n}}$ and it is sufficient to show that
$$
\lim_{\varepsilon\to 0}\limsup_{n\to\infty}\frac{n}{\mathfrak{a}_{n}^{2}}\sum_{j=1}^{n}\int|\phi_{n,\varepsilon}|\cdot|\phi_{n,\varepsilon}\circ G^{j}|\penalty 10000\ d\mathfrak{m}=0.
$$
In that paper they use the decay of correlations against $BV$ , which we also have here. By Lemma 2.2 (c)(ii) that the set $\{\phi_{\alpha}<t\}=\Lambda_{k}$ for $k≈\frac{-\alpha\log t}{\log\lambda}$ , so the complexity here is polynomial. That is, there exists $\beta>1$ such that
$$
\|{\bf 1}_{\{\phi_{\alpha}<t\}}\|_{BV}\leq Kt^{\beta}.
$$
This is useful for us since if we have exponential decay of correlations at rate $\theta$ , as in [CNT25] we need only show
$$
\lim_{\varepsilon\to 0}\limsup_{n\to\infty}\frac{n}{\mathfrak{a}_{n}^{2}}\sum_{j=1}^{b\log n}\int|\phi_{n,\varepsilon}|\cdot|\phi_{n,\varepsilon}\circ G^{j}|\penalty 10000\ d\mathfrak{m}=0
$$
since we can choose $b$ large enough so that
$$
\frac{n}{\mathfrak{a}_{n}^{2}}\|\phi_{n,\varepsilon}\|_{BV}\|\phi_{n,\varepsilon}\|_{L^{1}}e^{-cj}\lesssim\frac{n}{\mathfrak{a}_{n}^{2}}(\mathfrak{a}_{n}\varepsilon)^{\beta+1}e^{-\theta j}\approx n^{1+\frac{1}{\alpha}(\beta-1)}e^{-\theta j}
$$
is bounded for $j=b\log n$ .
For the rest of the argument, we can follow the steps of the proof of [CNT25, Theorem 8.1]. For the estimates (II) and (III) there, the argument is identical. For (I) we observe that in the setting of this paper we are always in Case 1, and the derivative is constant 3, so we get an exponentially decaying constant in front of $∈t\phi_{n,\varepsilon}^{2}\penalty 10000\ d\mathfrak{m}$ , so the proof then follows.
4.7. Conclusion of Theorem 4.1
We have shown that our observable has regularly varying tails, a well-defined anchored process, that the system satisfies the relevant dependence/mixing conditions and in the case $1<\alpha<2$ , we have the required small jumps condition. Hence Theorem 4.1 (and hence, as in [FFT25], also Theorem 4.2) is proved.
References
- [AN05] J. Aaronson and H. Nakada. On the mixing coefficients of piecewise monotonic maps. volume 148, pages 1–10. 2005. Probability in mathematics.
- [AFF20] M. Abadi, A. C. M. Freitas, and J. M. Freitas. Dynamical counterexamples regarding the extremal index and the mean of the limiting cluster size distribution. J. Lond. Math. Soc. (2), 102(2):670–694, 2020.
- [BF24] D. Bansard-Tresse and J. M. Freitas. Inducing techniques for quantitative recurrence and applications to misiurewicz maps and doubly intermittent maps. to appear in Ann. Inst. Henri Poincaré Probab. Stat., 2024.
- [BPS18] B. Basrak, H. Planinić, and P. Soulier. An invariance principle for sums and record times of regularly varying stationary sequences. Probab. Theory Related Fields, 172, 869–914, 2018.
- [CKM24] I. Chevyrev, A. Korepanov, and I. Melbourne. Superdiffusive limits beyond the Marcus regime for deterministic fast-slow systems. Commun. Am. Math. Soc., 4:746–786, 2024.
- [CNT25] A. Chen, M. Nicol, A. Török, Birkhoff sum convergence of Fréchet observables to stable laws for Gibbs-Markov systems and applications, to appear in Nonlinearity.
- [DHY21] C. Davis, N. Haydn, and F. Yang. Escape rate and conditional escape rate from a probabilistic point of view. Ann. Henri Poincaré, 22(7):2195–2225, 2021.
- [D83] R. A. Davis. Stable limits for partial sums of dependent random variables. Ann. Probab., 11(2):262–269, 1983.
- [DH95] R. A. Davis and T. Hsing. Point process and partial sum convergence for weakly dependent random variables with infinite variance. Ann. Probab., 23(2):879–917, 1995.
- [FAMR19] D. Faranda, M. C. Alvarez-Castro, G. Messori, D. Rodrigues, and P. Yiou. The hammam effect or how a warm ocean enhances large scale atmospheric predictability. Nature Communications, 10(1):1316, 2019.
- [FMAY17] D. Faranda, G. Messori, M. C. Alvarez-Castro, and P. Yiou. Dynamical properties and extremes of northern hemisphere climate fields over the past 60 years. Nonlinear Processes in Geophysics, 24(4):713–725, 2017.
- [FPB23] D. Faranda, S. Pascale, and B. Bulut. Persistent anticyclonic conditions and climate change exacerbated the exceptional 2022 european-mediterranean drought. Environmental Research Letters, 18(3):034030, 2023.
- [FFRS20] A. C. M. Freitas, J. M. Freitas, F. B. Rodrigues, and J. V. Soares. Rare events for Cantor target sets. Comm. Math. Phys., 378(1):75–115, 2020.
- [FFS21] A. C. M. Freitas, J. M. Freitas, and J. V. Soares. Rare events for product fractal sets. J. Phys. A, 54(34):Paper No. 345202, 30, 2021.
- [FFT10] A. C. M. Freitas, J. M. Freitas, and M. Todd. Hitting time statistics and extreme value theory. Probab. Theory Related Fields, 147(3-4):675–710, 2010.
- [FFT11] A. C. M. Freitas, J. M. Freitas, and M. Todd. Extreme value laws in dynamical systems for non-smooth observations. J. Stat. Phys., 142(1):108–126, 2011.
- [FFT12] A. C. M. Freitas, J. M. Freitas, and M. Todd. The extremal index, hitting time statistics and periodicity. Adv. Math., 231(5):2626–2665, 2012.
- [FFT25] A.C.M. Freitas, J.M. Freitas, M. Todd, Enriched functional limit theorems for dynamical systems, to appear in Ann. Scuola Norm. Sup. Pisa Cl. Sci.
- [HP14] N. T. A. Haydn and Y. Psiloyenis. Return times distribution for Markov towers with decay of correlations. Nonlinearity, 27(6):1323–1349, 2014.
- [JPZ20] P. Jung, F. Pène, and H.-K. Zhang. Convergence to $\alpha$ -stable Lévy motion for chaotic billiards with several cusps at flat points. Nonlinearity, 33(2):807–839, 2020.
- [LWZ81] R. LePage, M. Woodroofe, and J. Zinn. Convergence to a stable distribution via order statistics. Ann. Probab., 9(4):624–632, 1981.
- [MP16] G. Mantica and L. Perotti. Extreme value laws for fractal intensity functions in dynamical systems: Minkowski analysis. J. Phys. A, 49(37):374001, 21, 2016.
- [R87] S. I. Resnick. Extreme values, regular variation, and point processes, volume 4 of Applied Probability. A Series of the Applied Probability Trust. Springer-Verlag, New York, 1987.
- [T10] M. Tyran-Kamińska. Weak convergence to Lévy stable processes in dynamical systems. Stoch. Dyn., 10(2):263–289, 2010.