Rarefied particle motions on hillslopes – Part 1: Theory

We describe the probabilistic physics of rarefied particle motions and deposition on rough hillslope surfaces. The particle energy balance involves gravitational heating with conversion of potential to kinetic energy, frictional cooling associated with particle–surface collisions, and an apparent heating associated with preferential deposition of low-energy particles. Deposition probabilistically occurs with frictional cooling in relation to the distribution of particle energy states whose spatial evolution is described by a Fokker–Planck equation. The Kirkby number Ki – defined as the ratio of gravitational heating to frictional cooling – sets the basic deposition behavior and the form of the probability distribution fr (r) of particle travel distances r , a generalized Pareto distribution. The shape and scale parameters of the distribution are well-defined mechanically. For isothermal conditions where frictional cooling matches gravitational heating plus the apparent heating due to deposition, the distribution fr (r) is exponential. With non-isothermal conditions and small Ki this distribution is bounded and represents rapid thermal collapse. With increasing Ki the distribution fr (r) becomes heavy-tailed and represents net particle heating. It may possess a finite mean and finite variance, or the mean and variance may be undefined with sufficiently large Ki. The formulation provides key elements of the entrainment forms of the particle flux and the Exner equation, and it clarifies the mechanisms of particle-size sorting on large talus and scree slopes. Namely, with conversion of translational to rotational kinetic energy, large spinning particles are less likely to be stopped by collisional friction than are small or angular particles for the same surface roughness.


Introduction
Sediment transport on steepland hillslopes involves a great range of scales of particle motions. These vary from relatively small motions that collectively produce the slow en masse motion of disturbance-driven creep (Culling, 1963;Roering et al., 1999Roering et al., , 2002Gabet, 2000;Anderson, 2002;Furbish, 2003;Roering, 2004;Furbish et al., 2009Furbish et al., , 2018a in concert with athermal granular creep (Houssais and Jerolmack, 2017;Bendror and Goren, 2018;Ferdowsi et al., 2018;Deshpande et al., 2020) to the longdistance and relatively fast en masse motions of landsliding and the rarefied motions associated with rockfall and ravel (Kirkby and Statham, 1975;Statham, 1976;Dorren, 2003;Roering and Gerber, 2005;Luckman, 2013;Tesson et al., 2020). Particularly in relation to long-distance motions, there is a growing interest in non-continuum formulations of sediment transport on hillslopes that are aimed Published by Copernicus Publications on behalf of the European Geosciences Union. 540 D. J. Furbish et al.: Rarefied particle motions -Part 1: Theory at accommodating nonlocal transport, where the particle flux at a hillslope position x depends on upslope conditions that influence the entrainment and motions of particles reaching x. These formulations include explicit particle-based descriptions (Tucker and Bradley, 2010) and probabilistic descriptions (Foufoula-Georgiou et al., 2010;Furbish and Haff, 2010;Furbish and Roering, 2013;Doane, 2018;Doane et al., 2018Doane et al., , 2019 of sediment motions. Importantly, these descriptions do not hinge on satisfying a continuum-like behavior as assumed in most previous treatments of transport on hillslopes. Nonetheless, to date these particle-based and probabilistic descriptions of transport are mostly kinematic in form, lacking a formal mechanical underpinning. Herein we focus on rarefied motions of particles which, once entrained, travel downslope over the land surface. This notably includes the dry ravel of particles down hillslopes following disturbances (Roering and Gerber, 2005;Doane, 2018;Doane et al., 2019;Roth et al., 2020) or upon their release from obstacles (e.g., vegetation) following failure of the obstacles (Lamb et al., 2011DiBiase and Lamb, 2013;DiBiase et al., 2017;Doane et al., 2018Doane et al., , 2019, and the motions of rockfall material over the surfaces of talus and scree slopes (Gerber and Scheidegger, 1974;Kirkby and Statham, 1975;Statham, 1976;Dorren, 2003;Luckman, 2013) (Fig. 1). By "rarefied motions" we are referring to the situation in which moving particles may frequently interact with the surface but rarely interact with each other. Thus, rarefied particle motions are decidedly distinct from granular flows. Indeed, processes such as rockfall and the subsequent motions of the rock material over talus or scree slopes represent the archetypal case of rarefied particle motions. Nonetheless, the ideas outlined below pertaining to the motions of individual particles may be entirely relevant to conditions that are not strictly rarefied, but where during the collective motions of many particles (e.g., during ravel) the effects of particle-surface interactions dominate over effects of particle-particle interactions in determining the behavior of the particles -akin to granular shear flows at high Knudsen number (Risso and Cordero, 2002;Kumaran, 2005Kumaran, , 2006. To be clear, the Knudsen number Kn is conventionally defined as the ratio of the mean free path between particleparticle collisions and a characteristic length, for example, a system dimension, a gradient length scale, or a numerical resolution scale; also see Sect. 2 in Furbish et al. (2018b). (For an ordinary gas, the onset of rarefied conditions occurs when Kn 0.01.) We note that laboratory experiments (Kirkby and Statham, 1975;Gabet and Mendoza, 2012;Furbish et al., 2021a) and field-based experiments (DiBiase et al., 2017;Roth et al., 2020) designed to mimic particle motions and travel distances on hillslopes effectively focus on rarefied conditions. Indeed, these conditions represent one of the most fundamental of Earth surface processes imaginable -how individual sediment particles that are not transported by a fluid move down a rough inclined surface. Tuff showing downslope sorting of particle sizes, with the largest particles preferentially accumulating near the base of the slope. The largest boulders in the foreground are about 1 m in diameter. As described in the text, we suspect that with conversion of translational to rotational kinetic energy, large spinning particles are less likely to be stopped by collisional friction than are small or angular particles for the same surface roughness, thus contributing to the sorting in this image. Image location is at the confluence of the Rito de los Frijoles river canyon with the Rio Grande river canyon on the eastern boundary of the Bandelier National Monument, New Mexico, USA.
The purpose of this paper is to provide a probabilistic description of the physics of rarefied particle motions and disentrainment. This involves threading together elements of statistical mechanics, concepts from granular gas theory, particle collision mechanics, and probability distribution theory. To motivate the formalism we start in Sect. 2 with a probabilistic definition of the particle disentrainment rate and show its relation to the entrainment forms of the flux and the Exner equation, following previous presentations (Furbish and Haff, 2010;Furbish and Roering, 2013). This highlights how the disentrainment rate determines the probability distribution of travel distances and thus connects descriptions of the flux and mass conservation with the physics of particle motions. In Sect. 3 we formulate disentrainment in terms of particle energetics, where the particles are treated as a rarefied granular gas. Ensemble-averaged motions are described in terms of a balance between gravitational heating and frictional cooling, wherein the latter leads to deposition. We neglect entrainment. (Our choice of terminology is based on that of granular physics as outlined in Appendix A.) The analysis in Sect. 4 illustrates the effects of collisional friction in determining the basic form of the distribution of travel distances, a generalized Pareto distribution. Depending on the balance between heating and cooling, this distribution transitions from a bounded form representing rapid thermal collapse to a heavy-tailed form representing net particle heating. In Sect. 5 we compare the formulation with previous descriptions of disentrainment, showing both similarities and dissimilarities with these descriptions. These include the formulation of Kirkby and Statham (1975), which involves a particle energy balance assuming a Coulomb-like friction behavior, and the formulation of Gabet and Mendoza (2012), which starts with a particle momentum balance involving gravity, Coulomb friction and collisional friction. We then consider elements of the probabilistic formulation presented by Furbish and Haff (2010), Furbish and Roering (2013), and Doane et al. (2018), which assumes a fixed disentrainment rate determined by the local surface slope for given surface roughness.
We emphasize that this initial phase of our work on rarefied particle motions is aimed at clarifying how particle disentrainment works. With this in place we will be positioned to consider effects of rarefied transport over timescales spanning many transport events, including ensemble-averaged particle fluxes and changes in land-surface elevation as described by formulations of nonlocal transport. As a step in this effort we show in the second companion paper (Furbish et al., 2021a) that the theory in this first paper is entirely consistent with data from laboratory and field-based experiments involving measurements of particle travel distances on rough surfaces. These include data reported by Kirkby and Statham (1975), Gabet and Mendoza (2012), DiBiase et al. (2017), and Roth et al. (2020) and new travel distance data from laboratory experiments supplemented with highspeed imaging and audio recordings that highlight effects of particle-surface collisions. Outstanding questions concern how particle size and shape in concert with surface roughness influence the extraction of particle energy and the likelihood of deposition.
In the third companion paper (Furbish et al., 2021b) we show that the generalized Pareto distribution in this problem is a maximum entropy distribution (Jaynes, 1957a, b) constrained by a fixed energetic "cost" -the total cumulative energy extracted by collisional friction per unit kinetic energy available during particle motions. That is, among all possible accessible microstates -the many different ways to arrange a great number of particles into distance states where each arrangement satisfies the same fixed total energetic costthe generalized Pareto distribution represents the most prob-able arrangement. Because this idea applies equally to the accessible microstates associated with net cooling, isothermal conditions and net heating, the fixed energetic cost provides a unifying interpretation for these distinctive behaviors, including the abrupt transition in the form of the generalized Pareto distribution in crossing isothermal conditions. The analysis therefore represents a novel generalization of an energy-based constraint in using the maximum entropy method to infer non-exponential distributions of particle motions.
In the fourth companion paper (Furbish and Doane, 2021) we step back and examine the philosophical underpinning of the statistical mechanics framework for describing sediment particle motions and transport. Specifically, the analyses presented in the first three companion papers provide an ideal case study for highlighting three key elements of this framework: the merits of probabilistic versus deterministic descriptions of sediment motions; the implications of rarefied versus continuum transport conditions; and the consequences of increasing uncertainty in descriptions of sediment motions and transport that accompany increasing length scales and timescales. We use the analyses to illustrate the mechanistic yet probabilistic nature of the approach, highlighting the idea that the endeavor is not simply about adopting theory or methods of statistical mechanics "off the shelf" but rather involves appealing to the style of thinking of statistical mechanics while tailoring the analysis to the process and scale of interest. Under rarefied transport conditions, descriptions of the particle flux and its divergence pertain to ensemble conditions involving a distribution of possible outcomes, each realization being compatible with the controlling factors. When these factors change over time, individual outcomes reflect a legacy of earlier conditions that depends on the rate of change in the controlling factors relative to the intermittency of particle motions. The implication is that landform configurations and associated particle fluxes reflect an inherent variability ("weather") that is just as important as the expected ("climate") conditions in characterizing system behavior.

Continuous form
Following the presentations of Furbish and Haff (2010) and Furbish and Roering (2013), let f r (r; x) denote the probability density function of particle travel distances r whose motions begin at position x. By definition the cumulative distribution function is where the prime denotes a variable of integration. In turn, the exceedance probability, also referred to as the survival function, is R r (r; x) = 1 − F r (r; x) = ∞ r f r (r ; x)dr . (2) With these definitions in place we now define the spatial disentrainment rate as which is a conditional probability per unit distance. Namely, upon multiplying both sides of Eq. (3) by dr, then P r (r; x)dr = f r (r; x)dr/R r (r; x) is interpreted as the probability that a particle will become disentrained within the small interval r to r + dr, given that it "survived" travel to the distance r. The disentrainment rate P r (r; x) also may be interpreted as an inhomogeneous Poisson rate (Feller, 1949). Now, using the fact that f r (r; x) = −dR r (r; x)/dr, one may deduce from Eq. (3) that the probability density f r (r; x) is given by f r (r; x) = P r (r; x)e − r 0 P r (r ;x)dr .
Thus, according to Eq. (4), the disentrainment rate P r (r; x) completely determines the probability density f r (r; x) of travel distances r.
Assuming particle motions occur only in the positive x direction, the entrainment form of the volumetric particle flux is where E s (x) denotes the volumetric entrainment rate at position x. In turn, letting ζ (x, t) denote the local land-surface elevation, the entrainment form of the Exner equation is (Tsujimoto, 1978;Nakagawa and Tsujiomoto, 1980) where c s = 1 − φ s is the particle volumetric concentration of the surface with porosity φ s . These probabilistic formulations of the flux and the Exner equation have three lovely properties. They are mass conserving, they are nonlocal in form, and they are scale independent in that no length constraints are imposed on the density f r (r; x) (Furbish and Haff, 2010;Furbish and Roering, 2013). They illustrate that the probability density f r (r; x) of particle travel distances r and its related survival function R r (r; x) form the centerpiece of describing mass conservation and the particle flux. In turn, the significance of the disentrainment rate P r (r; x) becomes clear. This rate connects Eqs. (5) and (6) to the physics of particle motions on a hillslope. That is, this rate, together with the entrainment rate E s (x), represents the elements in the formulation that can be elucidated by physics.
To date, previous formulations of the disentrainment rate P r (r; x) have envisioned a friction-dominated behavior in which the land-surface slope S(x) = |∂ζ (x)/∂x| has a primary role (Furbish and Haff, 2010;Furbish and Roering, 2013;Doane, 2018;Doane et al., 2018;Sect. 5.3). The disentrainment rate is specified as a function of the landsurface slope at the position of entrainment, with the idea that the slope changes over a distance much larger than the average particle travel distance. That is, P r (r; x) is assumed to be fixed and determined locally by the slope S(x) at position x such that the distribution of travel distances of particles entrained at x is exponential with mean µ r [S(x)]. As the land-surface slope S varies with increasing downslope distance x, the mean µ r [S(x)] changes. The disentrainment rate is qualitatively consistent with limiting cases; namely, it yields a fixed small average travel distance at zero slope, and it approaches zero in the limit of a steep critical slope beyond which disentrainment does not occur. However, the mechanical elements of the disentrainment rate P r (r; ) are otherwise not explicitly specified. We also note that Kirkby and Statham (1975) first pointed out the relation between the distribution of travel distances and the disentrainment rate function. These authors defined a posteriori the disentrainment rate from an assumed exponential distribution of travel distances whose mean value is expressed in terms of a Coulomb-like description of particle friction (Sect. 5.1).

Discrete form
It is valuable to recast the ideas of disentrainment above in discrete form. The motivation is this. Instead of trying to formulate a continuous disentrainment rate function that is generally applicable to the entirety of a hillslope, we instead break it into discrete spatial intervals, where certain physics may be more or less important in some intervals than in others. This gets us closer to the physical ingredients of disentrainment that are occurring at different locations on a hillslope, where the mechanical behavior at a location transitions to another behavior in the downslope direction. We may then combine the intervals together as a whole.
Let k = 1, 2, 3, . . . denote a set of discrete intervals of length dr. Let p denote the probability that a particle is disentrained within the first interval (k = 1). If N denotes a great number of particles, then the number of particles n(1) disentrained within the first interval is n(1) = Np. Because q = 1 − p is the probability that a particle is not disentrained within the first interval, then the number of particles moving beyond the first interval is N q = N (1 − p). That is, this is the number of particles that "survived" without being disentrained within the first interval. In turn, of the number of particles that survived, the number that is disentrained within the second interval is n(2) = N(1 − p)p. More generally, n(k) = N(1 − p) k−1 p. Dividing this by N then gives the probability mass function which defines the well-known geometric distribution with mean µ k = 1/p. Note that the probability p is taken here as being fixed. That is, in this formulation, the probability that a particle survives the kth interval is (1 − p) k−1 , so the disentrainment probability is constant, namely, P k (k) = p. The geometric distribution, Eq. (7), is the discrete counterpart of the exponential distribution. Here we relate the two. The cumulative distribution function of Eq. (7) is F k (k) = 1− (1 − p) k . We may thus write F r (r = kdr) = F k (k) = 1 − q k . The quantity q k is a memoryless geometric series, and because q ≤ 1 we may write q = e −dr/µ r , where µ r is a characteristic distance. In turn, then, F r (r) = 1−(e −dr/µ r ) k = 1− e −r/µ r . Finally, f r (r) = dF r (r)/dr = (1/µ r )e −r/µ r , where it becomes clear that µ r is the mean of the exponential distribution, analogous to µ k for the discrete counterpart. Also note that the disentrainment rate P r (r) = 1/µ r is fixed. Below we show that the exponential and geometrical distributions represent isothermal conditions, where gravitational heating of particles is balanced by frictional cooling.
In contrast, suppose that the probability of disentrainment p varies from one interval k to another. Here we generalize the ideas above. Let p k denote the probability that a particle, having not been disentrained before the kth interval, then becomes disentrained within this interval. Similar to the formulation above, the number of particles n(1) within the first interval is n(1) = N p 1 and the number moving beyond the first interval is N(1 − p 1 ). In turn the number of particles disentrained within the second interval is n(2) = N(1−p 1 )p 2 , the number disentrained within the third interval is N(1−p 1 )(1−p 2 )p 3 , and so on. In general, n(k) = N(1 − p 1 )(1 − p 2 )(1 − p 3 ) . . . (1 − p k−1 )p k . Dividing this expression by N then gives Note that if p k = p is fixed, then Eq. (8) reduces to Eq. (7). This generalization has a lovely property. Namely, by definition it conserves probability, and it therefore is mass conserving. That is, the sum of f k (k) over all possible k is equal to unity, regardless of how p k might vary with k. As alluded to above, the physics of each p k may be treated differently if desired. Moreover, like its continuous counterpart presented above, this discrete formulation of mass conservation is nonlocal and scale independent.

Brief preview
Our next objective is to illustrate the mechanical elements of disentrainment, which we then use to elaborate the contin- Figure 2. Definition diagram of surface inclined at angle θ and control volume with edge length dx through which particles move. uous and discrete cases described in the preceding sections. The main ingredients of the theoretical analysis involve the particle mass balance and the particle kinetic energy balance. Let N denote a great number of particles whose motions started at position x = 0 such that the particle travel distance r → x. We then show that the particle mass balance is given by dN(x)/dx ∼ −N/E a , where E a is the ensemble-averaged kinetic energy of moving particles. In turn we show that for a uniform slope angle and surface roughness the average energy varies as E a ∼ Ax+B, where A and B are the shape and scale parameters of the generalized Pareto distribution. These parameters are related to particle and surface conditions (e.g., particle size and shape, surface slope and roughness) and the initial particle energy state at x = 0. We then show that the disentrainment rate P x (x) = −(1/N )dN/dx = 1/(Ax + B), which, using Eq. (4), leads to the generalized Pareto distribution. When A < 0, this distribution is bounded; when A > 0, it is heavy-tailed; and when A = 0 such that P x = 1/B is a fixed value, the distribution reduces to an exponential form. Deposition is an inhomogeneous Poisson process for A = 0 and it is homogeneous for A = 0. We also illustrate the discrete counterpart to these results, as outlined in Sect. 2.2, including the situation of nonuniform surface conditions.

Conservation of mass
Consider a rough, inclined surface with uniform slope angle θ (Fig. 2). At this juncture we simplify the notation and consider the motions of particles entrained at a single position x = 0. Now the particle travel distance r → x and the probability density function f r (r; x) → f x (x). Consider a control volume with edge length dx parallel to the mean particle motion. Over a period of time a great number of particles enter the left face of the control volume. Some of these particles move entirely through the volume, exiting its right face, and some come to rest within the control volume. Many, but not necessarily all, of the particles interact with the surface one or more times in moving through the volume or in being disentrained within it. We now imagine collecting this great number of particles and treating them as a cohort, independent of time (Appendix B). That is, let N(x) denote the number of particles that enter the control volume, and let N(x + dx) denote the number that leaves the volume. We may imagine for the purpose of visualizing the problem that the N(x) particles enter the control volume at the same time, but this actually is not essential. Similarly, we may imagine that the N(x + dx) particles exit the control volume at approximately the same time, but again, this reference to time only is a means to envision particle motions (Appendix B). The number of particles disentrained within the control volume then If N(0) denotes the great number of particles whose motions started at position x = 0, then the exceedance probability R x (x) (analogous to R r (r; x) above) is R x (x) = N(x)/N (0). Then dN = −N(0)f x (x)dx and the spatial disentrainment rate P x (x) (analogous to P r (r) above) is Our objective is to determine the derivative dN(x)/d in relation to particle energy, as this derivative represents disentrainment. Here we summarize the essence of this problem before turning to a description of conservation of energy. Let E p = (m/2)u 2 denote the translational kinetic energy of a particle with mass m and downslope velocity u. Here we are assuming that the total translational energy is dominated by downslope motion. Let f E p (E p , x) denote the probability density function of particle energies E p as these vary with position x. For a great number N of particles the number density is n E p (E p , x) = N f E p (E p , x). Let p(E p , x) denote the probability that a particle at energy state E p will become disentrained within the small interval x to x + dx. Because N f E p (E p , x)dE p is the number of particles within the small interval E p to E p +dE p , then Np(E p , x)f E p (E p , x)dE p is the number of particles in this energy interval that becomes disentrained. The total number of particles that becomes disentrained within the interval x to x + dx is then Letting angle brackets denote an ensemble average, then according to the law of the unconscious statistician, Eq. (10) is simply dN(x) = −N(x) p(E p , x) . Below we introduce the expected number of particle-surface collisions per unit distance n x = 1/λ, where λ is the expected travel distance between successive collisions. We then show that dN(x)/dx = −N(x)n x p(E p , x) . Thus, the essence of the problem is to determine the averaged probability p(E p , x) as this depends on particle energy E p . This in turn requires specifying the particle energy as this varies with position x.

Particle energy
Our focus on conservation of particle energy versus momentum is aimed at defensible simplicity. Namely, particle motions down a rough hillslope surface involve numerous details that control momentum exchanges during particlesurface interactions. As a scalar quantity, energy forces us to blur our eyes appropriately, focusing on the essence of these complex interactions rather than attempting to describe details of momentum exchanges that ultimately cannot be constrained given the stochastic nature of the phenomenon.
As an example, below we introduce the random variable β x to represent the proportion of downslope kinetic energy extracted during a particle-surface collision. This quantity blurs over many details (e.g., differences between collisions during rolling, tumbling and bouncing motions, rotational versus translational motion, and the roles of normal and tangential coefficients of restitution), yet β x is entirely meaningful when treated as a random variable. (In Appendix E we provide a description of how the energy-centric quantity β x is related to momentum exchanges during collisions, and in the companion paper we illustrate the elements of β x using highspeed imaging.) In contrast, when describing the collisional behavior of an ideal granular gas, one can at lowest order appeal to a single coefficient of restitution because of the relative simplicity of the particles and their collisions (e.g., Haff, 1983;Jenkins and Savage, 1983). This simplicity is not possible here. The focus on energy thus offers tractable and defensible simplicity amidst the messiness of natural hillslopes. We start our formulation with a general statement concerning conservation of the kinetic energy of a system of particles. Because of its familiarity in relation to studies of granular gas systems, we initially consider changes with respect to time, then return to changes with respect to space as in the preceding section. Namely, let E p denote the kinetic energy of a particle, and let E p denote the expected energy state, where angle brackets represent an ensemble average over a great number N of moving particles. The total energy of the system is E = N E p . Neglecting transport of energy over space, the rate of change in the total energy of the system with respect to time is then The first term on the right side of Eq. (11) represents the rate of change in the average energy state of N moving particles and thus describes either a net heating (d E p /dt > 0) or cooling (d E p /dt < 0) of the system, depending on the relative contribution of the sources of each. The second term on the right side represents the rate of change in the number of moving particles with average energy state E p and thus describes the rate of change in the total energy due to either the addition or loss of moving particles. For a closed system, this represents either a net sublimation (dN/dt > 0) or net deposition (dN/dt < 0) of particles, depending on the relative contribution of each. The first term on the right side of Eq. (11) has been studied extensively for granular gas systems, specifically in relation to the "homogeneous cooling state" of a closed system as described by Haff's cooling law (Haff, 1983;Brilliantov and Pöschel, 2004;Dominguez and Zenit, 2007;Volfson et al., 2007;Brilliantov et al., 2018;Yu et al., 2020). In what follows, we start with similar concepts of particle energy, but the formulation is designed to be independent of time and focused on changes in energy and particle disentrainment over space.
Reconsider a control volume with edge length dx parallel to the mean motion of particles over a rough, inclined surface (Fig. 2). Analogous to Eq. (11) we write where now the angle brackets formally denote a Gibbs ensemble average over a cohort of particles (Appendix B). As described below, the first term on the right side of Eq. (12) represents the spatial rate of change in energy due to the sum of gravitational heating and frictional cooling. The second term on the right side represents the rate of change in energy due to deposition, that is, disentrainment. In this problem, we assume that sublimation (entrainment) does not occur over x > 0. Equation (12) provides a basic starting point. However, it is not particularly useful in this form. If in fact the probability of deposition varies with energy state, then in general the derivative dN/dx contributes to the derivative d E p /dx, as removal of energy by deposition affects the average energy of the remaining particles. We note that Brilliantov et al. (2018) demonstrate an analogous effect, as described below, associated with aggregation of particles in a granular gas. We therefore must be careful in formulating a statement of conservation of particle energy, as deposition preferentially involves particles at low-energy states.

Total energy
Focusing just on slope parallel motions, let E p = (m/2)u 2 denote the translational kinetic energy of a particle with mass m and downslope velocity u. Then let f E p (E p , x) denote the probability density function of particle energies E p as these vary with downslope position x (Appendix B). For a great number N of particles the number density is n E p (E p , x) = N f E p (E p , x). The average particle energy is The total energy E(x) = N E p , so We now take the derivative of Eq. (14) with respect to x using Leibniz's rule to give The derivative within the integral of Eq. (15) satisfies a Fokker-Planck equation (see next section and Appendix C), the solution of which represents the evolution of the distribution n E p (E p , x) of particle energy states E p with distance x.
In particular this derivative has three parts. The first part, denoted below by K h (E p , x), is associated with a change in the density n E p (E p , x) due to gravitational heating. The second part, K c (E p , x), is associated with a change in this density due to frictional cooling. The third part, K d (E p , x), is associated with a loss of energy due to deposition (which does not involve the analogue of release of latent heat; but see below). We thus write and then rewrite Eq. (15) as The next task consists of showing the correspondence of K h (E p , x), K c (E p , x) and K d (E p , x) to terms in the Fokker-Planck equation, then describing the physical elements of these terms. This is followed by evaluating each of the integral quantities in Eq. (17). There are a lot of moving parts in this formulation, so bear with us.

Fokker-Planck-like equation
The density n E p (E p , x) within Eqs. (15) and (16) satisfies a Fokker-Planck equation (Appendix C), which describes the evolution of this density with increasing distance x. Namely, 546 D. J. Furbish et al.: Rarefied particle motions -Part 1: Theory The first term on the right side of Eq. (18) represents advective gravitational heating, where k 1h (E p , x) is a drift speed, the average spatial rate of change in particle energy over the energy domain due to heating. The second term on the right side represents advective frictional cooling, where k 1c (E p , x) is a drift speed, the average spatial rate of change in particle energy due to cooling. The third term represents diffusive frictional cooling, where k 2c (E p , x) is a diffusion coefficient. The last term represents a loss of energy due to deposition, where for now we have retained the notation from above. Explicitly, for K h (E p , x) and K c (E p , x) we now have and In the next section we step through gravitational heating, frictional cooling and deposition, in each case unfolding the mechanical elements of k 1h (E p , x), k 1c (E p , x), k 2c (E p , x) and Here is a didactic sidebar if the formulation above seems counterintuitive. Notice that Eq. (18) effectively represents an advection-diffusion equation with two advective terms, a diffusive term and a sink term. Normally we think of an advection-diffusion equation as involving space and time, that is, where the rate at which a quantity changes with respect to time at a given position is equal to the sum of an advective term and a diffusive term involving derivatives of the quantity with respect to space. Indeed, imagine replacing E p with x, and x with t, in Eq. (18). The result looks like a familiar advection-diffusion equation with a sink term (albeit involving two advective terms rather than one). The basic idea of Eq. (18) is the same. It just describes the rate of change in n E p with respect to position x (rather than time t) in relation to advection and diffusion of n E p occurring over the energy coordinate E p (rather than x). A consideration of the rate of change with respect to position x as in Eq. (18) is perhaps unusual, but the idea of advection and diffusion of a quantity occurring over a domain other than a spatial coordinate (e.g., a velocity coordinate) is common in statistical physics, of which examples pertaining to sediment motions include those presented in Furbish et al. (2012Furbish et al. ( , 2018a.

Gravitational heating
We start by noting that the rate at which the potential energy of a particle is converted to kinetic energy per unit distance x is mg sin θ. To be clear, between collisions a particle that is not in contact with the inclined surface beneath it accelerates vertically at a rate of −g, independently of the orientation of the surface. The factor sin θ therefore is a geometrical constraint on the magnitude of the potential energy that is accessible for net heating when viewed with respect to x. This means that (Appendix C) so that Eq. (19) becomes We now write the first integral in Eq. (17) as Because ∂(E p n E p )/∂E p = E p ∂n E p /∂E p +n E p , Eq. (23) may be written as Assuming n E p (∞, x) → 0, the second integral in Eq. (24) vanishes and the first integral in Eq. (17) becomes Note that the form of the density n E p (E p , x) is immaterial in this formulation. If for illustration we assume that no cooling or deposition occurs, then dE(x)/dx = N mg sin θ . The solution of this is E(x) = E(0) + N mg sin θ x, where E(0) denotes the starting energy at x = 0. That is, the total kinetic energy E(x) increases linearly with downslope distance x. Moreover, for reference below, no particle can be heated to an energy greater than E p (0) + mg sin θ x, representing a complete conversion of gravitational to kinetic energy without any loss due to particle-surface collisions. This ensures that the density n E p (E p , x) is bounded with finite mean and variance, a point that becomes useful below.

Frictional cooling
We start by assuming that a change in the downslope energy of a particle associated with a collision is E p = −β x E p , so that β x = − E p /E p is the proportion of energy extracted by the collision (Appendix E). This is akin to the dissipation factor introduced by Quartier et al. (2000). By definition β x is a random variable. (Note that the negative sign above is by convention. As a random variable we are assuming that 0 ≤ β x ≤ 1. The sign associated with β x will be clear from the context in the developments below.) The change E p includes frictional loss, any conversion of translational to rotational energy, and any apparent change when downslope incident motion is reflected to transverse motion during a glancing particle-surface collision. Note that E p generally is a negative quantity. But strictly speaking it could be positive, albeit with low probability, if transverse incident motion is reflected to downslope motion during a collision. Because E p and β x are random variables, E p is a random variable. As a point of reference, in granular gas theory where the total translational energy is considered rather than just the energy associated with one coordinate direction, the proportion β x = 1 − 2 where is the normal coefficient of restitution (Haff, 1983). Normally is treated as a fixed deterministic quantity, although recent efforts have treated this quantity as a random variable (Gunkelmann et al., 2014;Serero et al., 2015). Here, in contrast, collision mechanics theory suggests that the constitution of β x is far more complicated in relation to normal, tangential and rotational impulses during particlesurface collisions (Appendix E).
Let q = E p (x + dx) − E p (x) denote a change in the energy of a particle over the small distance dx. Then as described in Appendix C, the drift speed k 1c (E p , x) = dq/dx ≈ n x β x E p and the diffusion coefficient k 2c = dq 2 /dx ≈ n x β 2 x E 2 p , where the overline denotes an average over particles at the energy state E p (rather than an ensemble average), and n x = 1/λ denotes the expected number of particle-surface collisions per unit distance where λ is the expected travel distance between collisions. Scaling (Appendix D) shows that where φ is the expected reflection angle of a particle with energy E p following a surface collision. We now assume that and that Now Eq. (20) becomes We now use these results to write the second integral in Eq. (17) as Upon applying the product rule to the derivative ∂(E p β x n E p )/∂E p , the first integral in Eq. (30) may be written as Assuming that n E p (∞, x) → 0, the second integral in Eq. (31) vanishes and the first integral becomes N β x , where the angle brackets now represent an ensemble average.
In turn, upon applying the product rule to the derivative ∂[E p ∂(β 2 x E p n E p )/∂E p ]/∂E p , the second integral in Eq. (30) may be written as Thus, whereas the diffusive term in Eq. (18) redistributes energy by modifying the density n E p (E p , x) (see below), it does not contribute to the total energy balance. The second integral in Eq. (17) is thus We return to these results below.

Energy loss with deposition
For illustration, suppose initially (unrealistically) that deposition is independent of the particle energy state E p . This means that the number of particles disentrained within any small energy interval E p to E p + dE p is a fixed proportion k d of the particles within this interval. Thus, If we momentarily assume that no heating or cooling occurs, where E(0) denotes the starting energy at x = 0. That is, the total energy E(x) decays exponentially with downslope position x. In this example, note that the form of the density n E p (E p , x) is immaterial. Moreover, as a point of reference we may momentarily equate the left side of Eq. (18) with the last term in this equation and write Thus, comparing this result with Eq. (12), the situation in which deposition is independent of the particle energy state is consistent with isothermal conditions wherein the average energy state is unchanging, that is, d E p /dx = 0. More generally, deposition is unlikely to be independent of the particle energy state, as particles with small energy are on average more likely to become disentrained than are particles with large energy. Thus, K d (E p , x) likely possesses a more complicated form than in the example above. Whereas early work on granular gases focused on their behavior in the absence of deposition, the phenomenon of thermal collapse, condensation and freezing in a gravitational field now is receiving significant attention (Volfson et al., 2006;Kachuck and Voth, 2013). We can lean on insight from this work, but because energy dissipation in a granular gas is dominated by particle-particle collisions rather than particle-boundary collisions, the rarefied problem considered here is quite different. As with approaches used in the study of condensation and freezing of granular gases, our analysis at this stage is aimed at lowest-order behavior.
For any position x, we do not know the ensemble distribution f E p (E p , x) of particle energy states E p with expected value E p . Because no particle can be heated to an energy greater than E p (0) + mg sin θ x (representing a complete conversion of gravitational to kinetic energy without any loss due to particle-surface collisions), we know only that 0 ≤ E p ≤ E p (0) + mg sin θ x. Most energies likely are significantly smaller than the upper limit due to collisions.
Collecting results from above, the density n E p (E p , x) satisfies a Fokker-Planck-like equation, namely, where we are assuming for simplicity that β x and β 2 x are fixed. As a reminder, the first term on the right side of Eq. (36) represents gravitational heating, and the second and third terms on the right side represent frictional cooling. The term −K d (E p , x), which describes the loss of energy associated with deposition, is defined below explicitly in terms of a deposition length scale.
Let n E p0 and E p0 denote suitable characteristic values of the density n E p and the energy E p , and let X denote a characteristic length scale. We now define the following dimensionless quantities denoted by circumflexes: Upon substituting these quantities into Eq. (36), we may identify three characteristic length scales, namely, The first of these, X h , represents the distance required to heat a particle to the energy state E p0 in the absence of frictional cooling. The second, X cA , represents the distance over which thermal collapse by advective cooling occurs. The third, X cD , represents a distance over which diffusive cooling occurs. We now define two dimensionless numbers: the Kirkby number 1 , and a cooling Péclet-like number, The Kirkby number Ki is the ratio of gravitational heating to advective cooling. The Péclet-like number Pe c is the ratio of advective cooling to diffusive cooling. Choosing X cA as the characteristic length scale and neglecting the deposition term in Eq. (36), we now rewrite it as Note that with β x 1, then Pe c 1 according to Eq. (42), such that the diffusive term in Eq. (43) becomes insignificant relative to the advective cooling term.
With reference to Fig. 3, imagine a great number of particles whose initial energy states at x = 0 are described by the density n E p (E p , 0). With just gravitational heating, this distribution is advected to higher energy values at a fixed rate mg sin θ . With just frictional cooling, but in the absence of diffusion, the distribution is advected to lower energy values at a fixed rate mgβ x cos θ/4 tan φ. If gravitational heating is balanced by advective cooling (Ki = 1), the form of the distribution remains fixed with increasing distance x. With diffusive cooling, advective cooling of the density n E p (E p , x) to lower energy values involves smoothing of this density. When these effects are combined, whether heating is greater than advective cooling (Ki > 1) or vice versa (Ki < 1), no value of E p is larger than E p (0) + mg sin θ x, and most values are significantly less than this maximum due to the increasing likelihood of particle-surface interactions (cooling) within increasing x. When the magnitude of the term in Eq. (43) involving Ki is greater than the sum of the magnitudes of the two cooling terms, then net heating occurs. When the magnitudes of the cooling terms are larger than the heating term, then net cooling occurs. For particles reaching relatively small energy states, there is an increasing likelihood of deposition (see below). As a reminder, this description does not pertain to the energy states of a great number of particles during an interval of time. Rather, this description pertains to an ensemble of particles reaching any position x over a long period of time when treated as a cohort. That is, n E p (E p , x) is the density of particle energies at any x representing the great number of particles that occupied this position while in motion at many previous instants in time.
We now offer a simple hypothesis describing the loss of energy associated with deposition. Recall that X cA is a measure of the distance over which particles with energy E p0 thermally collapse by frictional cooling. We may imagine, for example, a sudden removal of the source of heating such that X cA is a measure of the distance of relaxation to a total loss of energy. For particles with energy E p , this length scale can be expressed more generally as Figure 3. Schematic diagram of downslope changes in the distribution f E p (E p , x) of particle energy states E p (for simplicity a uniform distribution) due to the following: (a) gravitational advective heating in the absence of cooling; (b) advective frictional cooling in the absence of heating; and (c) net cooling. Arrows represent displacement occurring over a small downslope interval dx. The triangle represents an idealized situation in which, with net cooling, the likelihood of deposition increases with decreasing particle energy E p and decreases with increasing energy. Note that an effect of deposition with heating or cooling is to increase the average energy E p by culling lower-energy particles, thereby selecting higher-energy particles for continued travel with increasing distance.
length in the presence of heating. As described below, the factor α is assumed to be a function of the Kirkby number. Substituting Eq. (45) into Eq. (17) and evaluating the integral then yields where we now redefine the Kirkby number as assuming that β x is independent of E p . Comparing this result with Eq. (33), the energy loss rate due to deposition is the same as the advective cooling rate but modulated by the factor α.

Conservation of mass revisited
The preceding material provides the basis for the next step, namely, calculating the disentrainment rate wherein effects of particle deposition on the energy balance are taken into account. Because n E p (E p , x)dE p represents the number of particles within the small energy interval E p to E p + dE p , using Eqs. (44) and (45) the total disentrainment rate is therefore Thus, the disentrainment rate is proportional to the cooling rate, as it should be. Here it is important to note that the expected value 1/E p = 1/ E p . In fact, 1/E p is the reciprocal of the harmonic mean (Appendix F). This means that E p 1/E p ≥ 1. Only in the limit where n E p (E p , x) has zero variance does 1/E p → 1/ E p . To simplify the notation, hereafter we denote the arithmetic mean as E p = E a and the harmonic mean as 1/ 1/E p = E h . Thus E a /E h ≥ 1. As a point of reference we may now define an ensembleaveraged deposition length as with µ = β x /4 tan φ. Note that in contrast to the energyspecific length scale l c in Eqs. (44) and (45), L c in Eq. (49) is keyed to the harmonic average energy of the ensemble. Setting θ = 0 so that cos θ = 1, the length scale L c is entirely analogous to the length scale λ 0 used by Furbish and Haff (2010), Furbish and Roering (2013), and Doane et al. (2018) as the characteristic particle travel distance on a flat surface, thence modulated with increasing slope S (see also Sect. 5.3).
The factor α has a key role in the formulation. As described above, this factor modulates the length scale L c in the presence of gravitational heating. Note that Eq. (48) is equivalent to For a given value of α the length scale L c is set by the cooling rate, and this length scale increases with increasing slope angle θ . But gravitational heating also increases with θ , the effect of which is to suppress the rate of deposition and increase L c . That is, the deposition length scale is not the same as the cooling length scale. As described below, whereas l c is a measure of the rate of extraction of translational energy, this includes its conversion to rotational energy whose effect is to decrease the likelihood of stopping. On dimensional grounds an inspired guess suggests that this effect is a function f (Ki) of the Kirkby number Ki. For example, suppose that where µ 1 is a coefficient of order unity. This leads to where α → α 0 as µ 1 Ki → 0. Now, This example suggests that L c → ∞ as µ 1 Ki → 1. That is, µ 1 Ki → 1 sets an upper limit above which deposition is insignificant. More generally we may write to indicate the possibility of other dependencies of α on Ki.
Note that we provide evidence for this behavior in the companion paper, including the form of Eq. (52) based on experiments. For notational simplicity in subsequent sections, we use α with the understanding that this implies α = α 0 f (Ki).
Here is a key sidebar for reference in our descriptions below of related formulations. We emphasize that according to Eqs. (45) and (48) the deposition rate is proportional to the advective cooling rate rather than the net cooling rate (the difference between the rates of heating and cooling), where the rate of heating then modulates the deposition rate, therein increasing the deposition length scale. Moreover, the deposition rate explicitly depends on the energy state of the particles. Consider a thought experiment. Let us imagine a system consisting of a box containing a finite number of particles. Suppose that we mechanically add energy to the system such that some proportion of the particles becomes a rarefied granular gas, and suppose that the gas achieves a nonequilibrium steady state with a specific average energy state (Appendix G). This means that the rate of (mechanical) heating is equal to the rate of cooling due to dissipative particlebox collisions, and sublimation (entrainment) matches deposition (disentrainment). That is, depending on the energy state of the particles, deposition occurs even though the difference between the rate of heating and cooling is zero. Now imagine that when a particle is deposited in our idealized box, it cannot become re-entrained (which is analogous to the hillslope system). The rate of heating and cooling of the remaining gas particles is still the same, yet the deposition process continues for those particles which, by chance, cool to sufficiently low energies for deposition to occur. Thus, we are assuming that the deposition rate is proportional to the cooling rate rather than the net cooling rate, depending on the energy state of the particles. The effect of heating is to decrease the likelihood of deposition by decreasing the proportion of particles that cool to sufficiently low energies for deposition to occur -which translates to suppressing the disentrainment rate and increasing the length scale of deposition. As outlined in Sect. 5 below, this effect is absent in previous formulations of particle disentrainment.

Energy and mass balances
We now collect results from above to summarize effects of the energy and mass balances. With µ = β x /4 tan φ the total energy balance is given by To summarize, the first term on the right side of Eq. (55) is due to gravitational heating, the second term is due to frictional cooling, and the last term represents a loss of energy due to deposition. Note that none of these terms explicitly involves the energy E(x). In turn, conservation of mass is given by This is coupled with Eq. (55) via the relation between the total energy E(x), the average energy E a and the harmonic average energy E h (see below), as well as the explicit appearance of N in both of these equations.
At this point we emphasize that the quantity µ = β x /4 tan φ is not to be interpreted as Coulomb-like dynamic friction coefficient. Indeed, the product mgµ cos θ in Eqs. (55) and (56) looks like an ordinary Coulomb friction force (e.g., Kirkby and Statham, 1975;Gabet and Mendoza, 2012). Recall, however, that cos θ enters from the geometry of particle motions and does not represent the angle needed to specify the normal component of the weight mg. Similarly, tan φ is an expected reflection angle, not a friction angle. We elaborate these points below.
To close the circle in reference to our stating point, Eq. (12), we now combine Eqs. (12), (55) and (56) to give This balance involving the average energy E a rather than the total energy E reveals an important behavior associated with deposition, centered on the parenthetical part of the last term. Namely, it is straightforward to show (Appendix F) that E a /E h −1 ≥ 0. The last term in Eq. (57) therefore represents an apparent heating associated with deposition. With reference to Fig. 3, a net advective cooling uniformly lowers all particle energy states, thus lowering the average energy E a as well as the total energy E. As this cooling lowers all energy states, some particles enter the range where deposition occurs, and the deposition rate therefore is proportional to the net advective cooling rate. In the absence of a net advective cooling, particles with small energy nonetheless are preferentially disentrained, so the average energy state increases. When cooling and deposition are combined, the average energy decreases more slowly than it otherwise would in the absence of deposition. This effect increases with increasing variance in the distribution of energies (Appendix F), and it vanishes as the variance goes to zero. The balance described by Eq. (57) thus provides a formal description of what we intuitively know: deposition culls lower-energy particles, thereby selecting higher-energy particles for continued travel with increasing distance. We note that Brilliantov et al. (2018) demonstrate an analogous unexpected behavior of granular gases, namely, the heating of a granular gas associated with particle aggregation with continued loss of total energy. This occurs when the rate of loss of particles by aggregation exceeds the rate of loss of total energy, such that by definition the average particle energy increases. The balance described by Eq. (57) also reveals an important constraint on particle energies. Namely, if we imagine the special situation of isothermal conditions (dE a /dx = 0), then frictional cooling given by the second term on the right side of Eq. (57) must balance two sources of heating, namely, the first and third terms on the right side. This requires that either the Kirkby number Ki < 1 or, if Ki = 1, then the distribution f E p (E p , x) of energies E p must have zero variance 552 D. J. Furbish et al.: Rarefied particle motions -Part 1: Theory such that E h = E a . Because this latter condition is highly unlikely, an isothermal condition generally requires that Ki < 1. Conversely, net heating must occur with Ki > 1.
According to Eq. (55) or Eq. (57), for a given slope angle θ the spatial rate of net cooling (or net heating) of the ensemble is a fixed quantity in which this slope angle has a dual role. Namely, an increasing slope decreases the rate of frictional cooling by decreasing the expected occurrence of particlesurface collisions, and it simultaneously increases the rate of gravitational heating. With θ = 0, heating vanishes and frictional cooling occurs at a maximum rate of µmg. In turn, as θ → π/2, which represents a vertical cliff, frictional cooling vanishes and heating matches that of free-fall motion. This transition from small to large slopes nicely illustrates what virtually every undergraduate student learns intuitively from the sport of boulder rolling (or "trundling"; Forrester, 1931), and why this sport is so spectacular and satisfying in steep terrain. Moreover, recall that the Kirkby number Ki = S/µ is the ratio of gravitational heating to advective cooling. If these are balanced, Ki = 1 and Qualitatively, this is the slope at which an undergraduate student may expect that boulder rolling starts to become particularly interesting. The formulation also nicely illustrates that if the heating and cooling rates are matched, this does not imply an absence of deposition, as the last terms in Eqs. (55) and (57) may be finite with Ki = 1. Moreover, because this is a probabilistic phenomenon, some particles are likely to become disentrained even on steep, rough slopes where heating on average exceeds cooling. Experienced undergraduates indeed inform us that some boulders just do not make it all the way to the bottom of the hillslope despite their best efforts to select conditions satisfying Ki > 1.

Effects of energy and mass balances
We now show how the energy and mass balances together lead to the generalized Pareto distribution of particle travel distances. To do this we restate Eqs. (55)-(57) in dimensionless form, which clearly reveals the important role of the Kirkby number Ki. Let E a0 denote the initial average particle energy at x = 0 and let N 0 denote the initial number of particles at x = 0. In turn we define a characteristic cooling distance X = E a0 /mgµ cos θ so that E a0 = mgµ cos θ X. We now define the following dimensionless quantities denoted by circumflexes: With these definitions we write Eqs. (55)-(57) as Because the dimensionless disentrainment ratePx(x) = −(1/N)dN/dx, notice that Eq. (61) provides the basis for determining the distribution of travel distances using Eq. (4). This requires specifying howÊ h varies withx for given values of α and Ki. At this point, however, we must confront the fact that we have four unknowns,N,Ê,Ê a andÊ h , and three equations, one of which is nonlinear in the ratioÊ a /Ê h . Here we add a fourth equation by assuming that this ratio remains fixed, namely, We do not know the distribution fÊ p (Ê p ) required to determine γ (Appendix F). Nonetheless, Eq. (63) essentially assumes that the form of fÊ p (Ê p ) remains similar with distancex. This allows us to illustrate key elements of the formulation.
With the assumption of Eq. (63) we note that Eq. (61) becomes and Eq. (62) becomes Focusing initially on Eq. (65), isothermal conditions exist if dÊ a /dx = 0. We then rearrange Eqs. (62) or (65) to define a transition value of the Kirkby number, namely, If Ki < Ki * , then cooling occurs (dÊ a /dx < 0); if Ki > Ki * , then heating occurs (dÊ a /dx > 0). Recall thatÊ a /Ê h = γ ≥ 1 (Appendix F) so that γ − 1 ≥ 0. If the variance of energy states E p is zero, thenÊ a /Ê h = γ = 1, giving Ki * = 1. Thus, in this case cooling occurs with Ki < 1 and heating occurs with Ki > 1. The ratioÊ a /Ê h = γ generally increases with the variance of E p , thus decreasing Ki * . That is, as this variance increases, the transition between cooling and heating occurs at a smaller value of the Kirkby number. This represents a stronger culling (deposition) of lower-energy particles. The largest possible transition value is Ki * = 1. We now start with an idealized example that illustrates key elements of the formulation, including the coupling between Eqs. (60)-(62). Assume that the Kirkby number Ki is fixed, and assume isothermal conditions. Thus dÊ a /dx = 0 witĥ E a =Ê a0 so that Eq. (62) leads tô .
Thus, according to Eqs. (61) and (67), In turn, using Eq. (4) this yields an exponential distribution of travel distances with mean so that Note that an increasing value of γ in Eq. (69) represents an increasing proportion of lower-energy particles available for deposition relative to this availability with γ → 1, the effect of which is to decrease the mean travel distance. The total energyÊ also declines exponentially withx. Namely, substituting Eq. (70) into Eq. (60) leads tô This example of isothermal conditions illustrates that witĥ E a =Ê a0 , then according to Eq. (69) the average travel distance µx is directly proportional to the initial average energy. However, isothermal conditions are unlikely, because according to Eq. (66), such a condition requires a specific value of Ki for the ratioÊ a /Ê h = γ . We now consider the more general case involving either net cooling or net heating. As above we assume that the ratioÊ a /Ê h = γ is fixed, although the averagesÊ a andÊ h otherwise are unconstrained. Net cooling or net heating is not prescribed; either condition is allowed. Using Eqs. (60) and (61) the disentrainment rate is (Appendix H) Note that as a → 0 the disentrainment rate goes to a fixed value equal to 1/b = γ /αÊ a0 , and the distribution fx(x) of travel distancesx goes to an exponential distribution with mean µx = b = αÊ a0 /γ . A value of a > 0 (Ki > Ki * ) implies decreasing disentrainment with increasing x. A value of a < 0 (Ki < Ki * ) implies increasing disentrainment. More generally, the distribution of travel distances is a generalized Pareto distribution with position parameter equal to zero (Appendix H), namely, where now a ∈ R is interpreted as a shape parameter and b > 0 is a scale parameter (Pickands, 1975;Hosking and Wallis, 1987). The cumulative distribution iŝ and the exceedance probability iŝ For a < 1 the mean is which is independent of the ratio γ =Ê a /Ê h . This is the same as Eq. (69) for isothermal conditions, although the denominator in Eq. (77) generally is not equal to γ . In turn, Eq. (77) requires that which provides the upper limit of Ki for which the mean µx is defined. Because α > 0, this limit may be greater than one. For a < 1/2 the variance is Unlike the mean, the variance depends on γ =Ê a /Ê h . In turn, for a ≥ 1 such that the mean of fx(x) is undefined. Moreover, for a ≥ 1/2 the variance is undefined. These results reflect the heavy-tailed behavior of the generalized Pareto distribution. As a point of reference in the second companion paper (Furbish et al., 2021a), the generalized Pareto distribution defined by Eq. (74) also may be considered a generalized Lomax distribution. This distribution can be rewritten as an ordinary Lomax distribution (Appendix H). Namely, if we define the shape parameter a L = 1/a and the scale parameter b L = b/a, then Eq. (74) becomes which is a Lomax distribution with mean For a L > 0 (a > 0) the forms and behaviors of Eqs. (74) and (81) are identical. Notice, however, that if a < 0 then a L = 1/a < 0 and b L = b/a < 0 for positive b. This means that we cannot use the form of the Lomax distribution given by Eq. (81) to examine conditions involving a < 0. Yet these conditions are mechanically meaningful, so we proceed using the generalized Pareto distribution given by Eq. (74). To be clear, the ordinary Pareto distribution that is normally referred to in the literature is a special case of the generalized Pareto distribution. In turn the Lomax distribution is a special case of the Pareto distribution (and therefore of the generalized Pareto) with position parameter equal to zero. With reference to Fig. 4, for a < 0 the distribution fx(x) is bounded at a value ofx = b/|a| with a mean given by Eq. (77). This represents a condition of rapid thermal collapse. Specifically, when a < −1 this distribution monotonically increases and becomes asymptotically unbounded at x = b/|a|. In the limit of a → −1 it becomes a uniform distribution. When a = −1/2 this distribution is triangular. For −1/2 < a < 0 this distribution decays more rapidly than an exponential distribution and is bounded at the positionx = b/|a|. For a = 0, fx(x) becomes an exponential distribution, representing an isothermal condition as described above. For a > 0 the distribution fx(x) is heavy-tailed. This represents a condition of net heating. Specifically, for 0 < a < 1/2 this distribution decays more slowly than an exponential distribution, but it possesses a finite mean and a finite variance. For 1/2 ≤ a < 1 the distribution possesses a finite mean, but its variance is undefined. For a ≥ 1 the mean and variance of fx(x) are both undefined, even though this distribution properly integrates to unity. For a > 0 the tail of fx(x) decays as a power function, namely, fx(x) ∼x −(1+1/a) . The exceedance probability decays as Rx(x) ∼x −1/a . These results are summarized in Table 1. We provide evidence of all three behaviors -rapid thermal collapse, isothermal conditions, and net heating -in our second companion paper (Furbish et al., 2021a).
The formulation above assumes uniform surface conditions, specifically, uniform slope angle and roughness texture. We show below (Sect. 6) how it may be adapted to varying downslope conditions. We also note that the distribution fx(x) given by Eq. (74) can be incorporated into a mixed distribution. Indeed, a mixed distribution is the natural choice for describing the travel distances of a mixture of particle sizes, each involving a different frictional cooling behavior for a given surface roughness (Roth et al., 2020).

Elements of the average travel distance
The average travel distance given by Eq. (77) for Ki < 1 + 1/α contains all of the elements that influence particle motions except the quantity γ . Thus, whereas the average by itself does not reveal the source of variations in the form of distribution of travel distances, Eq. (74), the average nonetheless provides a focal point. Here we rewrite this average in its dimensional form, then step through the significance of its elements. Namely, with E a0 = (1/2)m u 2 0 and Ki = S/µ = (1/µ)mg sin θ/mg cos θ , For an ensemble of particles whose motions start at x = 0, the average travel distance µ x increases directly with the average starting energy E a0 ∝ u 2 0 . This is entirely akin to the formulation by Kirkby and Statham (1975) (see below), and it highlights the significance of the initial particle energy conditions at x = 0 in setting their travel distances. The archetypal example involves rockfall from cliffs followed by their motions over talus and scree slope surfaces (Fig. 1), where fall heights and initial rebounds set the initial average downslope energy. This also is a key element in experiments where initial energies are set by the choice of drop height (Kirkby ana Statham, 1975) or launch speed (Gabet and Mendoza, 2012;DiBiase et al., 2017). This aspect of the formulation also points to the significance of energetics associated with the entrainment rate E s (x) in Eqs. (5) and (6) at hillslope positions that are not necessarily as well-defined as, say, the base of a cliff (see Sect. 6).
The average travel distance µ x is inversely proportional to the rate of frictional cooling represented by mgµ cos θ . Here we reemphasize that despite its form, this expression does not represent a Coulomb-like friction. Rather, this expression enters the formulation via the characteristic length λ in setting the expected number of collisions per unit distance, n x . As described below, the surface-normal component of the particle weight does not set collisional friction; this is set by dynamic forces during collisions. Moreover, the appearance of the Kirkby number Ki in the denominator of Eq. (83) indicates that as Ki increases, the denominator becomes smaller Table 1. Behavior of the generalized Pareto distribution associated with the shape parameter a and Kirkby number Ki as illustrated in Fig. 4.

Behavior
Range of a Range of Ki Mean µx Variance σ 2 x Bounded 1 , increasing withx a < −1  (subject to the conditions that Ki < 1 + 1/α), so the average travel distance increases. We also note that, except for purely bouncing motions, it is incorrect to interpret the length λ strictly as a saltation-like distance. This is a scaling approximation (Appendix D) to show that n x must involve the average energy (∝ u 2 ) and the geometrical factor cos θ at lowest order. Notice that Eq. (83) indicates that with E a0 = (1/2)m u 2 0 the average travel distance µ x is independent of the particle mass m. Viewed in isolation, this suggests that large particles should on average travel no farther than small particles. However, this is inconsistent with what is observed in laboratory and field-based experiments (Kirkby and Statham, 1975;DiBiase et al., 2017;Roth et al., 2020) and with downslope size sorting on natural talus and scree slopes (Statham, 1976). We examine this topic in the second companion paper (Furbish et al., 2021a); here we offer a synopsis, which centers on the interpretation and significance of the quantities µ and α.
Recall that the formulation is based on the assumption that a change in translational kinetic energy E p associated with a particle-surface collision can be expressed as E p = −β x E p so that β x = − E p /E p is the proportion of the energy extracted during the collision. Both E p and β x are random variables. As described in Appendix E, in general we may write the energy balance of a particle as Here, a positive change in rotational energy E r is seen as an extraction of translational energy. This loss of translational energy with the onset of rotation may be relatively large if a collision involves stick following initial sliding due to a large normal impulse, and such a loss also may occur due to the imposed torque of friction during a collision that does not necessarily involve stick. The term f c in Eq. (84) represents losses associated with particle and surface deformation as well as work performed against friction during collision impulses (thence converted to heat, sound, etc.). But this term also includes losses associated with deformation of the surface at a scale larger than that of an idealized particle-surface impulse contact, namely, due to momentum exchanges associated with the sputtering of loose surface particles dur-ing collision. (The videos published as supplementary material to DiBiase et al. (2017) nicely illustrate this sputtering as well as the onset of rotational motion.) The term f y in Eq. (84) represents the energy loss associated with glancing collisions that produce transverse translational motions and rotation oriented differently than any incident rotation.
In some cases the change in energy E p can be expressed directly in terms of the energy state E p (Appendix E). However, the complexity of particle-surface collisions on natural hillslopes precludes explicitly demonstrating such a relation for all possible scenarios. Nonetheless, it is entirely defensible to assume that energy losses can be related to the energy state E p if the elements involved are formally viewed as random variables. Then, the simple relation E p = −β x E p is to be viewed as a hypothesis to be tested against data. This hypothesis formally enters the formulation via the right side of Eq. (58). Namely, from this relation we may write µ ∼ β x , highlighting that µ is associated with the cooling rate. In turn, particle collision mechanics (Appendix E) suggest, for example, that µ ∼ β x ∼ M(θ ), where M(θ ) involves the coefficients associated with tangential and normal impulses contributing to energy losses during collisions and depends on the slope angle θ in that the expected surface normal impact velocity varies with this angle. (In an idealized particle-surface collision these coefficients include the normal coefficient of restitution and a coefficient describing the ratio of tangential to normal impulses during the collision (e.g., Brach, 1991;Dunn, 1992, 1995).) Moreover, M(θ ) is independent of particle size.
In turn, focusing on the definition of the deposition length scale L c , Eq. (51), α may be viewed as representing a direct effect of heating described by mg sin θ , namely, to decrease the likelihood of deposition by decreasing the proportion of particles that cool to sufficiently low energies for deposition to occur -which translates to suppressing the disentrainment rate and increasing the length scale of deposition L c relative to the cooling length scale l c = E h /mgµ cos θ . Specifically, heating decreases the spatial rate of the Poisson deposition process below that which would occur in the absence of heating. In this view, µ goes with the cooling rate (not the deposition rate). But we also may write Eq. (51) as in Eq. (53). Viewed in this manner, we may define an apparent friction factor as µ 0 = µ(1 − µ 1 Ki) associated with deposition. Here again, µ is associated with the cooling rate but is then modulated by heating. We suggest in the second companion paper (Furbish et al. 2021a) that for the same particle size, α increases with increasing Ki, very likely due to a combination of increased heating and increased partitioning of translational energy to rotational motion (Dorren, 2003;Luckman, 2013) -both decreasing the likelihood of stopping and not represented in just the factor µ. We also suggest that for the same slope and surface roughness, α increases with increasing particle size, decreasing the likelihood of frictional loss with increasing rotation.
Turning to the factor γ (which does not appear in Eq. 83), recall that an increasing value (γ > 1) reflects an increasing availability of low-energy particles for deposition. Here is what we know. On the one hand, γ cannot be close to unity with randomization of motions by collisional friction, or if the initial downslope energies E p0 are not a fixed value. That is, if the distribution of energies f E p (E p , 0) has finite variance, then γ > 1. On the other hand, γ cannot be very large, or deposition would dominate over small distances without long motions that are observed -unless the Kirkby number Ki is unrealistically large. It is possible to qualitatively explore possible values of γ based on assuming different forms of the distribution of energies (and we have done this), but in the absence of knowing the specific form of the distribution, this exercise is not particularly meaningful. In the companion paper we show that fits of experimental travel distances to the theoretical distribution f x (x) are relatively insensitive to the specific value of γ selected.

Related formulations
Here we briefly examine three related formulations of particle disentrainment, focusing on the mechanical basis of this work for comparison with the formulation above. (We examine associated experiments in the companion paper.) We start with the formulations of Kirkby and Statham (1975) and Gabet and Mendoza (2012). These begin with descriptions of particle motions over time rather than space, centered on consideration of a combination of momentum and energy. We then consider elements of the probabilistic formulation presented by Furbish and Haff (2010), Furbish and Roering (2013), and Doane et al. (2018).

Kirkby-Statham formulation
In their study of particle motions on scree surfaces, Kirkby and Statham (1975) start with a statement of conservation of energy for a particle falling from height h at x = 0 onto a rough surface inclined at an angle θ . Namely, if w 0 = √ 2gh is the vertical impact velocity, then it is assumed that the initial downslope velocity on average is u 0 = w 0 sin θ . (This actually should be u 0 = w 0 sin θ with the normal coefficient of restitution .) The initial downslope particle energy therefore is (1/2)mu 2 0 = 2 mghsin 2 θ = E p0 . In turn, because work is W = F x l, where F x is the downslope force and l is a displacement, then for a fixed force F x the displacement is l = W/F x . Assuming that W must be equal to the initial kinetic energy E p0 (that is, this initial energy is dissipated over the distance l), then l = E p0 /F x . Assuming a Coulomb-like friction behavior, F x = mg sin θ − µ d mg cos θ, where µ d is a dynamic friction coefficient. Upon asserting that the length l represents the expected travel distance µ x , where it is assumed that |µ d cos θ | ≥ sin θ . As described below, this is equivalent to assuming that particle energy decreases linearly with distance. This formulation correctly describes the motion of an individual particle that experiences a fixed Coulomb-like friction force, but it cannot represent the rarefied behavior of an ensemble of particles. Nonetheless, it shares elements of the preceding formulation. Namely, in comparing Eq. (85) with Eq. (83), let us momentarily set aside the fact that E p0 cannot represent E a0 except in the limit of zero variance of initial energy states (γ = 1) and that the friction factor µ in Eq. (83) and the dynamic friction coefficient µ d in Eq. (85) have different interpretations. These two descriptions of the average travel distance µ x then converge in the limit of α → ∞. Inasmuch as Eq. (52) correctly describes the behavior of α, this limit coincides with Ki → 1.
More generally, Eq. (85) implies that the deposition rate is independent of the extant energy state of particles. If Eq. (85) denotes the average of a distribution of travel distances with fixed disentrainment rate, then this fixed rate P x = 1/µ x . In dimensionless form this is That is, Eq. (86) cannot allow for the possibility of variations in the cooling rate or heating rate that give spatial variations in the disentrainment rate, as in Eq. (72). The resulting distribution f x (x) of travel distances therefore is exponential for all Ki < 1. Interestingly, the formulation of Kirkby and Statham (1975) is equivalent to (Appendix I) This is like Eq. (57) but lacks the heating effect of deposition. Like Eq. (85), Eq. (87) implies that deposition is independent of the extant energy state; when Ki = 1 the energy E p remains fixed as x → ∞ (again noting that E p0 cannot represent E a0 except in the limit of zero variance of initial energy states). Dorren (2003) provides a review of efforts to elaborate the Kirkby-Statham description of particle motions in relation to hazard assessment. These mostly appeal to a Coulomblike frictional behavior and are focused on predicting rockfall runout distances.

Gabet-Mendoza formulation
In support of their experimental work involving particle motions on a rough, inclined surface, Gabet and Mendoza (2012) appeal to ideas from Quartier et al. (2000) and Samson et al. (1998) and assume that particle acceleration is described as a linear combination of the gravitational force, a Coulomb-like friction and collisional friction, namely, du(t) dt = g sin θ − µ d g cos θ − κu ψ .
As written, the dimensions of the coefficient κ depend on the value of the exponent ψ, which is thought to vary between one and two based on experiments. The principal significance of this formulation is that it points to the role of collisional friction -which Quartier et al. (2000) demonstrate is the principle source of energy dissipation in their experiments -and thus represents an important step beyond the Coulomb-like model of Kirkby and Statham (1975). However, because there is confusion in the literature regarding the form and interpretation of Eq. (88), we summarize the basis of this formulation in Appendix J. The essence is this: the Coulomb term and the collisional term as written are not additive for an individual particle. The collisional term is a stochastic quantity and applies to an averaged behavior, not to the instantaneous behavior of an individual particle. If this term is involved, the velocity u must be considered a timeaveraged or ensemble-averaged velocity, or Eq. (88) must be recast as a Langevin-like equation. Parts of this formulation are appropriate for describing the behavior of particles that roll bumpety-bump over a surface roughened with a monolayer of particles, but for the reasons described above and in Appendix J the deterministic, continuous form of this formulation is not well matched to the mechanics involved in rarefied motions over the roughness of natural hillslopes. In both formulations above the idea of a Coulomb-like friction with a dynamic friction coefficient is mechanically unsound (Appendix J). For particles that tumble, bounce and skitter down a rough surface, the static normal weight of a particle, mg cos θ , does not set the particle-surface friction. Rather, dynamic forces during collision impulses matter (Brach, 1991;Stronge, 2000). This includes the dynamic Coulomb-like friction force associated with conversion of translational to rotational kinetic energy during collisions (Appendix E). Formulating a dynamic friction coefficient would require ensemble averaging of the ratio of tangential to normal momentum exchanges, both of which are random variables. A Coulomb-like friction is appropriate for solid body and dense granular motions, but not for the rarefied conditions described here.

Furbish-Haff-Roering-Doane formulation
The probabilistic formulation presented by Furbish and Haff (2010), Furbish and Roering (2013), and Doane et al. (2018) assumes that travel distances are described by an exponential distribution whose mean µ x is a function of the local slope S. Namely, the mean increases with S and becomes unbounded as S approaches a critical value S c . This formulation is equivalent to setting the mean µ x ∼ L c . Here we consider the behavior of L c over small S then as S → S c .
Starting with Eq. (49) we write If α is described by Eq. (52), and neglecting the factor µ 1 for simplicity, then this is A binomial expansion of Eq. (89) gives and Eq. (90) gives From Furbish and Haff (2010), If we interpret the length scale λ 0 ∼ αE h /mgµ, then for small to modest slopes S, Eqs. (91) and (93) differ in their linear versus quadratic forms at lowest order. If λ ∼ α 0 E h /mgµ, then the behaviors of Eqs. (92) and (93) are the same if S c = 2µ. More generally, Eqs. (92) and (93) display the same behavior with increasing S. Namely, if the critical slope is interpreted as S c ∼ µ, the length scale L c in both cases increases approximately linearly over much of the domain of S then asymptotically becomes unbounded as S → S c . Note that the formulation involving Eq. (93) is limited to an exponential form of the distribution of travel distances (Furbish and Haff, 2010;Furbish and Roering, 2013;Doane et al., 2018). It does not mimic the different forms of f x (x) illustrated in Fig. 4, and it lacks a mechanical underpinning as presented in previous sections.

Varying disentrainment rate
The formulations above envision particle motions starting at position x = 0 such that the distribution of travel distances is expressed as f x (x). This is particularly convenient when considering laboratory and field experiments in which particles are released on a sloping surface from the same starting position, as examined in the companion paper. Here we return to our starting point concerning calculations of the particle flux and use of the entrainment form of the Exner equation as summarized in Sect. 2. Recall that in this frame of reference the particle travel distance is denoted by r and the starting position may involve any position x. Then, with reference to Eqs. (4)-(6), the disentrainment rate is P r (r; x), the distribution of travel distances r is f r (r; x) and the exceedance probability (survival function) is R r (r; x) = 1 − F r (r; x). In turn, for particles starting at x, the mean travel distance is µ r (x).
Using the results of Sect. 2.1 in specifying the exceedance probability R r (r; x) and the probability density f r (r; x) in the entrainment forms of the flux and the Exner equation, Eqs. (5) and (6), requires a key assumption. Namely, one must assume that the factors controlling the disentrainment rate on a hillslope change sufficiently slowly over x such that these factors defined at any position x correctly determine the conditions for the downslope motions of particles starting at x (Furbish and Roering, 2013;Doane et al., 2018). This is equivalent to assuming that during its downslope motion a particle "sees" conditions similar to those at its starting position. However, in actuality particles may see new conditions during their motions that change their behavior relative to what was "predicted" by the conditions at their starting positions. Let λ S denote a characteristic distance over which conditions persist. For example, focusing on the Kirkby number Ki, Thus, a rapid change in Ki over position x implies that λ S is small, and if Ki changes slowly, then λ S is large. Uniform conditions imply that λ S → ∞. We may then assume that if µ x λ S , conditions change sufficiently slowly that use of the continuous forms of R r (r; x) and f r (r; x) with Eqs. (5) and (6) provides a reasonable approximation of collective particle behavior.
This strategy might be acceptable for an exponential-like distribution with finite moments, but it is problematic if particle travel distances r involve a heavy-tailed distribution or if conditions transition along x between net cooling and net heating, or vice versa. Herein resides the merit of the discrete form of the disentrainment rate and the distribution of travel distribution as summarized in Sect. 2.2. Recall that this formulation is aimed at describing the ingredients of disentrainment that are occurring at different locations on a hillslope, where the mechanical behavior at a location transitions to another behavior in the downslope direction. In this formulation we let p k denote the probability that a particle, having not been disentrained before the kth interval, then becomes disentrained within this interval.
Let dr denote a finite (rather than infinitesimal) interval. Then the kth interval begins at r and ends at r + dr. Letting N k = N(r) denote the number of particles reaching the kth interval, then based on Eq. (56) the probability that a particle will be disentrained within this interval is This will be recognized as the setup for a simple finitedifference scheme, to be coupled with a similar finitedifference expression for the average energy state E a . Namely, in dimensionless form, for particles starting at positionx, Figure 5. Cartoon of hillslope surfaces with downslope variations in steepness leading to concomitant variations in heating, cooling and deposition; this is in contrast to a planar slope with uniform µ that produces either net heating or net cooling, or isothermal conditions. and where both the Kirkby number Ki and the elements of the transition value of the Kirkby number Ki * = 1 − γ /α + 1/α may vary from one interval to the next as conditions vary in the downslope direction. The proportion ofN(0;x) particles starting from positionx is then recovered from We note that, although different in form and implementation, this description is similar to the particle-based scheme of Tucker and Bradley (2010) in which particle behavior adjusts to newly encountered conditions during downslope motion. Consider for illustration a situation in which the Kirkby numbers Ki and Ki * systematically vary with positionx, relative to uniform conditions (Fig. 5). This may be due, for example, to variations in steepness or in the friction µ with increasing travel distance. Also recall that Ki < Ki * implies cooling, whereas Ki > Ki * implies heating.
In these examples we let α vary with the Kirkby number Ki according to Eq. (52) in anticipation of results presented in the companion paper. A decreasing rate of cooling associated with, for example, steepening in the downslope direction generally increases the heaviness of the tail of the distribution relative to the tail associated with a fixed rate of cooling Figure 6. Plot of exceedance probability Rx (x) versus dimensionless travel distancex showing conditions with fixed net cooling (solid line) and conditions that start with the same cooling rate but then involve a decreasing rate with increasing distancex (circles). In this example the Kirkby number starts at Ki = 0.70 atx = 0 and increases to Ki = 0.96 atx = 10. (Fig. 6). We present evidence in the companion paper that this occurs in the field-based experiments reported by DiBiase et al. (2017). Specifically, particles were launched down a rough hillslope surface, and then their travel distances were measured over a 14 m interval. Particles reaching the steeper slope below the measurement interval continued to the base of the hillslope without stopping. In turn, an increasing rate of cooling (e.g., with decreasing slope in the downslope direction) generally lightens the tail and may lead to truncation of the distribution if the rate increases rapidly enough. Moreover, a condition involving initial heating followed by cooling (e.g., with a concave hillslope surface) can lead to a distribution with a finite mode (Fig. 7). These examples represent situations where particle travel distances cannot necessarily be approximated by a distribution whose parametric values are set by the hillslope conditions at the position where particle motions start. We defer further examination of this behavior, including use of the convolutions in Eqs. (5) and (6), for a later time.

Discussion and conclusions
Our formulation of rarefied particle motions is based on a description of the energy balance of a cohort of particles treated as a rarefied granular gas, as well as a description of particle deposition that depends on the energy state of the particles. The formulation leads to a generalized Pareto distribution of particle travel distances, Eq. (74). This distribution represents three well-defined behaviors in which the Kirkby number Ki -the ratio of gravitational heating to frictional cooling -has a principal role. Conditions with relatively small Ki lead to rapid thermal collapse such that the distribution of Figure 7. Probability mass function f k (k;x) of discrete travel distances k associated with initial net heating over small k followed by net cooling with increasing k, leading to a finite mode. In this example the Kirkby number starts at Ki = 0.90 at kdr =x = 0 and decreases to Ki = 0.57 at kdr =x = 10. travel distances is bounded. For intermediate values of Ki the rate of gravitational heating may be matched by the rate of frictional cooling, giving approximately isothermal conditions and an exponential distribution of travel distances. Conditions with large Ki and net heating lead to a heavytailed distribution of travel distances. We provide compelling evidence of all three behaviors in our companion paper (Furbish et al., 2021a). Here we emphasize that we do not choose the generalized Pareto distribution in the empirical manner of selecting a distribution based on goodness-of-fit criteria applied to data sets. Rather, this distribution is dictated by the physics of the problem, just as, for example, the Boltzmann distribution (an exponential distribution) emerges in classical statistical mechanics from consideration of the accessible energy microstates of a gas system. We elaborate this point in the third companion paper (Furbish et al., 2021b).
Two of the most important elements of the formulation are the deposition length scales l c (E p ) and L c (E h ), the former being keyed to the specific particle energy state E p and the latter being keyed to the harmonic average energy E h of the particle cohort. Indeed, these lengths provide the essential connection between particle deposition and the energy balance of the particle cohort. We assume that l c is set by the advective cooling length scale in the Fokker-Planck equation, that is, Eq. (44). This is a natural choice in that deposition must go with cooling. The energy-specific deposition rate in the absence of heating is then specified as if deposition proceeds as a spatial Poisson process. We emphasize that this represents a maximum (information) entropy choice in the sense that it is faithful to what we think we know, namely, the connection between deposition and cooling, as well as to what we do not know (Jaynes, 1957a, b), namely, any detailed physics that would produce a different rate (for example, involving a nonlinear dependence on energy state) but which cannot be specified or constrained with available information. This description then leads to the interesting result, Eq. (46), that the loss of total energy due to deposition appears to be independent of the energy state. In particular, the loss of large energy states occurs at a relatively slow rate, whereas the loss of small energy states occurs at a relatively fast rate. In effect the rate of loss of energy per energy interval is fixed across energy states. The result is that the energy E p cancels with substitution of Eq. (45) into the integral in Eq. (46) such that the total loss becomes independent of the energy state. That is, the loss of total energy goes simply with the loss of particles (and the energy they possess).
In turn, the total deposition rate is energy dependent. This rate, defined by the length scale L c , is obtained by integrating the number density of particles over all possible energy states as in Eq. (48). Because l c is keyed to the energy state E p , but the integral in Eq. (48) does not involve this energy in the numerator, the result involves the reciprocal of the harmonic average energy E h . In general, the harmonic average diverges from the arithmetic average E a with increasing variance of the distribution of energy states. With E a /E h = γ , the resulting ratio γ /E a in Eq. (56) (with dimensionless form given by Eqs. 61 or 64) reflects an increasing proportion of lowerenergy particles available for deposition, relative to this availability with γ → 1. This effect is directly apparent in the expression of the mean travel distance, Eq. (69), associated with isothermal conditions. Note that the formulation does not involve specifying a threshold energy for deposition. Such an idea is mechanically irrelevant. Whereas low-energy particles are on average more likely to become disentrained than are high-energy particles, a set of particles with precisely the same low energy will for probabilistic reasons not be disentrained simultaneously. Each particle experiences a unique set of conditions that disentrain it; because of this uniqueness of conditions a particle with energy below an arbitrarily assigned threshold can with finite probability be gravitationally reheated to a higher-energy state. For given particle and surface roughness conditions, the formulation treats this aspect of disentrainment as a probabilistic process. In effect, this aspect is incorporated into the deposition lengths l c and L c as these are related to the distribution of particle energy states and the probabilistically expected extraction of energy during collisions.
Frictional cooling is formulated in terms of extraction of translational kinetic energy associated with particlesurface collisions. This involves the random variable β x = − E p /E p whose energy-specific average β x is the expected proportion of energy extracted from particles with energy E p . In detail the change in energy E p may be partitioned between a frictional loss, any conversion of translational to rotational energy, and any apparent loss associated with downslope incident motion reflected to transverse motion during a glancing collision. Our treatment of the dissipation quan-tity β x as a random variable does not distinguish the details involved in collisions. Yet these details may be important in terms of effects of different particle sizes and shapes, specifically the likelihood that the partitioning of energy losses differs between sizes or shapes. Herein the factor α modulating the length scale in Eq. (51) has a dualistic role. As incorporated in Eq. (51), this quantity represents the effect of heating, namely, to decrease the likelihood of deposition by decreasing the proportion of particles that cool to sufficiently low energies for deposition to occur -which translates to suppressing the disentrainment rate and increasing the length scale of deposition L c . As incorporated in Eq. (53), this quantity modulates the frictional cooling described by µ ∝ β x to give an apparent decrease in friction associated with deposition.
Whereas particles that are small relative to the surface roughness texture are on average more likely to experience near-collinear collisions with surface bumps and be "captured" within divots and pockets, particles that are large relative to the roughness texture are less likely to experience direct collisions with, or strong deflections by, smaller surface bumps. In addition, large particles are more likely to experience conversion of their translational energy into rotational energy with less loss during collisions. In particular, large spherical particles are more likely to roll or spin with increased heating, and large spinning particles are less likely than are smaller particles to be frictionally cooled. These points are reflected in the laboratory experiments of Samson et al. (1998Samson et al. ( , 1999) (Appendix J), the laboratory experiments of Kirkby and Statham (1975), and the field experiments of DiBiase et al. (2017) and Roth et al. (2020) (see the second companion paper; Furbish et al., 2021a). This also implies that for a given slope angle and surface roughness, some particle sizes may experience net cooling while some sizes experience net heating (Roth et al., 2020), likely contributing to the size sorting observed on many talus and scree slopes (Kirkby and Statham, 1975;Statham, 1976;Luckman, 2013). We suspect the noticeable sorting in Fig. 1 is due to these effects.
The formulation readily accommodates the idea of a mixed distribution composed of different distributions associated with different particle sizes or mechanical behaviors. This amounts to forming a sum of distributions, each weighted in proportion to the size classes involved in transport. As with individual sizes, the formulation assumes rarefied conditions -that particles of different sizes do not interact during their downslope motions, or that such interactions negligibly influence the particle energy balance relative to particle-surface interactions. We provide an example in the second companion paper (Furbish et al., 2021a).
With rockfall and subsequent particle motions over talus and scree surfaces, the initial energy state E a0 can be approximated in terms of the fall height (Kirkby and Statham, 1975). But this is a special situation in which the initial energy can be reasonably constrained. More generally, and with reference to the entrainment forms of the flux and the Exner equation, Eqs. (5) and (6), we are concerned with entrainment of particles from many if not all positions on a hillslope in relation to disturbances. This points to the idea that entrainment, if followed by long-distance motions, requires sufficient initial heating to keep particles moving downslope. This in turn echoes the conclusion of Doane et al. (2018) that correctly specifying the entrainment rate is a key part of implementing formulations of nonlocal transport and mass conservation. Because of the significance of sediment capacitors (e.g., vegetation) in trapping and storing sediment on hillslopes Doane, 2018), there is merit in clarifying the initial energetics of particles upon their release (i.e., entrainment) from storage. There also is a need to examine re-entrainment and transport associated with particle collisions, analogous to work on particle splash during eolian transport and the energetics of collective entrainment (Ancey et al., 2008) by collisions during bed load transport (Lee and Jerolmack, 2018).
That the energy and mass balances are expressed in the form of coupled differential equations opens the possibility of describing effects of varying disentrainment rates in response to changing downslope conditions in a manner intrinsic to particle-based treatments of transport (Tucker and Bradley, 2010), but not readily incorporated in previous probabilistic descriptions. Namely, if surface conditions change in the downslope direction, for example, giving net cooling followed by heating or vice versa (Fig. 5), then particles whose travel distances are large enough "see" this change and their behavior concomitantly changes. In this case the coupled equations of energy and mass in principle can be solved to accommodate these changing conditions. Interestingly, as differential (or finite difference) equations these have a local form, yet they intrinsically represent nonlocal behavior in that information concerning the energy state E a and the mass N is cumulatively handed from one position to the next downslope. In turn, the forms of R r (r; x ) and f r (r; x ) associated with any position x in the expressions of the flux and its divergence, Eqs. (5) and (6), must be based on information downslope from this position.
In this regard, here we offer further perspective on what is meant by local versus nonlocal transport on hillslopes. A transition of travel distances involving a distribution with a light tail to one with a heavy tail, as embodied in the generalized Pareto distribution, does not distinguish local from nonlocal transport. As fully explained in Furbish and Roering (2013) and in Furbish et al. (2017a), the convolutions in Eqs. (5) and (6) represent nonlocal transport regardless of the form of the probability density function f r (r; x) and its associated exceedance probability function R r (r; x). These scale-independent expressions are just specialized forms of the master equation used in probabilistic descriptions of particle motions over a remarkable range of scales (Einstein, 1905;von Smoluchowski, 1906;Chandrasekhar, 1943;Risken, 1984). Nonlocal transport is a physical thing and refers to the idea that attributes of particle motions used in defining the rheology, the flux or its divergence at a position x depend on conditions "far" from this position (e.g., Bocquet et al., 2009;Brantov and Bychenkov, 2013;Henann and Kamrin, 2013). In contrast, local transport is a mathematical thing, not a physical thing, and refers to the idea that under certain circumstances the convolution form of the master equation can be approximated such that the flux or its divergence has the form of a local mathematical expression -for example, a Fokker-Planck equation -whose terms involve conditions associated with the local position x. As alluded to above, a local expression can be formulated when the distribution f r (r; x) has finite moments and is peaked near the origin (r = 0). A heavy-tailed behavior means that this is not justified. Rather, the full convolution or a fractional derivative approximation of it must be used (Schumer et al., 2009). Because of the generality and scale independence embodied in the master equation and the convolutions in Eqs. (5) and (6), the use of "nonlocal" as a qualifier of "transport" in reference to hillslopes actually is redundant (Doane, 2018). Its use is merely a reminder that the flux or its divergence at a position x depends on things happening upslope.
The entrainment rate E s (x), the exceedance probability function R(r; x) and the distribution of travel distances f r (r; x) within the integrals in Eqs. (5) and (6) are treated as continuous functions. However, this does not imply a continuum behavior. Like the Fokker-Planck equation, which describes the evolution of the probability density function of a random variable that may or may not satisfy the continuum hypothesis (see Appendix A in Furbish et al., 2018b), the continuous forms of Eqs. (5) and (6) represent a probabilistic description of expected behavior, not necessarily the behavior of any one realization (system). In practical terms, imagine a rockfall event from a cliff face involving an individual particle or a relatively small number of particles whose subsequent downslope motions then start at position x = 0 at the base of the cliff. Inasmuch as the generalized Pareto distribution f x (x) provides the correct description of the expected behavior of the particles from the rockfall event, then these particles may be viewed as a (small) sample drawn from this distribution. The outcome of each realization (sample) is almost certainly different from all other realizations. Over a period of time the pooled outcomes (travel distances) of many events converge to the smooth representations given by f x (x) and R x (x) -as if Gabet and Mendoza (2012), DiBiase et al. (2017) and Roth et al. (2020) had performed a gazillion additional rock-launching experiments (see second companion paper, Furbish et al. 2021a) and then pooled the outcomes of these experiments. Implications of this idea are examined further in Furbish and Haff (2010), Furbish and Roering (2013), and Furbish et al. (2017band Furbish et al. ( , 2018b, and again in the fourth companion paper (Furbish and Doane, 2021).
The formulation may have interesting implications for examining Martian landforms. For example, the appearance of the acceleration g in Eqs. (51), (56) and (83) immediately suggests the possibility that particle travel distances are on average significantly longer on Mars than on Earth for otherwise similar particle sizes and surface-roughness conditions; we are confident in suggesting that future Martians likely will have far more fun than Earthlings in the sport of boulder rolling, notably on the crater rim of Olympus Mons. Nonetheless, we leave it to folks more familiar with Mars than we are to examine this. A key element of doing this is to either assume that the friction factor µ is similar to what occurs on Earth (which may be entirely reasonable) or further unfold the elements of this factor. We comment on this idea again in the second companion paper (Furbish et al., 2021a). We meanwhile note that a similar question arises in relation to the role of g in setting the friction of granular slopes on Mars. Atwood-Stone and McEwen (2013) address this question by examining dune slip-face angles on Mars and suggest that the similarity of these angles with those observed on Earth weakens any argument for different granular behavior associated with g -consistent with independent assessments (Moore et al., 1987;Tesson et al., 2020) and the idea that this angle is set by the static granular force-chain network (Cates et al., 1998;Furbish et al., 2008).

Appendix A: Choice of terminology
The study of granular materials is concerned with the behavior of the phases of these materials and associated phase transitions (Jaeger et al., 1996;Baldassarri et al., 2005;Daniels and Behringer, 2006;Forterre and Pouliquen, 2008;Jerolmack and Daniels, 2019). These phases and transitions share attributes with ordinary materials -solids, liquids and gases -although granular materials often exhibit behavior that is much different than ordinary materials. Nonetheless, it has become customary in the study of granular materials to adopt terminology similar to that used to describe ordinary materials.
The ideas of heating and cooling of a granular material are straightforward, to mean a change in the granular temperature of the material, specifically the average translational kinetic energy of the particles (but see van Zon andMacKintosh, 2004, andBaldassarri et al., 2005). However, granular materials do not possess an internal energy in the sense that we attribute to the particles of an ordinary liquid or gas. This means that heating of a granular material requires a mechanical input of energy, whereas cooling is associated with dissipative (non-conservative) collisions of particles with each other and with boundaries. In the problem at hand, gravitational heating occurs as particles move downslope, and their gravitational potential energy is converted to kinetic energy. Frictional cooling is associated with dissipative particle-surface interactions (e.g., collisions).
The ideas of melting and freezing of a granular material (Daniels and Behringer, 2006) pertain to the transition between a solid-like phase and a hydrodynamic (fluidlike) phase. However, in the problem at hand, we are concerned with rarefied particle conditions in which disentrainment from the rarefied state to the solid-like state or vice versa does not involve an intermediate hydrodynamic phase (e.g., Haff, 1983;Jenkins and Savage, 1983;Jaeger et al., 1996). Entrainment is akin to sublimation, and disentrainment is akin to deposition (or desublimation). Phase transitions involving an intermediate hydrodynamic phase (evaporation-condensation and melting-freezing) are represented in Earth-surface processes, for example, by melting (entrainment) and freezing (disentrainment) at the base of a granular flow, dry or wet. We recommend the papers by Forterre and Pouliquen (2008), Church (2011), Houssais et al. (2015), and Jerolmack and Daniels (2019) for perspectives on this emerging topic, notably in relation to transport by shear flows.

Appendix B: Particle cohort
In order to clarify the idea of a cohort of particles associated with a control volume with edge length dx (Fig. 2), here we offer a straightforward thought experiment. As a point of reference, the study of granular gases typically involves consideration of the behavior of an individual system composed of many particles that are mechanically heated, where energy dissipation is associated with particle-particle collisions. In contrast, our problem involves an unusual situation in that we must start by considering a system composed of one particle, where energy dissipation occurs with particle-surface collisions, and then in turn consider the behavior of an ensemble of such systems.
Imagine a box containing one particle. We mechanically shake the box and the particle is heated. At any instant the particle has kinetic energy E p . Each time the particle collides with the floor of the box it is re-heated, and each time it collides with a wall of the box energy is extracted. Eventually the particle by chance has sufficiently low energy that when it next encounters a wall it becomes irreversibly deposited (disentrained) onto the wall. Then the box has no moving particle.
Like Gibbs (1902), we now imagine a great number N of nominally identical but independent single-particle systems, where each particle in each system (box) behaves according to the same laws of physics, each undergoing heating and collisional cooling, and occasionally being deposited (Fig. B1). We now choose one instant in time and examine the state of each particle. Some particles previously have been deposited, so at this instant N refers to those systems whose particles are in motion. At this instant each particle has kinetic energy E p , and we may define the ensemble probability density f E p (E p ) of energy states E p . As a consequence we also may at this instant define the ensemble-averaged kinetic energy E p and the total energy E = N E p . (Alternatively, we could imagine all N particles in a single box at one instant, but with the caveat that we must imagine them as not interacting with each other, only with the floor and walls of the box.) We now choose a successive instant in time, namely, t + dt. During dt the number N decreased with deposition of some particles, the distribution f E p (E p ) changed, and the average energy E p and the total energy E changed.
More generally we can choose N different instants in time t, one instant for each box, and examine the state of each particle. Then, upon collecting the particles as a cohort independently of the selected times, like above we observe an ensemble distribution of energy states with specific average energy E p and total energy E. At this point we relax the idea of a box and simply view particle-wall collisions more generally as particle-surface collisions during motions parallel to x; instead of heating the particles via particle-floor collisions we imagine this to occur continuously by gravitational heating. We then let the N selected instants in time coincide with those instants that each of the particles is located at a specified position x. That is, these are the N (x) particles located at the left face of the interval x to x + dx (Fig. B1). We may then examine how the number N (x) and the ensemble distribution f E p (E p , x) and its moments change over the interval x to x + dx as this particle cohort moves downslope. Note that each member of the cohort not deposited within this interval may arrive at position x + dx at a different in- Figure B1. Schematic diagram of surface inclined at angle θ and control volume with edge length dx through which particles move, with Gibbs-like ensemble of single-particle systems leading to definition of the cohort of N (x) particles starting at the left face of the control volume. stant in time. This is unimportant, however, as we are interested only in how the energy states of the particles vary with position x. Similarly, upon choosing any subsequent downslope position x, we must recognize that the N(x) particles reaching this position do so at entirely different instants in time. Here is a final note: in this problem a particle ensemble average is identical to a Gibbs ensemble average.
Consider, then, both similarities and differences between conditions in a "normal" non-equilibrium granular gas and the rarefied conditions of particle motions on hillslopes. As a reminder, the rarefied conditions that we describe do not involve particle-particle collisions, only particle-surface collisions. The Knudsen number in any realization is effectively infinite. As described in the text, the distribution n E p (E p , x) of energy states E p of the particle cohort (ensemble) varies with position x. Because the moments of this distribution are assumed to be defined, we could in fact also define the distribution of downslope velocities, thence the mean velocity and fluctuations about the mean. That is, we could formally define a granular temperature (Goldhirsch, 2008) and then associate this temperature with an internal granular energy content at any position x. However, the granular temperature thus defined for the rarefied problem is not physically relevant to this problem, and a granular internal energy does not physically exist. Indeed, granular energy is neither advected nor diffused in the sense of a normal granular gas system, for example, a granular flow. Moreover, quantities such as the granular density and pressure do not exist. In short, there are no "internal" gas dynamics whatsoever, as the rarefied conditions do not represent a particle system that evolves dynami-cally over time and space, as with a granular gas in a box or in a conduit or over an inclined surface, each of which involves dissipative particle-particle collisions during the gas evolution. Yet the description of the spatial evolution of the distribution of the energy states of the particle ensemble remains entirely relevant. Indeed, the rarefied case examined herein represents a highly unusual granular gas. To our knowledge this particular granular gas problem has not been examined before. The closest direct analogue seems to be that reported by Almazán et al. (2017), who, building from the work of Volfson et al. (2006), show that the formalism used in describing the cooling and thermal collapse of a granular gas is akin to the formalism used in describing the dissipative energetics of a single nonelastic ball bouncing on a smooth horizontal surface (without energy input from vibration or gravitational heating).
Whereas the dynamics of an ordinary granular gas are centered on dissipative particle-particle collisions, in our problem the dynamics are centered on dissipative particle-surface collisions. This dynamic is fundamentally a boundary-related phenomenon, not an internal one. In a standard granular gas, energy dissipation occurs during particle-particle collisions. But note that, whereas a dissipative collision between two particles generally leads to an overall loss of kinetic energy, the kinetic energy of one of the two particles may actually increase. In contrast, in our problem involving only particle-surface collisions, essentially all collisions involve extraction (dissipation) of the particle kinetic energy defined with respect to downslope motion. Thus, all collisions are "cooling" in the sense of reducing particle kinetic energy. Similarly, gravity provides a uniform "heating" in the sense of increasing kinetic energy. Thus, we appeal to the ideas of cooling and heating without reference to fluctuating motions (and granular temperature), where cooling simply refers to the idea that kinetic energy is extracted from the particle cohort via collisional friction and heating refers to the idea that kinetic energy is added to the cohort via conversion of gravitational potential energy into kinetic form. The idea of thermal collapse then is entirely satisfactory (e.g., Volfson et al., 2006). We further reemphasize that our formulation involves the evolution of n E p (E p , x) with respect to space, whereas standard granular gas theory typically involves hydrodynamic-like descriptions of quantities such as the granular density, temperature, pressure and velocity that evolve with respect to time (in an Eulerian manner), where it is assumed that local continuum-like definitions of these quantities exist (e.g., Goldhirsch, 2008) -conditions that are not relevant in the rarefied problem that we examine.

Appendix C: The Fokker-Planck-like equation
Let q = E p (x + dx) − E p (x) denote a change in the energy of a particle over the small distance dx, and let f q (q; E p , x) denote the probability density function of the changes q asso-ciated with the energy state E p and position x. If n E p (E p , x) denotes the number density of particle energies E p , then according to the master equation, Assuming the density f q (q; E p , x) is peaked near q = 0 with finite first and second moments, we may expand the integrand in Eq. (C1) as a Taylor series to second order, subtract n E p (E p , x) from both sides, then divide by dx and take the limit as dx → 0 to obtain a Fokker-Planck-like equation, namely, Here, k 1 (E p , x) is a drift speed and k 2c (E p , x) is a diffusion coefficient defined by The drift speed k 1 (E p , x) has two parts: one associated with gravitational heating and one associated with frictional cooling. Starting with gravitational heating, let h(x) denote the height of a particle within the gravitational field at position x. If E p (x) denotes the particle kinetic energy equal in magnitude to the potential energy mgh(x) at height h(x), then E p (x + dx) = mgh(x + dx) is the magnitude of the particle kinetic energy at the height h(x + dx), assuming a complete conversion of gravitational to kinetic energy without loss. Thus, This indicates that q in Eq. (C3) is independent of the energy state E p and therefore may be removed from the integral. We thus write Eq. (C3) as This is the steady rate of gravitational heating. The part of k 1 (E p , x) associated with frictional cooling is obtained as follows. With particle-surface collisions we may assume that q is proportional to the expected value of E p . In turn we let n x = 1/λ denote the expected number of collisions per unit distance, where λ is the expected travel distance between collisions. This leads to where the overline denotes an average over particles at the energy state E p (rather than a global average). Because gravitational heating is a fixed quantity according to Eq. (C6), heating does not involve diffusion. In turn, the diffusion coefficient k 2c (E p , x) associated with frictional cooling is given by Note that whereas k 1h is a fixed quantity, k 1c and k 2c must be viewed as statistically expected quantities.

Appendix D: Expected travel distance between collisions
Momentarily let v = u 2 1/2 , and then let v 0 denote the surface-parallel velocity of a particle rebounding with reflection angle φ measured from the surface (Fig. D1). We then know that where U 0 denotes the horizontal velocity and W 0 denotes the vertical velocity. For a vertical change in elevation Z over a horizontal distance X associated with the surface-parallel distance λ, we know that Z = −SX = −SU 0 t 0 , where t 0 is the travel time. For a rebounding particle starting at position z 0 = 0 we may deduce from Newton's second law that which gives With λ = X/ cos θ = U 0 t 0 / cos θ, we then combine Eqs. (D1)-(D3) to obtain Upon expanding sin(φ − θ ) and cos(φ − θ ) using difference formulae, algebra and trigonometric identities eventually lead to λ = 2v 2 0 cos θ g tan φ 1 + tan φ tan θ + tan 2 θ + tan φtan 3 θ . For θ = 0, this reduces to λ = 2v 2 0 tan φ/g. If for small slopes tan φ ∼ tan θ and for large slopes tan φ tan θ, then at leading order, λ ≈ 2v 2 0 cos θ g tan φ 1 + tan 2 θ = 2v 2 0 tan φ g cos θ . (D6) For the purpose of scaling, we now assume that v 2 0 ∼ u 2 and write λ ≈ 2 u 2 tan φ g cos θ , which gives Eq. (26) in the text. The soft matter trajectory analysis of Tajima and Fujisawa (2020) includes viscous air resistance, which we neglect.

Appendix E: Energy extraction during collisions
Here we provide a qualitative description of the basis for assuming that a change in the downslope energy of a particle associated with a collision can be expressed as E p = −β x E p wherein both β x and E p must be treated as random variables. We start by noting that the topic of particle collision mechanics is well developed for idealized particle-particle collisions and particle-surface collisions involving spherical particles, as well as peculiarities of noncollinear collisions associated with irregular particles. Relevant elements are covered in Brach (1984Brach ( , 1989Brach ( , 1991, Stronge (1990Stronge ( , 2000, Dunn (1992, 1995), and Ismail and Stronge (2008). Although we cannot directly apply details of this work given the complexity of particle motions on natural rough hillslopes, this work nonetheless offers a clear guide in the interpretation of the relation E p = −β x E p , notably in relation to experimental results presented in the second companion paper (Furbish et al., 2021a). With reference to Fig. E1, consider an idealized collision of a spherical particle with a rigid planar surface with slope angle θ . Let u, w and ω respectively denote the surface parallel velocity, the surface normal velocity and the angular velocity of the particle with mass m and radius r = D/2, and let the subscripts 1 and 2 denote incident and reflection values. With appropriate modification of the coordinate and sign Figure E1. Definition diagram for idealized collision of a spherical particle with a rigid planar surface. convention used by Brach (1991), the momentum components associated with impulses can be expressed as (Brach, 1991(Brach, , 1998Brach and Dunn, 1995) w 2 = − w 1 , (E1) u 2 = u 1 + µ c (1 + )w 1 + gτ (sin θ − µ c cos θ ) and (E2) where w 1 < 0, is the normal coefficient of restitution attributed to Newton, µ c is the ratio of tangential to normal impulses during the collision, and τ is the impulse duration. Note that µ c generally is not a coefficient of friction, although it may be equal to a coefficient of friction in special cases, for example, with sliding throughout the entire duration of the collision (Brach, 1991;Brach and Dunn, 1992). Also note that µ c = 0 if u 1 = 0. The second term on the right side of Eq. (E2) represents the effect of tangential friction on the velocity u, increasing with the magnitude of the normal impulse associated with the velocity w 1 . This term may be considered the dynamic contribution to friction during τ . The term gτ sin θ represents the downslope contribution to the impulse associated with the weight of the particle, and the term gτ µ c cos θ represents an enhancement of friction associated with this weight. The impulse duration τ may be on the order of milliseconds for a hard particle impacting a hard surface. It may be longer for a hard particle impacting a relatively soft surface (Brach, 1991). If the magnitude of w 1 is sufficiently large and τ is sufficiently short, the gravitational terms in Eqs. (E2) and (E3) may be neglected. The second term on the right side of Eq. (E3) represents the effect of tangential friction in contributing to rotational motion, that is, the conversion of translational energy to rotational energy.
Collisions involving small incident angles begin with sliding during the impulse duration τ . If with a sufficient normal dynamic force this initial sliding gives way to stick prior to separation, then for a sphere with moment of inertia I = (2/5)mr 2 , the velocity u 2 = rω 2 at separation. This leads to which represents the outcome of a conversion of translational to rotational motion with stick. Whereas the resultant velocity u 2 can be determined in this situation, the effect of sliding on u 2 cannot be analytically constrained. Nonetheless, Eq. (E3) indicates that collisions induce a conversion of translational to rotational motion in that tangential friction during an impulse exerts a torque on the particle, thereby extracting translational kinetic energy that is in addition to work performed during particle deformation and by friction.
We also note that low-angle collisions likely dominate in the problem at hand. In order to recast the problem in terms of kinetic energy, we start by squaring Eqs. (E1)-(E3) to give We may immediately neglect terms involving τ 2 , and for sufficiently large w 1 and small τ we may neglect terms involving τ . The next task involves scaling the normal velocity w 1 in terms of the tangential velocity u 1 in relation to particle motions down an inclined surface. Hereafter we focus on lowestorder effects. With reference to the analysis presented in Appendix D, let W 0 denote the vertical reflection velocity of a particle following a collision. Assuming downslope motion, then for any finite horizontal reflection velocity U 0 and reflection angle φ, the magnitude of the vertical velocity at the next collision is given by where Z ≤ 0 is the vertical distance between the collisions. That is, From Appendix D, W 0 = v 0 sin(φ − θ )/ cos φ, Z = − 1 2 gt 2 0 + W 0 t 0 and t 0 = 2W 0 /g +2SW 0 /g, where v 0 is the surface parallel velocity associated with W 0 , t 0 is the travel time and S = tan θ . Using these relations with Eq. (E10) we obtain Expanding the trigonometric functions in Eq. (E11) as Taylor series and retaining the lowest-order term in φ, we obtain In effect the magnitude of w 1 is set by the gain in the magnitude of the vertical velocity associated with conversion of gravitational potential energy to translational energy with finite slope. This strengthens the normal impulse of the particle but only up to a slope (nominally 45 • ) beyond which the surface normal component of the vertical velocity begins to decrease.
As in Appendix D we now scale u 1 ∼ v 0 . Using Eq. (E12), at lowest order Eq. (E6) becomes Subtracting u 2 1 from both sides of Eq. (E13) and multiplying by m/2, Omitting subscripts, this is Comparing this result with the assumption E p = −β x E p , we may conclude that Note that Eqs. (E15) and (E16) pertain to a highly idealized collision. In fact, the quantities µ c , and φ are each random variables. Moreover, on an irregular hillslope surface the angle θ also is a random variable when viewed at the particle-surface collision scale. Nonetheless, for the purpose of scaling we may view this angle as a locally averaged value, and we now take the ensemble average of Eq. (E16) to give In turn, with µ = β x /4 tan φ we may write with M(θ ) = 2µ c (1 + ) 4 sin φ cos θ sin θ.
At lowest order, cos θ sin θ ∼ θ . We therefore may expect µ to systematically vary with the slope angle θ. Also note that µ is independent of particle size. We examine both of these points in the second companion paper (Furbish et al., 2021a). Subtracting u 2 1 from both sides of Eq. (8), multiplying by m/2 and retaining the lowest-order term, This is This result indicates that the onset of rotation with stick produces a large change in the slope-parallel kinetic energy. In this case, β x ≈ 0.5. Again notice that this result is independent of particle size. Nonetheless, the numerical factors in Eqs. (4), (8) and (E20) are set by the moment of inertia of the particle, which means that these factors vary with irregular particles. Also note that Eq. (E21) does not imply that half of the translational energy E p is converted entirely to rotational energy. Rather, half is converted to rotational energy and lost to work performed by friction prior to stick and by particle/surface deformation, thence dissipated as heat, vibrations and sound. More generally, with low-incident-angle motions, slip is more likely. Focusing on the first three terms on the right side of Eq. (E7) and using Eq. (E12) with v 0 ∼ u 1 , cos 2 θsin 2 θ cos 2 φ u 2 1 .
Subtracting ω 2 1 from both sides of Eq. (E22) and multiplying by I /2 = (1/5)mr 2 , which is This result suggests that in the absence of initial rotation (E r = 0), a change in rotational energy is directly related to the translational energy E p , where the proportion β x now represents the leading factors in the first term on the right side of Eq. (E24). With extant rotational motion, a weaker conversion of translational to rotational energy occurs according to the second term on the right side. Both cases are slope dependent due to the connection between w 1 and u 1 ∼ v 0 implied by Eq. (E12). Focusing on downslope motions, in general we may write the energy balance of a particle as Here, a positive change in rotational energy E r is seen as an extraction of translational energy. Then, for example, this loss is given explicitly by Eq. (E20) in the specific case of stick with the onset of rotation. An approximation of this loss is given by Eq. (24) for a frictional collision that does not necessarily involve stick. The term f c in Eq. (E25) represents losses associated with particle and surface deformation as well as work performed against friction during collision impulses (converted to heat, sound, etc.). This is represented, for example, by Eq. (15). But this term also includes losses associated with deformation of the surface at a scale larger than that of an idealized particle-surface impulse contact, namely, due to momentum exchanges associated with the sputtering of loose surface particles during collision. (The videos published as supplementary material to DiBiase et al. (2017) nicely illustrate this sputtering as well as the onset of rotational motion.) The term f y in Eq. (E25) represents energy losses not described in the preceding idealized formulation, namely, changes in downslope translational energy associated with glancing collisions that produce transverse translational motions and rotational motions oriented differently than that considered above (Fig. E1). In some cases, as described above, the change in energy E p can be expressed directly in terms of the energy state E p . However, the complexity of particle-surface collisions on natural hillslopes precludes explicitly demonstrating such a relation for all possible scenarios. Nonetheless, the examples above suggest that it is entirely defensible to assume that energy losses can be related to the energy state E p if the elements involved are formally viewed as random variables. Specifically, with the effect of slope angle θ on the impact velocity w 1 and its relation to u 1 ∼ v 0 via Eq. (E12), we can be confident that the loss E p is functionally related to the energy state E p . The simple relation E p = −β x E p thus is to be viewed as a hypothesis to be tested against data, as elaborated in the second companion paper (Furbish et al., 2021a).
Whereas the left side of Eq. (F2) is the arithmetic average E a , the right side is the harmonic average E h . Thus, Because the arithmetic average of a set of positive numbers is always greater than or equal to the harmonic average of this set, this inequality is indeed satisfied. These averages are equal only if all values of the set are equal; that is, the variance of the set is zero. We do not know the form of the underlying distribution f E p (E p , x). For physical reasons, however, it cannot be a distribution that supports E p → 0, as this coincides with particles at rest. For example, f E p (E p , x) cannot be an exponential or Weibull distribution with support E p ∈ [0, ∞). In contrast, the lognormal and gamma distributions with support E p ∈ (0, ∞) are admissible, and the Pareto distribution with support E p ∈ [E pm , ∞) is admissible.
As a point of reference, for a density f E p (E p ) with finite expected value E p , the density f y (y) of the reciprocal y = 1/E p may not have defined moments. This occurs, for example, if f E p (E p ) is exponential with support E p ∈ [0, ∞). Interestingly, if with x = ln(E p ) the density f x (x) is lognormal with mean µ, then with y = 1/x the density f y (y) also is lognormal with mean −µ.
For a density f y (y) of y = 1/E p with undefined mean, the average y calculated from a sample nonetheless is finite, as the probability of sampling precisely a value E p = 0 is identically zero. Moreover, as the variance of E p becomes small for finite mean E p , the product E p y = E p 1/E p → 1, as in the discrete case above.

Appendix G: Deposition rate
Our description of the deposition rate for a granular gas in a box has both similarities and dissimilarities with the processes of deposition (de-sublimation) and condensation. Here we briefly outline key points. In a closed system involving two phases (solid-gas or solid-liquid) at thermodynamic equilibrium, the rates of deposition and sublimation (or condensation and evaporation) are equal. That is, the rate at which molecules move from the solid phase to the gas phase (or from the liquid phase to the gas phase) is balanced by the rate at which molecules move from the gas phase to the solid phase (or from the gas phase to the liquid phase). These rates, in each direction, depend only on the thermal state of the system. Because the system has specified internal energy involving conservative particleparticle collisions, we do not need to appeal to the idea of heating and cooling (although this could be occurring). For a granular gas involving dissipative collisions, however, a nonequilibrium steady state is achieved only if it is continuously mechanically heated, and the rate of heating is matched by the rate of cooling due to the collisions. (Note that we refer to a non-equilibrium steady-state condition rather than thermal equilibrium, as unlike an ordinary gas, a granular gas can exhibit strong spatial correlations in the particle number density (see Brilliantov and Pöschel, 2004, 2005, and Brilliantov et al., 2018and van Zon and MacKintosh, 2004). However, this distinction is unimportant in relation to the behavior of particle motions on a hillslope envisioned as a rarefied granular gas.) Like an ordinary solidgas system, the rate of sublimation (entrainment) is matched by the rate of deposition (disentrainment), and the total particle energy and the average particle energy are fixed. Moreover, like an ordinary solid-gas system, the deposition rate depends on the physics of disentrainment in relation to its thermal state, not on the difference between the heating and cooling rates (which is zero at steady state). Heating modulates the deposition rate as described in the text.
To show how the generalized Pareto distribution is related to the ordinary Lomax distribution, we start by rewriting Eq. (H4) as fx(x) = b 1/a a 1+1/a (x + b/a) 1+1/a .
We now define the shape parameter a L = 1/a and the scale parameter b L = b/a. This gives a Lomax distribution, namely, Thus, for a > 0 the behavior of the generalized distribution, Eq. (H4), is the same as that of a Lomax distribution. The mean is We work with the generalized Pareto distribution in the form of Eq. (H4) because of the clear connection between its parameters and the disentrainment rate function, Eq. (H3), and because the condition a < 0 is physically meaningful.

Appendix I: Kirkby-Statham formulation
The formulation of Kirkby and Statham (1975) assumes that initial particle kinetic energy is dissipated in work performed by a fixed Coulomb-like friction to give an average travel distance. This idea can be formulated in terms of momentum and energy, then recast in terms of the rate of change in energy with respect to position x for comparison with the formulation presented in the main text.
In appealing to a Coulomb-like friction behavior, Kirkby and Statham (1975) start with F x = mg sin θ − µ d mg cos θ. With particle velocity u we write this as du(t) dt = g sin θ − µ d g cos θ.
Note that u(t) must be envisioned as representing an idealized "average" velocity of a group of particles viewed over time. This gives u(t) = (g sin θ − µ d g cos θ ) t + u 0 .
The total travel distance X is thus Using the initial squared velocity u 2 0 = 2 ghsin 2 θ , the instantaneous motion of an individual sliding particle or group of particles acting like a rigid body, the third term on the right side, representing collisional friction, actually is relevant to time-averaged or ensemble-averaged behavior rather than the instantaneous behavior of an individual particle (Riguidel et al., 1994;Samson et al., 1998Samson et al., , 1999. These terms are not additive as written. The gravity and Coulomb friction terms are like those in the formulation of Kirkby and Statham (1975). Because there is confusion in the literature regarding the collisional friction term, here we elaborate its form. Let n t denote the expected number of particle-surface interactions (collisions) per unit time as a particle moves downslope, and let β x denote the proportion of momentum parallel to x that is extracted during an individual collision involving the particle velocity u. Recognizing that both β x and u must be treated as random variables, and letting angle brackets denote an ensemble average, we may now assume that m d u dt ≈ mg sin θ − mn t β x u .
The first term on the right side of Eq. (J2) represents the uniform gravitational force, and the second term on the right side represents a frictional force due to particle-surface collisions (compare with Eq. (2) in Riguidel et al., 1994). As a reminder, this term is entirely analogous to the dissipation term that Haff (1983) introduced (formulated in terms of energy rather than momentum), leading to Haff's cooling law (Brilliantov and Pöschel, 2004;Yu et al., 2020). The proportion of momentum extracted, β x , involves an appropriate coefficient of restitution depending on the geometrical details of the collision. We may now assume that n t ∼ u / l, where l denotes a characteristic length scale representing the expected distance between collisions. This leads to d u dt ≈ g sin θ − 1 l β x u u , which is close to the form of Eq. (J1) with ψ = 2 (neglecting the Coulomb friction term), but not quite. We now focus on uniform, steady conditions such that u is unchanging with position or time, consistent with various experiments (Riguidel et al., 1994;Samson et al., 1998). This leads to β x u u ≈ lg sin θ. (J4) We now write β x = β x +β x and u = u +u , where primes denote deviations about the expected values. Substituting these expressions into Eq. (J4) and taking expected values then leads to β x u 2 + β x u u ≈ lg sin θ.
The product β x u 2 has the appearance of a nominal, nonlinear viscous term. Samson et al. (1998) suggest that this represents a Bagnold-like friction based on analogy with the scaling provided by Bagnold (1954), preceding the critical assessment of Bagnold's experimental work presented by Hunt et al. (2002). The term β x u u , neglected at the outset by Riguidel et al. (1994), looks like a linear viscous term, where the "viscosity" is given by the covariance β x u . Noting that Eq. (J5) is quadratic, we can solve for the velocity u and determine that at lowest order so long as ( β x u / β x ) 2 < 4lg sin θ/ β x . If this inequality is satisfied, then Eq. (J3) becomes giving ψ = 2. Note that the squared average velocity in Eq. (J7) does not imply that collisional friction is scaled with kinetic energy rather than momentum. This result occurs because n t is initially scaled with u and l. Quartier et al. (2000) present an analogous formulation; see their Eq. (4) and explanation of the squared velocity term. Dippel et al. (1997) also discuss this point. In relation to their experiments involving particles of radius R moving down an inclined surface roughened with a quasi-random monolayer of particles with radius r m , Riguidel et al. (1994) and Samson et al. (1998) propose the hypothesis that u ∼ sin θ . This derives from a scaling analysis in which the magnitude of the collisional momentum extraction (i.e., β x ) is written as a function of the relative smoothness R/r m . These authors plot measured values of u versus sin θ and suggest that the linear fit confirms a viscous-like behavior. Note, however, that because of the rather limited experimental range of sin θ (Fig. 2 in Riguide et al., 1994;Fig. 2 in Samson et al., 1998;Fig. 4 in Samson et al., 1999), the data are equally well fit by a straight line in a plot involving √ sin θ (Fig. J1), consistent with the collisional-based formulation, Eq. (J7). (Samson et al. (1999) acknowledge this limitation of the range of sin θ .) In addition, we can scale the length l as l ∼ r m /c A , where c A = 0.67 is the areal concentration of the surface-roughness particles. For a fixed velocity u , Eq. (J7) gives l ≈ β x u 2 /g sin θ .
With R/r m = 4.7 and a coefficient of restitution of ≈ 0.8, we can estimate β x ≈ 0.05. This gives l ≈ 0.5-0.6 mm over the range of measured velocities in Fig, J1, which is close to the experimental value of l = r m /c A = 0.8 mm, thereby reinforcing the collisional basis of Eq. (J7). Thus, a spherical particle that macroscopically rolls over a monolayer roughness is actually going bumpety-bump, colliding with monolayer particles during its motion.
Because the relative smoothness R/r m is not entirely adequate in scaling the collisional friction as c A varies (Samson et al., 1998), it remains unclear whether these experimental conditions involve an apparent viscous-like behavior where Figure J1. Plot of ensemble-averaged particle velocity u versus √ sin θ involving a steel sphere (R = 2.5 mm) moving over glass beads (r m = 0.53 mm) giving R/r m = 4.7; data from Samson et al. (1998). the effective viscosity depends only on roughness geometry (Dippel et al., 1997;Samson et al., 1999) or a squaredvelocity behavior as in Eq. (J7). Dippel et al. (1997) note that there is an apparent cross-over in behavior for very large and heavy spheres. Effects of the covariance of β x and u in relation to roughness geometry and the details of motions, including transverse motions, likely are important. Nonetheless, we emphasize that in the formulations of Riguidel et al. (1994) and Samson et al. (1998), a Coulomb friction behavior is not involved.
Returning to Eq. (J1), similarly there is no clear reason to include a Coulomb-like friction term, as natural irregular particles mostly do not slide down natural rough surfaces. In addition, if the starting point involves the derivative term on the left side and the gravitational term on the right side as written, then the collisional term on the right side should be a random quantity, thus leading to a stochastic differential equation -that is, a Langevin-like equation (Riguidel et al., 1994) -not an ordinary differential equation. Moreover, the idea of a dynamic friction coefficient is misapplied in the situation where rarefied particles tumble, roll and skitter over the surface. A Coulomb model is appropriate for sustained contact, and even then a dynamic friction involves collisional friction at the surface asperity scale. Particle-surface contacts on natural granular surfaces are not smooth at a scale commensurate with a sliding Coulomb model. A rolling coefficient of friction works for spheres moving over a relatively smooth surface, not for irregular tumbling particles involving non-collinear impacts. Moreover, the static normal weight of a particle, mg cos θ , does not set the particlesurface friction. Rather, dynamic forces during collision impulses matter (Brach, 1991;Stronge, 2000). This includes the dynamic Coulomb friction force associated with conversion of translational to rotational kinetic energy during col-lisions (Appendix E). Any resulting dynamic friction coefficient represents an ensemble-averaged ratio of tangential to normal momentum exchanges, both of which are random variables. (This point currently is being examined in studies of bed load and eolian transport; see for example Pähtz and Durán (2018).) Finally, the experiments of Quartier et al. (2000) involved rolling a cylinder over an inclined row of cylinders in an experiment designed to remove the transverse degree of freedom of motion. The Coulomb-like term in their formulation (see their Eq. 5) arises from trapping of the rolling cylinder between bumps and is unrelated to sliding as in a conventional Coulomb model. The condition of a constant roller velocity involving an "equilibrium between gravity driving and dissipation by the shocks" is roughly analogous to isothermal conditions described in the main text, but without effects of deposition. The dynamical angle in these experiments coincides with the situation in which the velocity of the roller is sufficient to prevent trapping, "assuming a permanent contact between the roller and the rough plane". Data availability. No data sets were used in this article.
Author contributions. All authors contributed to the conceptualization of the problem and its technical elements. DJF wrote the paper with critical review and input from the other authors. Because AMA is deceased, we offer the following clarification. AMA made key observations regarding particle motions in initial laboratory experiments during her MS work and worked closely with DJF in conceptualizing a description of these motions. Whereas we did not succeed in formalizing AMA's ideas in mathematical terms while she was pursuing her work, these ideas were essential to a proper description of particle motions and are now formalized in the paper. We also note that AMA's co-authorship carries full approval of her family.