Version of 3/9/05
y Jason Fulman (fulman@math.pitt.edu)
niversity of Pittsburgh Math Department, 414 Thackeray Hall Pittsburgh, PA 15260
<ph f="cmr"> </ph><ph f="cmbx">An Inductive Proof of the Berry-Esseen Theorem for Character Ratios</ph>

B

U
Abstract: Bolthausen used a variation of Stein's method to give an inductive proof of the Berry-Esseen theorem for sums of independent, identically distributed random variables. We modify this technique to prove a Berry-Esseen theorem for character ratios of a random representation of the symmetric group on transpositions. An analogous result is proved for Jack measure on partitions.
2000 Mathematics Subject Classification: 05E10, 60C05.
Key words and phrases: character ratio, Berry-Esseen theorem, Stein's method, Plancherel measure, Jack polynomial.

1 Introduction

The Plancherel measure of a finite group G   is a probability measure on the set of irreducible representations of G   which chooses a representation ρ   with probability d i m ( ρ ) 2 | G |   , where d i m ( ρ )   denotes the dimension of ρ   . For instance if G   is the symmetric group, the irreducible representations are parameterized by partitions λ   of n   , and the Plancherel measure chooses a partition λ   with probability n ! x λ h ( x ) 2   where the product is over boxes in the partition and h ( x )   is the hooklength of a box. The hooklength of a box x   is defined as 1 + number of boxes in same row as x and to the right of x + number of boxes in same column of x and below x. For example we have filled in each box in the partition of 7 below with its hooklength
6 4 2 1
3 1
1
and the Plancherel measure would choose this partition with probability 7 ! ( 6 * 4 * 3 * 2 ) 2   . Recently there has been interest in the statistical properties of partitions chosen from Plancherel measure and we refer the reader to the surveys [AlD, [Deand the seminal papers [J, [O1, [BOOfor a glimpse of the remarkable recent work on Plancherel measure. We recommend [Saas an introduction to representation theory of the symmetric group.
Let
λ   be a partition of n   chosen from the Plancherel measure of the symmetric group S n   and let χ λ ( 12 )   be the irreducible character parameterized by λ   evaluated on the transposition ( 12 )   . The quantity χ λ ( 12 ) d i m ( λ )   is called a character ratio and is crucial for analyzing the convergence rate of the random walk on the symmetric group generated by transpositions [DSh. In fact Diaconis and Shahshahani prove that the eigenvalues for this random walk are the character ratios χ λ ( 12 ) d i m ( λ )   each occurring with multiplicity d i m ( λ ) 2   .
Character ratios on transpositions also play an essential role in work on the moduli space of curves
[EO, [OP.
Given these motivations, it is natural to study the distribution of the character ratio
χ λ ( 12 ) d i m ( λ )   and there has been a substantial amount of work in this direction, which we now summarize. Kerov [K1proved that if λ   is chosen from the Plancherel measure of the symmetric group, then for all real x 0   , l i m n P ( n 1 2 χ λ ( 12 ) d i m ( λ ) x 0 ) = 1 2 π x 0 e t 2 2 d t .   The details of Kerov's argument appeared in [IO, which gave a beautiful development of Kerov's work. Hora [Hogave another proof of Kerov's result, exploiting the fact that the kth moment of a Plancherel distributed character ratio is equal to the chance that the random walk generated by random transpositions is at the identity after k steps. Both of these proofs were essentially combinatorial in nature and used the method of moments (and so information about all moments of the character ratio). Recent work of Sniady [Sn1, [Sn2understands these moments in terms of the genus expansion from random matrix theory.
A more probabilistic approach to Kerov's result appeared in
[F1, which proved that for all n 2   and real x 0   , | P ( n 1 2 χ λ ( 12 ) d i m ( λ ) x 0 ) 1 2 π x 0 e t 2 2 d t | 40.1 n 1 / 4 .   The proof used Stein's method (which is fundamentally different from the method of moments as it only uses information about a few lower order moments) and random walk on the set of irreducible representations of the symmetric group. Note that unlike Kerov's original result, this result includes an error term. The paper [F3used martingale theory to sharpen the error term to C s n s   for any s < 1 2   where C s   is a constant depending on s   . The paper [ShSudeveloped a refinement of Stein's method which led to a proof of the conjecture of [F1that an error term of C n 1 2   holds where C   is a universal constant. A second proof of this conjecture appears in [CF, using a different refinement of Stein's method. Both of these proofs used the “exchangeable pairs” approach to Stein's method.
The purpose of the present paper is to use a completely different technique to prove a bound of
40 n 1 2   . The method is based on Bolthausen's [Bolingenious inductive proof of the Berry-Esseen theorem for sums of independent identically distributed random variables. As in [F3, we write the character ratio as a sum of martingale differences, but these are neither independent nor identically distributed so some subtle combinatorics is required to adapt Bolthausen's method. This is not the first example of adapting Bolthausen's method to the non i.i.d. case; Bolthausen [Bolused the approach to study the distribution of 1 i n A i π ( i )   where A   is a fixed n × n   matrix and π   is a random permutation on n   symbols. But the case of character ratios is of considerable interest and quite unlike any other example to which his technique has been applied.
Note that a central limit theorem is known for some other conjugacy classes of the symmetric group
[K1, [IO, [Ho, but with no information about the error term (see the end of Section  2 for a conjecture). The technique of this paper does not obviously extend, however, since in Section  2 when we write ( n 2 ) χ λ ( 12 ) d i m ( λ )   as a sum of martingale differences, we require the fact that the expected value of the square of a summand given the previous summands is constant. This is false for general conjugacy classes. Also it is a nontrivial combinatorial problem to give upper bounds on the expected absolute value of the cubes of the summands. Fortunately for the case of transpositions this can be done without much difficulty. And the case of transpositions does seem to have unique practical importance [EO, [OP.
The contents of this paper are as follows. Section
 2 develops the combinatorics needed to adapt Bolthausen's method to the case of character ratios, and then proves an upper bound of 40 n 1 / 2   . Section  3 then recalls the Jack α   measure on partitions (here α > 0   is a parameter) and why it is interesting.
It then briefly indicates the modifications to the Plancherel case needed to prove a central limit theorem with an error term of
C α n 1 / 2   , where C α   is a constant depending on α   . This organization is natural since many algebraically inclined readers will want to understand the result for character ratios without needing combinatorics of Jack polynomials; thus a key lemma is given an algebraic proof in Section  2 and a combinatorial proof in Section  3 .

2 Central limit theorem for Plancherel measure

The random variable we wish to study is T n ( λ ) = ( n 2 ) χ λ ( 12 ) d i m ( λ )   where λ   is chosen from the Plancherel measure of the symmetric group S n   . To begin we write T n   as a sum of other random variables. For this we need Kerov's growth process on partitions [K2; this has a natural generalization to arbitrary finite groups [F3, but we only recall it in the case of interest. Given a partition λ ( j )   of size j   , one obtains a partition λ ( j + 1 )   of size j + 1   by choosing λ ( j + 1 )   with probability d i m ( λ ( j + 1 ) ) ( j + 1 ) d i m ( λ ( j ) )   if λ ( j + 1 )   can be obtained from λ ( j )   by adding a single box, and with probability 0 otherwise. Thus starting from λ ( 1 )   , the unique partition of size 1   , one obtains a random sequence ( λ ( 1 ) , , λ ( n ) )   of partitions. Kerov [K2proves that each λ ( j )   is distributed according to the Plancherel measure of S j   .
Given Kerov's growth process, one can write
T n = 1 ( n 2 ) ( X 1 + + X n )   where X 1 = 0   , χ λ ( 1 ) ( 12 )   is defined as 0, and X j = ( j 2 ) χ λ ( j ) ( 12 ) d i m ( λ ( j ) ) ( j 1 2 ) χ λ ( j 1 ) ( 12 ) d i m ( λ ( j 1 ) )   for j 2   .
Lemma
 2.1 states that the X j   are martingale differences satisfying special properties. We remark that [F3extends this lemma to more general conjugacy classes and groups. The notation E ( A | )   means the expected value of A   given   .
Lemma 2.1. ([F3)
Frobenius [Frfound the following explicit formula for the character ratio of the symmetric group on transpositions: χ λ ( 12 ) d i m ( λ ) = 1 ( n 2 ) i ( ( λ i 2 ) ( λ i 2 ) )   where λ i   is the length of row i   of λ   and λ i   is the length of column i   of λ   . From his formula it follows that X j = c ( x )   where x   is the box added to λ ( j 1 )   to obtain λ ( j )   and the “content” c ( x )   of a box is defined as column number of box row number of box.
Lemma
 2.2 gives the conditional second and fourth moments of the X j   's.
We emphasize that these were not derived or even stated in terms of character ratios, but rather were proved in a completely combinatorial way by studying the behavior of the moments of
c ( x )   where x   is the box added during Kerov's growth process. We remark that for other conjugacy classes, there is not an analog of the fact that E ( X j 2 | λ ( j 1 ) )   is independent of λ ( j 1 )   .
Lemma 2.2. Let λ ( j 1 )   be a partition of size j 1 1   .
Lemma  2.3 is a useful identity. Although a combinatorial proof can be given using properties of Schur functions, we defer combinatorial arguments to the more general setting of Jack polynomials in Section  3 and give an algebraic proof.
Lemma 2.3. Let e r ( z 1 , , z n ) = 1 i 1 < < i r n z i 1 z i r   be the rth elementary symmetric function of z 1 , , z n   . For λ   a partition of n   , let e r ( λ )   denote the rth elementary symmetric function of the contents of the boxes of λ   . Then E ( e r ( λ ) ) = 0   for 1 r n   .
Lemma  2.4 gives upper bounds for E ( | X n | 3 )   and for E ( | T n 1 | | X n | 3 )   .
Lemma 2.4. Suppose that n 3   .
Now we adapt Bolthausen's [Bolinductive proof of the Berry-Esseen theorem for i.i.d. random variables to the setting of character ratios. We remark that the unpublished notes of Mann [Manare a useful exposition of Bolthausen's proof and we refer to them in the proof of Theorem  2.5 .
Theorem 2.5. Let λ   be chosen from the Plancherel measure on partitions of size n   . Then for all n 2   and real x 0   , | P ( T n ( λ ) x 0 ) 1 2 π x 0 e t 2 2 d t | 40 n 1 / 2 .  
To conclude this section, we note that it would be of interest to prove the following (more general) conjecture.
Conjecture: Let i 2   be fixed. Then for all n i   and real x 0   , | P ( n ! ( n i ) ! i χ λ ( 12 i ) d i m ( λ ) x 0 ) 1 2 π x 0 e t 2 2 d t | C i n 1 / 2   where C i   is a constant depending on i   .

3 Central limit theorem for Jack measure

For α > 0   the Jack α   measure on partitions of size n   chooses a partition λ   with probability α n n ! x λ ( α a ( x ) + l ( x ) + 1 ) ( α a ( x ) + l ( x ) + α ) ,   where the product is over all boxes in the partition. Here a ( x )   denotes the number of boxes in the same row of x   and to the right of x   (the “arm” of x) and l ( x )   denotes the number of boxes in the same column of x   and below x   (the “leg” of x). For example the partition of 5 below
would have Jack α   measure 30 α 2 ( 3 α + 1 ) ( α + 2 ) ( 2 α + 1 ) ( α + 1 ) 2 .   Note that when α = 1   , Jack measure reduces to Plancherel measure of the symmetric group. The papers [O2, [BOemphasize that for α   fixed the study of Jack α   measure is an important open problem, about which relatively little is known for general values of α   . It is a discrete analog of eigenvalue ensembles from random matrix theory and like Jack polynomials [GHJ, should also be relevant to the moduli space of curves.
Given
α > 0   , the quantity to be studied is T n , α ( λ ) = i ( α ( λ i 2 ) ( λ i 2 ) ) α ( n 2 ) ,   where as usual λ i   is the length of the ith row of λ   and λ i   is the length of the ith column of λ   . It is of interest to study the quantity T n ( α )   under Jack measure for several reasons. When α = 1   it reduces to the study of the character ratio of transpositions under Plancherel measure. When α = 2   it is a spherical function of the Gelfand pair ( S 2 n , H 2 n )   where H 2 n   is the hyperoctahedral group of size 2 n n !   . Also by Corollary 1 of [DHol, there is a natural random walk on perfect matchings of the complete graph on n   vertices, whose eigenvalues are precisely T n , 2 ( λ ) n ( n 1 )   , occurring with multiplicity proportional to the Jack 2   measure of λ   .
The paper
[F2used the “exchangeable pairs” version of Stein's method to prove a central limit theorem for T n , α   with error term C α n 1 / 4   where C α   is a constant depending on α   . This was sharpened in [F3using martingales to C α , s n s   for any s < 1 2   , and the paper [CFgives a bound C α n 1 / 2   using a refinement of the “exchangeable pairs” version of Stein's method.
The main result of this section is Theorem
 3.1 .
Theorem 3.1. Suppose that α 1   and let λ   be chosen from the Jack α   measure on partitions of size n   . Then there is a constant C α   depending on α   so that for all n 2   and real x 0   , | P ( T n , α ( λ ) x 0 ) 1 2 π x 0 e t 2 2 d t | C α n 1 / 2 .  
Note that in Theorem  3.1 we suppose that α 1   since the Jack α   probability of λ   is equal to the Jack 1 α   probability of the transpose of λ   , implying that for any x   , the Jack α   probability that T n , α = x   is equal to the Jack 1 α   probability that T n , 1 α = x   . Also note that C α   must depend on α   , since by Corollary 5.3 of [F2, the random variable T n , α   has mean 0, variance 1, and third moment α 1 α ( n 2 )   .
There is no need to write out a proof of Theorem
 3.1 , which uses exactly the same logic as that of Theorem  2.5 . But it is necessary to give analogs of Lemmas  2.1 ,  2.2 ,  2.3 , and  2.4 , and we do that.
There is an
α   -analog of Kerov's growth process (due to Kerov [K4) giving a sequence of partitions ( λ ( 1 ) , , λ ( n ) )   with λ ( j )   distributed according to the Jack α   measure on partitions of size j   ; see [F3for details. Moreover from the definition of T n , α   , it follows that T n , α = 1 α ( n 2 ) ( X 1 , α + + X n , α ) .   Here X 1 , α = 0   and if j 2   then X j , α = c α ( x )   where x   is the box added to λ ( j 1 )   to obtain λ ( j )   and the “ α   -content” c α ( x )   of a box x   is defined to be α   (column number of x-1) (row number of x-1).
Lemma
 3.2 is an analog of Lemma  2.1 and is generalized in [F3to arbitrary spherical functions of the Gelfand pair ( S 2 n , H 2 n )   .
Lemma 3.2. ([F3)
Lemma  3.3 is the α   version of Lemma  2.2 .
Lemma 3.3. Let λ ( j 1 )   be a partition of size j 1 1   .
Lemma  3.4 is the α   version of Lemma  2.3 . The proof is combinatorial, as opposed to the algebraic argument given for Lemma  2.3 .
Lemma 3.4. Consider the Jack α   measure on partitions of size n   .
Finally, we give the analog of Lemma  2.4 .
Lemma 3.5. Suppose that n 3   . There is a constant D α   such that

4 Acknowledgements

The author was partially supported by NSA grant number H98230-05-1-0031. References

  1. Aldous, D. and Diaconis, P., Longest increasing subsequences: from patience sorting to the Baik-Deift-Johansson theorem, Bull. AMS (N.S.) 36 (1999), 413-432.
  2. Bolthausen, E., An estimate of the remainder term in a combinatorial central limit theorem, Z. Wahrsch. Verw. Gebiete 66 (1984), 379-386.
  3. Borodin, A., Okounkov, A., and Olshanski, G., Asymptotics of Plancherel measures for symmetric groups, J. Amer. Math. Soc. 13 (2000), 481-515.
  4. Borodin, A. and Olshanski, G., Z-measures on partitions and their scaling limits, preprint math-ph/0210048 at http://xxx.lanl.gov.
  5. Chatterjee, S. and Fulman, J., Stein's method for chi-squared approximation and spectral measure of Gelfand pairs, in preparation.
  6. Deift, P., Integrable systems and combinatorial theory, Notices Amer. Math. Soc. 47 (2000), 631-640.
  7. Diaconis, P. and Greene, C., Applications of Murphy's elements, Stanford University technical report no. 335 (1989).
  8. Diaconis, P. and Holmes, S., Random walk on trees and matchings, Elec. J. Probab. 7 (2002), 17 pages (electronic).
  9. Diaconis, P. and Shahshahani, M., Generating a random permutation with random transpositions, Z. Wahr. Verw. Gebiete 57 (1981), 159-179.
  10. Eskin, A. and Okounkov, A., Asymptotics of branched coverings of a torus and volumes of moduli spaces of holomorphic differentials, Invent. Math. 145 (2001), 59-103.
  11. Frobenuis, F., Uber die charaktere der symmetrischen gruppe, Sitz. Konig. Preuss. Akad. Wissen. (1900), 516-534; Gesammelte abhandlungen III, Springer-Verlag, Heidelberg, 1968, 148-166.
  12. Fulman, J., Stein's method and Plancherel measure of the symmetric group, Transac. Amer. Math. Soc. 357 (2005), 555-570.
  13. Fulman, J., Stein's method, Jack measure, and the Metropolis algorithm, J. Combin. Theory Ser. A 108 (2004), 275-296.
  14. Fulman, J., Martingales and character ratios, to appear in Transac. Amer. Math. Soc., available at http://www.math.pitt.edu/   fulman.
  15. Goulden, I., Harer, J., and Jackson, D., A geometric parametrization for the virtual Euler characteristic of the moduli spaces of real and complex algebraic curves, Trans. Amer. Math. Soc. 353 (2001), 4405-4427.
  16. Hora, A., Central limit theorem for the adjacency operators on the infinite symmetric group, Comm. Math. Phys. 195 (1998), 405-416.
  17. Ivanov, V. and Olshanski, G., Kerov's central limit theorem for the Plancherel measure on Young diagrams, in Symmetric Functions 2001: Surveys of developments and perspectives, Kluwer Academic Publishers, Dodrecht, 2002.
  18. Johansson, K., Discrete orthogonal polynomial ensembles and the Plancherel measure, Ann. of Math. (2) 153 (2001), 259-296.
  19. Kerov. S.V., Gaussian limit for the Plancherel measure of the symmetric group, Compt. Rend. Acad. Sci. Paris, Serie I, 316 (1993), 303-308.
  20. Kerov, S.V., The boundary of Young lattice and random Young tableaux, in Formal power series and algebraic combinatorics, DIMACS Ser. Discrete Math. Theoret. Comput. Sci. 24, Amer. Math. Soc., Providence, RI, (1996), 133-158.
  21. Kerov, S.V., Transition probabilities of continual Young diagrams and the Markov moment problem, Funct. Anal. Appl. 27 (1993), 104-117.
  22. Kerov, S.V., Anisotropic Young diagrams and Jack symmetric functions, Funct. Anal. Appl. 34 (2000), 41-51.
  23. Lassalle, M., Jack polynomials and some identities for partitions, Trans. Amer. Math. Soc. 356 (2004), 3455-3476.
  24. Macdonald, I., Symmetric functions and Hall polynomials, Second edition, Oxford University Press, New York, 1995.
  25. Mann, B., Bolthausen's proof of Berry-Esseen, unpublished manuscript (1994).
  26. Murphy, G.E., A new construction of Young's seminormal representation of the symmetric group, J. Algebra 69 (1981), 287-291.
  27. Okounkov, A., Random matrices and random permutations, Internat. Math. Res. Notices 20 (2000), 1043-1095.
  28. Okounkov, A., The uses of random partitions, preprint math-ph/0309015 at http://xxx.lanl.gov.
  29. Okounkov, A. and Pandaripandhe, R., Gromov-Witten theory, Hurwitz numbers, and Matrix models, I, preprint math.AG/0101147 at http://xxx.lanl.gov.
  30. Sagan, B., The symmetric group. Representations, combinatorial algorithms, and symmetric functions, Springer-Verlag, New York, 1991.
  31. Shao, Q., and Su, Z., The Berry-Esseen bound for character ratios, preprint (2004).
  32. Sniady, P., Asymptotics of characters of symmetric groups, Gaussian fluctuations of Young diagrams and genus expansion, preprint math.CO/0411647 at xxx.lanl.gov.
  33. Sniady, P., Gaussian fluctuations of characters of symmetric groups and of Young diagrams, preprint math.CO/0501112 at xxx.lanl.gov.
  34. Stein, C., Approximate computation of expectations, Institute of Mathematical Statistics Lecture Notes, Volume 7, 1986.

U