Function field sieve

In mathematics the Function Field Sieve is one of the most efficient algorithms to solve the Discrete Logarithm Problem (DLP) in a finite field. It has heuristic subexponential complexity. Leonard Adleman developed it in 1994 L. Adleman. "The function field sieve". In: Algorithmic Number Theory (ANTS-I). Lecture Notes in Computer Science. Springer (1994), pp.108-121. and then elaborated it together with M. D. Huang in 1999.L. Adleman, M.D. Huang. "Function Field Sieve Method for Discrete Logarithms over Finite Fields". In: Inf. Comput. 151 (May 1999), pp. 5-16. DOI: 10.1006/inco.1998.2761.

Previous work includes the work of D. Coppersmith D. Coppersmith. (1984), "Fast evaluation of discrete logarithms in fields of characteristic two". In: IEEE Trans. Inform. Theory IT-39 (1984), pp. 587-594. about the DLP in fields of characteristic two.

The discrete logarithm problem in a finite field consists of solving the equation a^x = b for a,b \in \mathbb{F}_{p^n} , p a prime number and n an integer. The function f: \mathbb{F}_{p^n} \to \mathbb{F}_{p^n}, a \mapsto a^x for a fixed x \in \mathbb{N} is a one-way function used in cryptography. Several cryptographic methods are based on the DLP such as the Diffie-Hellman key exchange, the El Gamal cryptosystem and the Digital Signature Algorithm.

Number theoretical background

= Function Fields =

{{main|Algebraic function field}}

Let C(x,y) be a polynomial defining an algebraic curve over a finite field \mathbb{F}_{p} . A function field may be viewed as the field of fractions of the affine coordinate ring \mathbb{F}_p[x,y]/(C(x,y)), where (C(x,y)) denotes the ideal generated by C(x,y). This is a special case of an algebraic function field. It is defined over the finite field \mathbb{F}_p and has transcendence degree one. The transcendent element will be denoted by x.

There exist bijections between valuation rings in function fields and equivalence classes of places, as well as between valuation rings and equivalence classes of valuations.M. Fried and M. Jarden. In: "Field Arithmetic". vol. 11. (Jan. 2005). Chap. 2.1. DOI: 10.1007/b138352. This correspondence is frequently used in the Function Field Sieve algorithm.

= Divisors =

A discrete valuation of the function field K/\mathbb{F}_p, namely a discrete valuation ring \mathbb{F}_p \subset O \subset K, has a unique maximal ideal P called a prime of the function field. The degree of P is deg(P) = [O/P : \mathbb{F}_p] and we also define f_O = [O/P : \mathbb{F}_p].

A divisor is a \mathbb{Z}-linear combination over all primes, so d = \sum \alpha_PP where \alpha_P \in \mathbb{Z} and only finitely many elements of the sum are non-zero. The divisor of an element x \in K is defined as \text{div}(x) = \sum v_P(x)P, where v_P is the valuation corresponding to the prime P.

The degree of a divisor is \deg(d) = \sum \alpha_P \deg(P).

Method

The Function Field Sieve algorithm consists of a precomputation where the discrete logarithms of irreducible polynomials of small degree are found and a reduction step where they are combined to the logarithm of b.

Functions that decompose into irreducible function of degree smaller than some bound B are called B-smooth. This is analogous to the definition of a smooth number and such functions are useful because their decomposition can be found relatively fast. The set of those functions S = \{g(x) \in \mathbb{F}_p[x] \mid \text{ irreductible with } \deg(g) < B\} is called the factor base.

A pair of functions (r,s) is doubly-smooth if rm + s and N(ry+s) are both smooth, where N(\cdot,\cdot) is the norm of an element of K over \mathbb{F}_p, m \in \mathbb{F}_p[x] is some parameter and ry+s is viewed as an element of the function field of C.

The sieving step of the algorithm consists of finding doubly-smooth pairs of functions. In the subsequent step we use them to find linear relations including the logarithms of the functions in the decompositions. By solving a linear system we then calculate the logarithms.

In the reduction step we express \log_a(b) as a combination of the logarithm we found before and thus solve the DLP.

= Precomputation =

== Parameter selection ==

The algorithm requires the following parameters: an irreducible function f of degree n, a function m \in \mathbb{F}_p[x] and a curve C(x,y) of given degree d such that C(x,m) \equiv 0 \text{ mod } f. Here n is the power in the order of the base field \mathbb{F}_{p^n}. Let K denote the function field defined by C.

This leads to an isomorphism \mathbb{F}_{p^n} \simeq \mathbb{F}_p[x]/f and a homomorphism \phi: \mathbb{F}_p[x,y]/C \to \mathbb{F}_p[x]/f, y \mapsto m. Using the isomorphism each element of \mathbb{F}_{p^n} can be considered as a polynomial in \mathbb{F}_p[x]/f.

One also needs to set a smoothness bound B for the factor base S.

== Sieving ==

In this step doubly-smooth pairs of functions (r,s) \in \mathbb{F}_p[x] \times \mathbb{F}_p[x] are found.

One considers functions of the form f = (rm+s)N(ry+s), then divides f by any g \in S as many times as possible. Any f that is reduced to one in this process is B-smooth. To implement this, Gray code can be used to efficiently step through multiples of a given polynomial.

This is completely analogous to the sieving step in other sieving algorithms such as the Number Field Sieve or the index calculus algorithm. Instead of numbers one sieves through functions in \mathbb{F}_p[x] but those functions can be factored into irreducible polynomials just as numbers can be factored into primes.

== Finding linear relations ==

This is the most difficult part of the algorithm, involving function fields, places

and divisors as defined above. The goal is to use the doubly-smooth pairs of functions to find linear relations involving the discrete logarithms of elements in the factor base.

For each irreducible function in the factor base we find places v_1, v_2, ... of K that lie over them and surrogate functions \alpha_1, \alpha_2, ... that correspond to the places. A surrogate function \alpha_i \in K corresponding to a place v_i satisfies \text{div}(\alpha_i)=h(v_i-f_{v_i}u) where h is the class number of K and u is any fixed discrete valuation with f_u=1. The function defined this way is unique up to a constant in

\mathbb{F}_p.

By the definition of a divisor \text{div}(ry+s) = \sum a_i v_i for a_i = v_i(ry+s). Using this and the fact that \sum a_i f_{v_i} = \deg(\text{div}(ry+s)) = 0 we get the following expression:

:\text{div}((ry + s)^h) = \sum ha_iv_i = \sum ha_iv_i - \sum ha_if_{v_i}v + hv \sum a_if_{v_i} = \sum a_ih(v_i-f_{v_i}v) ) = \text{div}(\prod \alpha_i^{a_i})

where v is any valuation with f_v = 1. Then, using the fact that the divisor of a surrogate function is unique up to a constant, one gets

:(ry+s)^h = c \prod \alpha_i^{a_i} \text{ for some } c \in F_p^*

:\implies \phi((ry+s)^h) = \phi(c) \prod \phi(\alpha_i)^{a_i}

We now use the fact that \phi(ry+s) = rm+s and the known decomposition of this expression into irreducible polynomials. Let e_g be the power of g \in S in this decomposition. Then

: \prod_{g \in S} g^{he_g} \equiv \phi(c) \prod \phi(\alpha_i)^{a_i} \text{ mod } f

Here we can take the discrete logarithm of the equation up to a unit. This is called the restricted discrete logarithm \log_*(x). It is defined by the equation a^{\log_*(x)} = ux for some unit u \in \mathbb{F}_p.

: \sum_{g \in S} e_g \log_*g \equiv \sum a_i h_1 \log_*(\phi(\alpha_i)) \text{ mod } (p^n-1)/(p-1),

where h_1 is the inverse of h modulo (p^n-1)/(p-1).

The expressions h_1 \log_* (\phi(\alpha_i)) and the logarithms \log_*(g) are unknown. Once enough equations of this form are found, a linear system can be solved to find \log_*(g) for all g \in S. Taking the whole expression h_1 log_* (\phi(\alpha_i)) as an unknown helps to gain time, since h, h_1, \alpha_i or \phi(\alpha_i) don't have to be computed.

Eventually for each g \in S the unit corresponding to the restricted discrete logarithm can be calculated which then gives \log_a(g) = \log_*(g) - \log_a(u).

= Reduction step =

First a^lb mod f are computed for a random l < n . With sufficiently high probability this is \sqrt{nB} -smooth, so one can factor it as a^lb = \prod b_i for b_i \in \mathbb{F}_p[x] with \deg(b_i) < \sqrt{nB} . Each of these polynomials b_i can be reduced to polynomials of smaller degree using a generalization of the Coppersmith method. We can reduce the degree until we get a product of B-smooth polynomials. Then, taking the logarithm to the base a, we can eventually compute

:\log_a(b) = \sum_{g_i \in S} \log_a(g_i) - l, which solves the DLP.

Complexity

The Function Field Sieve is thought to run in subexponential time in

:\exp\left( \left(\sqrt[3]{\frac{32}{9}} + o(1)\right)(\ln p)^{\frac{1}{3}}(\ln \ln p)^{\frac{2}{3}}\right) =L_p\left[\frac{1}{3},\sqrt[3]{\frac{32}{9}}\right]

using the L-notation. There is no rigorous proof of this complexity since it relies on some heuristic assumptions. For example in the sieving step we assume that numbers of the form (rm+s)N(ry+s) behave like random numbers in a given range.

Comparison with other methods

There are two other well known algorithms that solve the discrete logarithm problem in sub-exponential time: the index calculus algorithm and a version of the Number Field Sieve.D. Gordon. "Discrete Logarithm in GF(P) Using the Number Field Sieve". In: Siam Journal on Discrete Mathematics - SIAMDM 6 (Feb. 1993), pp. 124-138. DOI: 10.1137/0406010. In their easiest forms both solve the DLP in a finite field of prime order but they can be expanded to solve the DLP in \mathbb{F}_{p^n} as well.

The Number Field Sieve for the DLP in \mathbb{F}_{p^n} has a complexity of L_p[1/3, (64/9)^{1/3} + o(1)] R. Barbulescu, P. Gaudry, T. Kleinjung. "The Tower Number Field Sieve". In: Advances in Cryptology – Asiacrypt 2015. Vol. 9453. Springer, May 2015. pp. 31-58 and is therefore slightly slower than the best performance of the Function Field Sieve. However, it is faster than the Function Field Sieve when n << (\log(p))^{1/2} . It is not surprising that there exist two similar algorithms, one with number fields and the other one with function fields. In fact there is an extensive analogy between these two kinds of global fields.

The index calculus algorithm is much easier to state than the Function Field Sieve and the Number Field Sieve since it does not involve any advanced algebraic structures. It is asymptotically slower with a complexity of L_p[1/2, \sqrt{2}]. The main reason why the Number Field Sieve and the Function Field Sieve are faster is that these algorithms can run with a smaller smoothness bound B, so most of the computations can be done with smaller numbers.

See also

References

{{reflist}}

{{Number-theoretic algorithms}}

Category:Field (mathematics)