document the arf module

2025-03-05 09:21:38 -05:00 · 2014-05-08 19:35:52 +02:00 · 2014-05-08 19:35:52 +02:00 · decf8ca7e6
commit decf8ca7e6
parent 719ede74af
2 changed files with 461 additions and 8 deletions
--- a/doc/source/arf.rst
+++ b/doc/source/arf.rst
@ -3,5 +3,431 @@
 **arf.h** -- arbitrary-precision real floating-point numbers
 ===============================================================================

-TBD.
+The :type:`arf_t` type is essentially identical semantically to
+the :type:`fmpr_t` type, but uses an internal representation that
+generally allows operation to be performed more efficiently.
+The only significant difference that the user
+has to be aware of is that some :type:`arf_t` functions return an :type:`int`
+indicating whether a result is inexact, whereas the corresponding
+:type:`fmpr_t` functions return a :type:`long` encoding the relative
+exponent of the error.
+
+Types, macros and constants
+-------------------------------------------------------------------------------
+
+.. type:: arf_struct
+
+.. type:: arf_t
+
+    An :type:`arf_t` is defined as an array of length one of type
+    :type:`arf_struct`, permitting an :type:`arf_t` to be passed by reference.
+
+.. type:: arf_rnd_t
+
+    Specifies the rounding mode for the result of an approximate operation.
+
+.. macro:: ARF_RND_DOWN
+
+    Specifies that the result of an operation should be rounded to the
+    nearest representable number in the direction towards zero.
+
+.. macro:: ARF_RND_UP
+
+    Specifies that the result of an operation should be rounded to the
+    nearest representable number in the direction away from zero.
+
+.. macro:: ARF_RND_FLOOR
+
+    Specifies that the result of an operation should be rounded to the
+    nearest representable number in the direction towards minus infinity.
+
+.. macro:: ARF_RND_CEIL
+
+    Specifies that the result of an operation should be rounded to the
+    nearest representable number in the direction towards plus infinity.
+
+.. macro:: ARF_RND_NEAR
+
+    Specifies that the result of an operation should be rounded to the
+    nearest representable number, rounding to an odd mantissa if there is a tie
+    between two values. *Warning*: this rounding mode is currently
+    not implemented (except for a few conversions functions where this 
+    stated explicitly).
+
+.. macro:: ARF_PREC_EXACT
+
+    If passed as the precision parameter to a function, indicates that no
+    rounding is to be performed. This must only be used when it is known
+    that the result of the operation can be represented exactly and fits
+    in memory (the typical use case is working with small integer values).
+    Note that, for example, adding two numbers whose exponents are far
+    apart can easily produce an exact result that is far too large to
+    store in memory.
+
+Memory management
+-------------------------------------------------------------------------------
+
+.. function:: void arf_init(arf_t x)
+
+    Initializes the variable *x* for use. Its value is set to zero.
+
+.. function:: void arf_clear(arf_t x)
+
+    Clears the variable *x*, freeing or recycling its allocated memory.
+
+Special values
+-------------------------------------------------------------------------------
+
+.. function:: void arf_zero(arf_t x)
+
+.. function:: void arf_one(arf_t x)
+
+.. function:: void arf_pos_inf(arf_t x)
+
+.. function:: void arf_neg_inf(arf_t x)
+
+.. function:: void arf_nan(arf_t x)
+
+    Sets *x* respectively to 0, 1, `+\infty`, `-\infty`, NaN.
+
+.. function:: int arf_is_zero(const arf_t x)
+
+.. function:: int arf_is_one(const arf_t x)
+
+.. function:: int arf_is_pos_inf(const arf_t x)
+
+.. function:: int arf_is_neg_inf(const arf_t x)
+
+.. function:: int arf_is_nan(const arf_t x)
+
+    Returns nonzero iff *x* respectively equals 0, 1, `+\infty`, `-\infty`, NaN.
+
+.. function:: int arf_is_inf(const arf_t x)
+
+    Returns nonzero iff *x* equals either `+\infty` or `-\infty`.
+
+.. function:: int arf_is_normal(const arf_t x)
+
+    Returns nonzero iff *x* is a finite, nonzero floating-point value, i.e.
+    not one of the special values 0, `+\infty`, `-\infty`, NaN.
+
+.. function:: int arf_is_special(const arf_t x)
+
+    Returns nonzero iff *x* is one of the special values
+    0, `+\infty`, `-\infty`, NaN, i.e. not a finite, nonzero
+    floating-point value.
+
+.. function:: int arf_is_finite(arf_t x)
+
+    Returns nonzero iff *x* is a finite floating-point value,
+    i.e. not one of the values `+\infty`, `-\infty`, NaN.
+    (Note that this is not equivalent to the negation of
+    :func:`arf_is_inf`.)
+
+
+Assignment, rounding and conversions
+-------------------------------------------------------------------------------
+
+.. function:: void arf_set(arf_t y, const arf_t x)
+
+.. function:: void arf_set_mpz(arf_t y, const mpz_t x)
+
+.. function:: void arf_set_fmpz(arf_t y, const fmpz_t x)
+
+.. function:: void arf_set_ui(arf_t y, ulong x)
+
+.. function:: void arf_set_si(arf_t y, long x)
+
+.. function:: void arf_set_mpfr(arf_t y, const mpfr_t x)
+
+.. function:: void arf_set_fmpr(arf_t y, const fmpr_t x)
+
+    Sets *y* exactly to *x*.
+
+.. function:: void arf_swap(arf_t y, arf_t x)
+
+    Swaps *y* and *x* efficiently.
+
+.. function:: void arf_init_set_ui(arf_t y, ulong x)
+
+.. function:: void arf_init_set_si(arf_t y, long x)
+
+    Initialises *y* and sets it to *x* in a single operation.
+
+.. function:: int arf_set_round(arf_t y, const arf_t x, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_set_round_si(arf_t x, long v, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_set_round_ui(arf_t x, ulong v, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_set_round_mpz(arf_t y, const mpz_t x, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_set_round_fmpz(arf_t y, const fmpz_t x, long prec, arf_rnd_t rnd)
+
+    Sets *y* to *x*, rounded to *prec* bits in the direction
+    specified by *rnd*.
+
+.. function:: void arf_set_si_2exp_si(arf_t y, long m, long e)
+
+.. function:: void arf_set_ui_2exp_si(arf_t y, ulong m, long e)
+
+.. function:: void arf_set_fmpz_2exp(arf_t y, const fmpz_t m, const fmpz_t e)
+
+    Sets *y* to `m \times 2^e`.
+
+.. function:: int arf_set_round_fmpz_2exp(arf_t y, const fmpz_t x, const fmpz_t e, long prec, arf_rnd_t rnd)
+
+    Sets *y* to `x \times 2^e`, rounded to *prec* bits in the direction
+    specified by *rnd*.
+
+.. function:: void arf_get_fmpz_2exp(fmpz_t m, fmpz_t e, const arf_t x)
+
+    Sets *m* and *e* to the unique integers such that
+    `x = m \times 2^e` and *m* is odd,
+    provided that *x* is a nonzero finite fraction.
+    If *x* is zero, both *m* and *e* are set to zero. If *x* is
+    infinite or NaN, the result is undefined.
+
+.. function:: void arf_get_fmpr(fmpr_t y, const arf_t x)
+
+    Sets *y* exactly to *x*.
+
+.. function:: int arf_get_mpfr(mpfr_t y, const arf_t x, mpfr_rnd_t rnd)
+
+    Sets the MPFR variable *y* to the value of *x*. If the precision of *x*
+    is too small to allow *y* to be represented exactly, it is rounded in
+    the specified MPFR rounding mode. The return value (-1, 0 or 1)
+    indicates the direction of rounding, following the convention
+    of the MPFR library.
+
+Comparisons and bounds
+-------------------------------------------------------------------------------
+
+.. function:: int arf_equal(const arf_t x, const arf_t y)
+
+    Returns nonzero iff *x* and *y* are exactly equal. This function does
+    not treat NaN specially, i.e. NaN compares as equal to itself.
+
+.. function:: int arf_cmp(const arf_t x, const arf_t y)
+
+    Returns negative, zero, or positive, depending on whether *x* is
+    respectively smaller, equal, or greater compared to *y*.
+    Comparison with NaN is undefined.
+
+.. function:: int arf_cmpabs(const arf_t x, const arf_t y)
+
+    Compares the absolute values of *x* and *y*.
+
+.. function:: int arf_cmp_2exp_si(const arf_t x, long e)
+
+.. function:: int arf_cmpabs_2exp_si(const arf_t x, long e)
+
+    Compares *x* (respectively its absolute value) with `2^e`.
+
+.. function:: int arf_sgn(const arf_t x)
+
+    Returns `-1`, `0` or `+1` according to the sign of *x*. The sign
+    of NaN is undefined.
+
+.. function:: void arf_min(arf_t z, const arf_t a, const arf_t b)
+
+.. function:: void arf_max(arf_t z, const arf_t a, const arf_t b)
+
+    Sets *z* respectively to the minimum and the maximum of *a* and *b*.
+
+.. function:: long arf_bits(const arf_t x)
+
+    Returns the number of bits needed to represent the absolute value
+    of the mantissa of *x*, i.e. the minimum precision sufficient to represent
+    *x* exactly. Returns 0 if *x* is a special value.
+
+.. function:: int arf_is_int(const arf_t x)
+
+    Returns nonzero iff *x* is integer-valued.
+
+.. function:: int arf_is_int_2exp_si(const arf_t x, long e)
+
+    Returns nonzero iff *x* equals `n 2^e` for some integer *n*.
+
+.. function:: void arf_abs_bound_lt_2exp_fmpz(fmpz_t b, const arf_t x)
+
+    Sets *b* to the smallest integer such that `|x| < 2^b`.
+    If *x* is zero, infinity or NaN, the result is undefined.
+
+.. function:: void arf_abs_bound_le_2exp_fmpz(fmpz_t b, const arf_t x)
+
+    Sets *b* to the smallest integer such that `|x| \le 2^b`.
+    If *x* is zero, infinity or NaN, the result is undefined.
+
+.. function:: long arf_abs_bound_lt_2exp_si(const arf_t x)
+
+    Returns the smallest integer *b* such that `|x| < 2^b`, clamping
+    the result to lie between -*ARF_PREC_EXACT* and *ARF_PREC_EXACT*
+    inclusive. If *x* is zero, -*ARF_PREC_EXACT* is returned,
+    and if *x* is infinity or NaN, *ARF_PREC_EXACT* is returned.
+
+.. function:: void arf_get_mag(mag_t y, const arf_t x)
+
+.. function:: void arf_get_mag_lower(mag_t y, const arf_t x)
+
+.. function:: void arf_set_mag(arf_t y, const mag_t x)
+
+.. function:: void mag_init_set_arf(mag_t y, const arf_t x)
+
+.. function:: void mag_fast_init_set_arf(mag_t y, const arf_t x)
+
+Random number generation
+-------------------------------------------------------------------------------
+
+.. function:: void arf_randtest(arf_t x, flint_rand_t state, long bits, long mag_bits)
+
+    Generates a finite random number whose mantissa has precision at most
+    *bits* and whose exponent has at most *mag_bits* bits. The
+    values are distributed non-uniformly: special bit patterns are generated
+    with high probability in order to allow the test code to exercise corner
+    cases.
+
+.. function:: void arf_randtest_not_zero(arf_t x, flint_rand_t state, long bits, long mag_bits)
+
+    Identical to :func:`arf_randtest`, except that zero is never produced
+    as an output.
+
+.. function:: void arf_randtest_special(arf_t x, flint_rand_t state, long bits, long mag_bits)
+
+    Indentical to :func:`arf_randtest`, except that the output occasionally
+    is set to an infinity or NaN.
+
+Input and output
+-------------------------------------------------------------------------------
+
+.. function:: void arf_debug(const arf_t x)
+
+    Prints information about the internal representation of *x*.
+
+.. function:: void arf_print(const arf_t x)
+
+    Prints *x* as an integer mantissa and exponent.
+
+.. function:: void arf_printd(const arf_t y, long d)
+
+    Prints *x* as a decimal floating-point number, rounding to *d* digits.
+    This function is currently implemented using MPFR,
+    and does not support large exponents.
+
+Arithmetic
+-------------------------------------------------------------------------------
+
+.. function:: void arf_abs(arf_t y, const arf_t x)
+
+    Sets *y* to the absolute value of *x*.
+
+.. function:: void arf_neg(arf_t y, const arf_t x)
+
+    Sets `y = -x` exactly.
+
+.. function:: int arf_neg_round(arf_t y, const arf_t x, long prec, arf_rnd_t rnd)
+
+    Sets `y = -x`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact.
+
+.. function:: void arf_mul_2exp_si(arf_t y, const arf_t x, long e)
+
+.. function:: void arf_mul_2exp_fmpz(arf_t y, const arf_t x, const fmpz_t e)
+
+    Sets `y = x 2^e` exactly.
+
+.. function:: int arf_mul(arf_t z, const arf_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_mul_ui(arf_t z, const arf_t x, ulong y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_mul_si(arf_t z, const arf_t x, long y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_mul_mpz(arf_t z, const arf_t x, const mpz_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_mul_fmpz(arf_t z, const arf_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+    Sets `z = x \times y`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact.
+
+.. function:: int arf_add(arf_t z, const arf_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_add_si(arf_t z, const arf_t x, long y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_add_ui(arf_t z, const arf_t x, ulong y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_add_fmpz(arf_t z, const arf_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+    Sets `z = x + y`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact.
+
+.. function:: int arf_sub(arf_t z, const arf_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_sub_si(arf_t z, const arf_t x, long y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_sub_ui(arf_t z, const arf_t x, ulong y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_sub_fmpz(arf_t z, const arf_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+    Sets `z = x - y`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact.
+
+.. function:: int arf_addmul(arf_t z, const arf_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_addmul_ui(arf_t z, const arf_t x, ulong y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_addmul_si(arf_t z, const arf_t x, long y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_addmul_mpz(arf_t z, const arf_t x, const mpz_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_addmul_fmpz(arf_t z, const arf_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+    Sets `z = z + x \times y`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact.
+
+.. function:: int arf_submul(arf_t z, const arf_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_submul_ui(arf_t z, const arf_t x, ulong y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_submul_si(arf_t z, const arf_t x, long y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_submul_mpz(arf_t z, const arf_t x, const mpz_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_submul_fmpz(arf_t z, const arf_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+    Sets `z = z - x \times y`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact.
+
+.. function:: int arf_div(arf_t z, const arf_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_div_ui(arf_t z, const arf_t x, ulong y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_ui_div(arf_t z, ulong x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_div_si(arf_t z, const arf_t x, long y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_si_div(arf_t z, long x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_div_fmpz(arf_t z, const arf_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_fmpz_div(arf_t z, const fmpz_t x, const arf_t y, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_fmpz_div_fmpz(arf_t z, const fmpz_t x, const fmpz_t y, long prec, arf_rnd_t rnd)
+
+    Sets `z = x / y`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact. The result is NaN if *y* is zero.
+
+.. function:: int arf_sqrt(arf_t z, const arf_t x, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_sqrt_ui(arf_t z, ulong x, long prec, arf_rnd_t rnd)
+
+.. function:: int arf_sqrt_fmpz(arf_t z, const fmpz_t x, long prec, arf_rnd_t rnd)
+
+    Sets `z = \sqrt{x}`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact. The result is NaN if *x* is negative.
+
+.. function:: int arf_rsqrt(arf_t z, const arf_t x, long prec, arf_rnd_t rnd)
+
+    Sets `z = 1/\sqrt{x}`, rounded to *prec* bits in the direction specified by *rnd*,
+    returning nonzero iff the operation is inexact. The result is NaN if *x* is
+    negative, and `+\infty` if *x* is zero.

--- a/doc/source/mag.rst
+++ b/doc/source/mag.rst
@ -4,7 +4,8 @@
 ===============================================================================

 The :type:`mag_t` type is an unsigned floating-point type with a
-fixed-precision mantissa (30 bits) and unlimited exponent range, suited for
+fixed-precision mantissa (30 bits) and an arbitrary-precision
+exponent (represented as an :type:`fmpz_t`), suited for
 representing and rigorously manipulating magnitude bounds efficiently.
 Operations always produce a strict upper or lower bound, but for performance
 reasons, no attempt is made to compute the best possible bound
@ -13,10 +14,6 @@ The special values zero and positive infinity are supported (but not NaN).
 Applications requiring more flexibility (such as correct rounding, or
 higher precision) should use the :type:`arf_t` type instead.

-The exponent is represented as an :type:`fmpz`. Special "fast" methods are
-provided for manipulating :type:`mag_t` objects under the assumption that
-no infinities are present and that exponents stay within a certain range.
-
 Types, macros and constants
 -------------------------------------------------------------------------------

@ -75,7 +72,7 @@ Arithmetic

 .. function:: void mag_mul(mag_t z, const mag_t x, const mag_t y)

-    Sets `z` to an upper bound for `x + y`.
+    Sets `z` to an upper bound for `xy`.

 .. function:: void mag_addmul(mag_t z, const mag_t x, const mag_t y)

@ -92,20 +89,50 @@ Arithmetic
 Fast versions
 -------------------------------------------------------------------------------

+The following methods assume that all inputs are finite and that all exponents
+(in all inputs as well as the final result) fit as *fmpz* inline values.
+They also assume that the output variables do not have promoted exponents,
+as they will be overwritten directly (thus leaking memory).
+
 .. function:: void mag_fast_init_set(mag_t x, const mag_t y)

+    Initialises *x* and sets it to the value of *y*.
+
 .. function:: void mag_fast_zero(mag_t x)

+    Sets *x* to zero.
+
 .. function:: int mag_fast_is_zero(const mag_t x)

-.. function:: void mag_fast_init_set_arf(mag_t y, const arf_t x)
+    Returns nonzero iff *x* to zero.

 .. function:: void mag_fast_mul(mag_t z, const mag_t x, const mag_t y)

+    Sets `z` to an upper bound for `xy`.
+
 .. function:: void mag_fast_addmul(mag_t z, const mag_t x, const mag_t y)

+    Sets `z` to an upper bound for `z + xy`.
+
 .. function:: void mag_fast_add_2exp_si(mag_t z, const mag_t x, long e)

+    Sets `z` to an upper bound for `x + 2^e`.
+
+Input and output
+-------------------------------------------------------------------------------
+
+.. function:: void mag_print(const mag_t x)
+
+    Prints *x* to standard output.
+
+Random generation
+-------------------------------------------------------------------------------
+
+.. function:: void mag_randtest(mag_t x, flint_rand_t state, long expbits)
+
+    Sets *x* to a random value, with an exponent up to *expbits* bits large.
+    The special values zero and infinity are occasionally generated.
+
 Conversions
 -------------------------------------------------------------------------------