Cubic function

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

<templatestyles src="Module:Hatnote/styles.css"></templatestyles>

Graph of a cubic function with 3 real roots (where the curve crosses the horizontal axis—where y = 0) (and hence no complex roots). The case shown has two critical points. Here the function is ƒ(x) = (x3 + 3x2 − 6x − 8) / 4. There would be only one real root, and two complex roots, if the local minimum were above the x axis, or if the local maximum were below the x axis, or if there were no local minimum or maximum and the inflection point were off the x axis.

In algebra, a cubic function is a function of the form

f(x)=ax^3+bx^2+cx+d,\,

where a is nonzero. In other words, a cubic function is defined by a polynomial of degree three.

Setting ƒ(x) = 0 produces a cubic equation of the form:

ax^3+bx^2+cx+d=0.\,

Usually, the coefficients a, b, c, d are real numbers. However much of the theory of cubic equations for real coefficients applies to other types of coefficients (such as complex ones).[1]

Solving the cubic equation is equivalent to finding the particular value (or values) of x for which ƒ(x) = 0. There are various methods to solve cubic equations. The solutions of a cubic equation, also called roots of the cubic function, can always be found algebraically. (This is also true of a quadratic or quartic (fourth degree) equation, but no higher-degree equation, by the Abel–Ruffini theorem). The roots can also be found trigonometrically. Alternatively, one can find a numerical approximation of the roots in the field of the real or complex numbers such as by using root-finding algorithms like Newton's method.

History

Cubic equations were known to the ancient Babylonians, Greeks, Chinese, Indians, and Egyptians.[2][3][4] Babylonian (20th to 16th centuries BC) cuneiform tablets have been found with tables for calculating cubes and cube roots.[5][6] The Babylonians could have used the tables to solve cubic equations, but no evidence exists to confirm that they did.[7] The problem of doubling the cube involves the simplest and oldest studied cubic equation, and one for which the ancient Egyptians did not believe a solution existed.[8] In the 5th century BC, Hippocrates reduced this problem to that of finding two mean proportionals between one line and another of twice its length, but could not solve this with a compass and straightedge construction,[9] a task which is now known to be impossible. Methods for solving cubic equations appear in The Nine Chapters on the Mathematical Art, a Chinese mathematical text compiled around the 2nd century BC and commented on by Liu Hui in the 3rd century.[3] In the 3rd century, the ancient Greek mathematician Diophantus found integer or rational solutions for some bivariate cubic equations (Diophantine equations).[4][10] Hippocrates, Menaechmus and Archimedes are believed to have come close to solving the problem of doubling the cube using intersecting conic sections,[9] though historians such as Reviel Netz dispute whether the Greeks were thinking about cubic equations or just problems that can lead to cubic equations. Some others like T. L. Heath, who translated all Archimedes' works, disagree, putting forward evidence that Archimedes really solved cubic equations using intersections of two cones, but also discussed the conditions where the roots are 0, 1 or 2.[11]

Two-dimensional graph of a cubic, the polynomial ƒ(x) = 2x3 − 3x2 − 3x + 2.

In the 7th century, the Tang dynasty astronomer mathematician Wang Xiaotong in his mathematical treatise titled Jigu Suanjing systematically established and solved 25 cubic equations of the form x^3+px^2+qx=N, 23 of them with p,q \ne 0, and two of them with q = 0.[12]

In the 11th century, the Persian poet-mathematician, Omar Khayyám (1048–1131), made significant progress in the theory of cubic equations. In an early paper, he discovered that a cubic equation can have more than one solution and stated that it cannot be solved using compass and straightedge constructions. He also found a geometric solution.[13][14] In his later work, the Treatise on Demonstration of Problems of Algebra, he wrote a complete classification of cubic equations with general geometric solutions found by means of intersecting conic sections.[15][16]

In the 12th century, the Indian mathematician Bhaskara II attempted the solution of cubic equations without general success. However, he gave one example of a cubic equation:[17]

x^3+12x=6x^2+35 \,

In the 12th century, another Persian mathematician, Sharaf al-Dīn al-Tūsī (1135–1213), wrote the Al-Mu'adalat (Treatise on Equations), which dealt with eight types of cubic equations with positive solutions and five types of cubic equations which may not have positive solutions. He used what would later be known as the "Ruffini-Horner method" to numerically approximate the root of a cubic equation. He also developed the concepts of a derivative function and the maxima and minima of curves in order to solve cubic equations which may not have positive solutions.[18] He understood the importance of the discriminant of the cubic equation to find algebraic solutions to certain types of cubic equations.[19]

Leonardo de Pisa, also known as Fibonacci (1170–1250), was able to closely approximate the positive solution to the cubic equation x3 + 2x2 + 10x = 20, using the Babylonian numerals. He gave the result as 1,22,7,42,33,4,40 (equivalent to 1 + 22/60 + 7/602 + 42/603 + 33/604 + 4/605 + 40/606),[20] which differs from the correct value by only about three trillionths.

In the early 16th century, the Italian mathematician Scipione del Ferro (1465–1526) found a method for solving a class of cubic equations, namely those of the form x3 + mx = n. In fact, all cubic equations can be reduced to this form if we allow m and n to be negative, but negative numbers were not known to him at that time. Del Ferro kept his achievement secret until just before his death, when he told his student Antonio Fiore about it.

Niccolò Fontana Tartaglia

In 1530, Niccolò Tartaglia (1500–1557) received two problems in cubic equations from Zuanne da Coi and announced that he could solve them. He was soon challenged by Fiore, which led to a famous contest between the two. Each contestant had to put up a certain amount of money and to propose a number of problems for his rival to solve. Whoever solved more problems within 30 days would get all the money. Tartaglia received questions in the form x3 + mx = n, for which he had worked out a general method. Fiore received questions in the form x3 + mx2 = n, which proved to be too difficult for him to solve, and Tartaglia won the contest.

Later, Tartaglia was persuaded by Gerolamo Cardano (1501–1576) to reveal his secret for solving cubic equations. In 1539, Tartaglia did so only on the condition that Cardano would never reveal it and that if he did write a book about cubics, he would give Tartaglia time to publish. Some years later, Cardano learned about Ferro's prior work and published Ferro's method in his book Ars Magna in 1545, meaning Cardano gave Tartaglia 6 years to publish his results (with credit given to Tartaglia for an independent solution). Cardano's promise with Tartaglia stated that he not publish Tartaglia's work, and Cardano felt he was publishing del Ferro's, so as to get around the promise. Nevertheless, this led to a challenge to Cardano by Tartaglia, which Cardano denied. The challenge was eventually accepted by Cardano's student Lodovico Ferrari (1522–1565). Ferrari did better than Tartaglia in the competition, and Tartaglia lost both his prestige and income.[21]

Cardano noticed that Tartaglia's method sometimes required him to extract the square root of a negative number. He even included a calculation with these complex numbers in Ars Magna, but he did not really understand it. Rafael Bombelli studied this issue in detail and is therefore often considered as the discoverer of complex numbers.

François Viète (1540–1603) independently derived the trigonometric solution for the cubic with three real roots, and René Descartes (1596–1650) extended the work of Viète.[22]

Critical points of a cubic function

The roots, turning points, stationary points, inflection point and concavity of a cubic polynomial x³ - 3x² - 144x + 432 (black line) and its first and second derivatives (red and blue).

The critical points of a cubic equation are those values of x where the slope of the cubic function is zero. They are found by setting derivative of the cubic equation equal to zero obtaining: f ′(x) = 3ax2 + 2bx + c = 0. The solutions of that equation are the critical points of the cubic equation and are given by: (using the quadratic formula)

x=\frac{-b \pm \sqrt {b^2-3ac}}{3a}.

If b2 − 3ac > 0, then the cubic function has a local maximum and a local minimum. If b2 − 3ac = 0, then the cubic's inflection point is the only critical point. If b2 − 3ac < 0, then there are no critical points. In the cases where b2 − 3ac ≤ 0, the cubic function is strictly monotonic.

Roots of a cubic function

The general cubic equation has the form

ax^3+bx^2+cx+d=0 \qquad(1)

with a\neq 0\,.

This section describes how the roots of such an equation may be computed. The coefficients a, b, c, d are generally assumed to be real numbers, but most of the results apply when they belong to any field of characteristic not 2 or 3.

The nature of the roots

Every cubic equation (1) with real coefficients has at least one solution x among the real numbers; this is a consequence of the intermediate value theorem. We can distinguish several possible cases using the discriminant,

 \Delta = 18abcd -4b^3d + b^2c^2 - 4ac^3 - 27a^2d^2. \,

The following cases need to be considered: [23]

  • If Δ > 0, then the equation has three distinct real roots.
  • If Δ = 0, then the equation has a multiple root and all its roots are real.
  • If Δ < 0, then the equation has one real root and two nonreal complex conjugate roots.

For information about the location in the complex plane of the roots of a polynomial of any degree, including degree three, see Properties of polynomial roots and Routh–Hurwitz stability criterion

General formula for roots

For the general cubic equation

a x^3 + b x^2 + c x + d = 0

the general formula for the roots, in terms of the coefficients, is as follows:[24]

x_k = - \frac{1}{3a}\left(b\ +\ u_k C\ +\ \frac{\Delta_0}{u_kC}\right)\ , \qquad k \in \{1,2, 3\}

where

u_1 = 1\ ,\qquad u_2 = {-1 + i\sqrt{3} \over 2}\ ,\qquad u_3 = {-1 - i\sqrt{3} \over 2}

are the three cube roots of unity, and where

C = \sqrt[3]{\frac{\Delta_1 + \sqrt{\Delta_1^2 - 4 \Delta_0^3}}{2}} \qquad \qquad {\color{white}.} (see below for special cases)

with

\begin{align}
  \Delta_0 &= b^2-3 a c \\
  \Delta_1 &= 2 b^3-9 a b c+27 a^2 d
\end{align}

and

\Delta_1^2 - 4 \Delta_0^3 = -27\,a^2\,\Delta\ , where \Delta is the discriminant discussed above.

In these formulae, \sqrt{~~} and \sqrt[3]{~~} denote any choice for the square or cube roots. Changing of choice for the square root amounts to exchanging x_2 and x_3. Changing of choice for the cube root amounts to circularly permuting the roots. Thus the freeness of choosing a determination of the square or cube roots corresponds exactly to the freeness for numbering the roots of the equation.

Four centuries ago, Gerolamo Cardano proposed a similar formula (see below), which still appears in many textbooks:

x_k = - \frac{1}{3a}\left(b\ +\ u_k C\ +\ \bar u_k \bar C\right)

where

\bar C = \sqrt[3]{\frac{\Delta_1 - \sqrt{\Delta_1^2 - 4 \Delta_0^3}}{2}}

and \bar u_k is the complex conjugate of u_k (note that C\bar C=\Delta_0).

However, this formula is applicable without further explanation only when a, b, c, d are real numbers and the operand of the square root, i.e., \Delta_1^2 - 4 \Delta_0^3, is non-negative. When this operand is real and non-negative, the square root refers to the principal (positive) square root and the cube roots in the formula are to be interpreted as the real ones. Otherwise, there is no real square root and one can arbitrarily choose one of the imaginary square roots (the same one everywhere in the solution). For extracting the complex cube roots of the resulting complex expression, we have also to choose among three cube roots in each part of each solution, giving nine possible combinations of one of three cube roots for the first part of the expression and one of three for the second. The correct combination is such that the two cube roots chosen for the two terms in a given solution expression are complex conjugates of each other (whereby the two imaginary terms in each solution cancel out).

The next sections describe how these formulas may be obtained.

Special cases

If \Delta \neq 0 and \Delta_0 = 0, the sign of \sqrt{\Delta_1^2 - 4 \Delta_0^3}=\sqrt{\Delta_1^2} has to be chosen to have C \neq 0, that is one should define \sqrt{\Delta_1^2} =\Delta_1, whichever is the sign of \Delta_1.

If \Delta = 0 and \ \Delta_0 = 0, the three roots are equal:

x_1=x_2=x_3=-\frac{b}{3a}.

If \Delta=0 and  \Delta_0 \neq 0, the above expression for the roots is correct but misleading, hiding the fact that no radical is needed to represent the roots. In fact, in this case, there is a double root,

 x_1=x_2=\frac{9ad-bc}{2\Delta_0},

and a simple root

 x_3=\frac{4abc-9a^2d-b^3}{a\Delta_0}.

Reduction to a depressed cubic

Dividing Equation (1) by a and substituting  t-\frac{b}{3a} for x (the Tschirnhaus transformation) we get the equation

t^3+pt+q=0 \qquad(2)

where

\begin{align}
p=&\frac{3ac-b^2}{3a^2} \\
q=&\frac{2b^3-9abc+27a^2d}{27a^3}.
\end{align}

The left hand side of equation (2) is a monic trinomial called a depressed cubic.

Any formula for the roots of a depressed cubic may be transformed into a formula for the roots of Equation (1) by substituting the above values for p and q and using the relation  x=t-\frac{b}{3a}.

Therefore, only Equation (2) is considered in the following.

Cardano's method

The solutions can be found with the following method due to Scipione del Ferro and Tartaglia, published by Gerolamo Cardano in 1545.[25]

This method applies to the depressed cubic

 t^3 + pt + q = 0\,. \qquad (2)

We introduce two variables u and v linked by the condition

u+v=t\,

and substitute this in the depressed cubic (2), giving

 u^3+v^3+(3uv+p)(u+v)+q=0 \qquad (3)\,.

At this point Cardano imposed a second condition for the variables u and v:

 3uv+p=0\,.

As the first parenthesis vanishes in (3), we get  u^3+v^3=-q and  u^3v^3=-p^3/27. The combination of these two equations leads to a quadratic equation (since they are the sum and the product of  u^3 and  v^3 ). Thus  u^3 and  v^3 are the two roots of the following quadratic equation:

 z^2 + qz - {p^3\over 27} = 0\,.

At this point, Cardano, who did not know complex numbers, supposed that the roots of this equation were real, that is that  \frac{q^2}{4}+\frac{p^3}{27} >0\,.

Solving this equation and using the fact that  u and  v may be exchanged, we find

 u^{3}=-{q\over 2} - \sqrt{{q^{2}\over 4}+{p^{3}\over 27}} and v^{3}=-{q\over 2} + \sqrt{{q^{2}\over 4}+{p^{3}\over 27}}.

As these expressions are real, their cube roots are well-defined and, like Cardano, we get

 t_1=u+v=\sqrt[3]{-{q\over 2}- \sqrt{{q^{2}\over 4}+{p^{3}\over 27}}} +\sqrt[3]{-{q\over 2} +\sqrt{{q^{2}\over 4}+{p^{3}\over 27}}.}

Given the assumption that  \frac{q^2}{4}+\frac{p^3}{27} >0\, , Equation (2) also has two complex roots. These are obtained by considering the complex cube roots appearing in the above formula; the fact  uv is real implies that one is obtained by multiplying the first of the above cube roots by \,\tfrac{-1}{2} + i\tfrac{\sqrt{3}}{2}\, and the second by \,\tfrac{-1}{2} - i\tfrac{\sqrt{3}}{2}\,, and vice versa for the other one.

If  \frac{q^2}{4}+\frac{p^3}{27}\, is not necessarily positive, we have to choose a cube root of u^3. As there is no direct way to choose the corresponding cube root of v^3, one has to use the relation v=-\frac{p}{3u}, which gives

 u=\sqrt[3]{-{q\over 2}- \sqrt{{q^{2}\over 4}+{p^{3}\over 27}}} \qquad (4)

and

t=u-\frac{p}{3u}\,.

Note that the sign of the square root does not affect the resulting t, because changing it amounts to exchanging u and v. We have chosen the minus sign to have u\ne 0 when p = 0 and q\ne 0, in order to avoid a division by zero. With this choice, the above expression for t always works, except when p = q=0, where the second term becomes 0/0. In this case there is a triple root  t=0.

Note also that in several cases the solutions are expressed with fewer square or cube roots

If p=q=0 then we have the triple real root
t=0.\,
If p=0 and q\ne 0 then
u=-\sqrt[3]{q} \text{ and } v = 0
and the three roots are the three cube roots of -q.
If p\ne 0 and q=0 then
u=\sqrt{{p\over 3}} \qquad \text{and} \qquad v=-\sqrt{{p\over 3}},
in which case the three roots are
t=u+v=0 , \qquad t=\omega_1u-{p\over 3\omega_1u}=\sqrt{-p} , \qquad t={u\over \omega_1}-{\omega_1p\over 3u}=-\sqrt{-p} ,
where
\omega_1=e^{i\frac{2\pi}{3}}=-\tfrac{1}{2} + \tfrac{\sqrt{3}}{2}i.
Finally if 4p^3+27q^2=0 \text{ and } p\ne 0, there are a double root and an isolated root which may be expressed rationally in terms of p \text{ and } q , but these expressions may not be immediately deduced from the general expression of the roots:
 t_1=t_2= -\frac{3q}{2p}\quad \text{and} \quad t_3=\frac{3q}{p}\,.

To pass from these roots of t in Equation (2) to the general formulas for roots of x in Equation (1), subtract \frac{b}{3a} and replace  p and  q by their expressions in terms of a,b,c,d.

Vieta's substitution

Starting from the depressed cubic

t^3 + pt + q = 0,

we make the following substitution, known as Vieta's substitution:

t = w - \frac{p}{3w}

This results in the equation

w^3 + q - \frac{p^3}{27w^3} = 0.

Multiplying by w3, it becomes a sextic equation in w, which is in fact a quadratic equation in w3:

w^6 + qw^3 - \frac{p^3}{27} = 0

The quadratic formula allows this to be solved for w3. If w1, w2 and w3 are the three cube roots of one of the solutions in w3, then the roots of the original depressed cubic are

t_1 = w_1 - \frac{p}{3w_1}, \quad t_2 = w_2 - \frac{p}{3w_2}\quad\text{and} \quad t_3 = w_3 - \frac{p}{3w_3}.

Lagrange's method

In his paper Réflexions sur la résolution algébrique des équations ("Thoughts on the algebraic solving of equations"), Joseph Louis Lagrange introduced a new method to solve equations of low degree.

This method works well for cubic and quartic equations, but Lagrange did not succeed in applying it to a quintic equation, because it requires solving a resolvent polynomial of degree at least six.[26][27][28] This is explained by the Abel–Ruffini theorem, which proves that such polynomials cannot be solved by radicals. Nevertheless, the modern methods for solving solvable quintic equations are mainly based on Lagrange's method.[28]

In the case of cubic equations, Lagrange's method gives the same solution as Cardano's. By drawing attention to a geometrical problem that involves two cubes of different size Cardano explains in his book Ars Magna how he arrived at the idea of considering the unknown of the cubic equation as a sum of two other quantities. Lagrange's method may also be applied directly to the general cubic equation (1) without using the reduction to the depressed cubic equation (2). Nevertheless, the computation is much easier with this reduced equation.

Suppose that x0, x1 and x2 are the roots of equation (1) or (2), and define \zeta = -\tfrac{1}{2} + \tfrac{\sqrt{3}}{2}i (a complex cube root of 1, i.e. a primitive third root of unity) which satisfies the relation \zeta^2+\zeta+1=0. We now set

s_0 = x_0 + x_1 + x_2,\,
s_1 = x_0 + \zeta x_1 + \zeta^2 x_2,\,
s_2 = x_0 + \zeta^2 x_1 + \zeta x_2.\,

This is the discrete Fourier transform of the roots: observe that while the coefficients of the polynomial are symmetric in the roots, in this formula an order has been chosen on the roots, so these are not symmetric in the roots. The roots may then be recovered from the three si by inverting the above linear transformation via the inverse discrete Fourier transform, giving

x_0 = \tfrac13(s_0 + s_1 + s_2),\,
x_1 = \tfrac13(s_0 + \zeta^2 s_1 + \zeta s_2),\,
x_2 = \tfrac13(s_0 + \zeta s_1 + \zeta^2 s_2).\,

The polynomial s_0 is an elementary symmetric polynomial and is thus equal to -b/a in case of Equation (1) and to zero in case of Equation (2), so we only need to seek values for the other two.

The polynomials s_1 and s_2 are not symmetric functions of the roots: s_0 is invariant, while the two non-trivial cyclic permutations of the roots send s_1 to \zeta s_1 and s_2 to \zeta^2 s_2, or s_1 to \zeta^2 s_1 and s_2 to \zeta s_2 (depending on which permutation), while transposing x_1 and x_2 switches s_1 and s_2; other transpositions switch these roots and multiply them by a power of \zeta.

Thus, s_1^3, s_2^3 and s_1 s_2 are left invariant by the cyclic permutations of the roots, which multiply them by \zeta^3=1. Also s_1 s_2 and s_1^3+s_2^3 are left invariant by the transposition of x_1 and x_2 which exchanges s_1 and s_2. As the permutation group S_3 of the roots is generated by these permutations, it follows that s_1^3+s_2^3 and s_1 s_2 are symmetric functions of the roots and may thus be written as polynomials in the elementary symmetric polynomials and thus as rational functions of the coefficients of the equation. Let s_1^3+s_2^3=A and s_1 s_2=B in these expressions, which will be explicitly computed below.

We have that s_1^3 and s_2^3 are the two roots of the quadratic equation

z^2-Az+B^3 = 0 \,.

Thus the resolution of the equation may be finished exactly as described for Cardano's method, with s_1 and s_2 in place of u and v.

Computation of A and B

Setting E_1=x_0+x_1+x_2, E_2=x_0x_1+x_1x_2+x_2x_0 and E_3=x_0x_1x_2, the elementary symmetric polynomials, we have, using that \zeta^3=1:

s_1^3=x_0^3+x_1^3+x_2^3+3\zeta (x_0^2x_1+x_1^2x_2+x_2^2x_0) +3\zeta^2 (x_0x_1^2+x_1x_2^2+x_2x_0^2) +6x_0x_1x_2\,.

The expression for s_2^3 is the same with \zeta and \zeta^2 exchanged. Thus, using \zeta^2+\zeta=-1 we get


A=s_1^3+s_2^3=2(x_0^3+x_1^3+x_2^3)-3(x_0^2x_1+x_1^2x_2+x_2^2x_0+x_0x_1^2+x_1x_2^2+x_2x_0^2)+12x_0x_1x_2\,,

and a straightforward computation gives


A=s_1^3+s_2^3=2E_1^3-9E_1E_2+27E_3\,.

Similarly we have


B=s_1s_2=x_0^2+x_1^2+x_2^2+(\zeta+\zeta^2)(x_0x_1+x_1x_2+x_2x_0)=E_1^2-3E_2\,.

When solving Equation (1) we have

E_1=-b/a, E_2=c/a and E_3=-d/a

With Equation (2), we have E_1=0, E_2=p and E_3=-q and thus:

A=-27q and B=-3p.

Note that with Equation (2), we have x_0 = \tfrac13(s_1 + s_2) and s_1s_2=-3p, while in Cardano's method we have set x_0 = u+v and uv=-\frac13p\,. Thus we have, up to the exchange of u and v:

s_1=3u and  s_2=3v.

In other words, in this case, Cardano's and Lagrange's method compute exactly the same things, up to a factor of three in the auxiliary variables, the main difference being that Lagrange's method explains why these auxiliary variables appear in the problem.

Trigonometric (and hyperbolic) method

Trigonometric method for three real roots

When a cubic equation has three real roots, the formulas expressing these roots in terms of radicals involve complex numbers. It has been proved that when none of the three real roots is rational—the casus irreducibilis— one cannot express the roots in terms of real radicals. Nevertheless, purely real expressions of the solutions may be obtained using hypergeometric functions,[29] or more elementarily in terms of trigonometric functions, specifically in terms of the cosine and arccosine functions.

The formulas which follow, due to François Viète,[22] are true in general (except when p = 0), are purely real when the equation has three real roots, but involve complex cosines and arccosines when there is only one real root.

Starting from Equation (2), t^3+pt+q=0, let us set t=u\cos\theta\,. The idea is to choose u to make Equation (2) coincide with the identity

4\cos^3\theta-3\cos\theta-\cos(3\theta)=0\,.

In fact, choosing u=2\sqrt{-\frac{p}{3}} and dividing Equation (2) by \frac{u^3}{4} we get

4\cos^3\theta-3\cos\theta-\frac{3q}{2p}\sqrt{\frac{-3}{p}}=0\,.

Combining with the above identity, we get

\cos(3\theta)=\frac{3q}{2p}\sqrt{\frac{-3}{p}}

and thus the roots are[30]

t_k=2\sqrt{-\frac{p}{3}}\cos\left(\frac{1}{3}\arccos\left(\frac{3q}{2p}\sqrt{\frac{-3}{p}}\right)-\frac{2\pi k}{3}\right) \quad \text{for} \quad k=0,1,2 \,.

This formula involves only real terms if p<0 and the argument of the arccosine is between −1 and 1. The last condition is equivalent to  4p^3+27q^2\leq 0\,, which implies also p<0. Thus the above formula for the roots involves only real terms if and only if the three roots are real.

Denoting by C(p,q) the above value of t0, and using the inequalities 0\le \arccos(u) \le \pi for a real number u such that -1\le u\le 1\,, the three roots may also be expressed as

t_0=C(p,q),\qquad t_2=-C(p,-q), \qquad t_1=-t_0-t_2\,.

If the three roots are real, we have

t_0\ge t_1\ge t_2\,.

All these formulas may be straightforwardly transformed into formulas for the roots of the general cubic equation (1), using the back substitution described in Section Reduction to a depressed cubic.

Hyperbolic method for one real root

When there is only one real root (and p ≠ 0), it may be similarly represented using hyperbolic functions, as[31][32]

t_0=-2\frac{|q|}{q}\sqrt{-\frac{p}{3}}\cosh\left(\frac{1}{3}\operatorname{arcosh}\left(\frac{-3|q|}{2p}\sqrt{\frac{-3}{p}}\right)\right) \quad \text{if } \quad 4p^3+27q^2>0 \text{ and } p<0\,,
t_0=-2\sqrt{\frac{p}{3}}\sinh\left(\frac{1}{3}\operatorname{arsinh}\left(\frac{3q}{2p}\sqrt{\frac{3}{p}}\right)\right) \quad \text{if } \quad p>0\,.

If p ≠ 0 and the inequalities on the right are not satisfied (the case of three real roots), the formulas remain valid but involve complex quantities.

When  p=\pm 3, the above values of t_0 are sometimes called the Chebyshev cube root.[33] More precisely, the values involving cosines and hyperbolic cosines define, when  p=-3, the same analytic function denoted C_{\frac13}(q), which is the proper Chebyshev cube root. The value involving hyperbolic sines is similarly denoted S_{\frac13}(q), when  p=3.

Factorization

If the cubic equation ax^3 + bx^2 + cx +d=0 with integer coefficients has a rational real root, it can be found using the rational root test: If the root is r = m / n fully reduced, then m is a factor of d and n is a factor of a, so all possible combinations of values for m and n can be checked for whether they satisfy the cubic equation.

The rational root test may also be used for a cubic equation with rational coefficients: by multiplication by the lowest common denominator of the coefficients, one gets an equation with integer coefficients which has exactly the same roots.

The rational root test is particularly useful when there are three real roots because the algebraic solution unhelpfully expresses the real roots in terms of complex entities; if the test yields a rational root, it can be factored out and the remaining roots can be found by solving a quadratic. The rational root test is also helpful in the presence of one real and two complex roots because again, if it yields a rational root, it allows all of the roots to be written without the use of cube roots: If r is any root of the cubic, then we may factor out (xr ) using polynomial long division to obtain

ax^3+bx^2+cx+d=\left (x-r\right )\left (ax^2+(b+ar)x+c+br+ar^2 \right ).

Hence if we know one root, perhaps from the rational root test, we can find the other two by using the quadratic formula to solve the quadratic ax^2+(b+ar)x+c+br+ar^2 , giving

 \frac{-b-ra \pm \sqrt{b^2-4ac-2abr-3a^2r^2}}{2a}

for the other two roots.

A numerical solution

A possible way to numerically find the real roots of a cubic polynomial can proceed in 6 steps:[citation needed]

  1. normalization, i.e., division of the polynomial coefficients by the cubic coefficient, which leads to {\displaystyle f(x) \equiv x^3 + A_2 x^2 + A_1 x + A_0 = 0}
  2. determination of the inflection point, {\displaystyle x_{\rm infl}=-A_2/3}
  3. determination of an initial value for the subsequent iteration depending on the value of {\displaystyle D = A_2^2 - 3 A_1} If D > 0 , and the roots are all real, the bounds of the roots are established with the Laguerre–Nair–Samuelson inequality: {\displaystyle x_{\rm low,high} = x_{\rm infl} \pm \frac{2}{3} \sqrt{D}}The initial value can be chosen as {\displaystyle x = \begin{cases} x_{\rm low} & D > 0, f(x_{\rm infl}) > 0 \\
x_{\rm infl} & D < 0 \\
x_{\rm high} & D > 0, f(x_{\rm infl}) < 0 \end{cases}}If f(x_{\rm infl}) = 0 , x_{\rm infl} is already a root of the polynomial. If D = 0 , the (only) root lies at x = x_{\rm infl} - \sqrt[3]{f(x_{\rm infl})} . In these two cases, no iteration is needed.
  4. iteration of the first root, x_1, by Halley's method
  5. deflation, i.e., division of the polynomial by (x - x_1) , which leads to{\displaystyle g(x) \equiv x^2 + B_1 x + B_0 = 0 \quad \text{with}\quad B_1 = x_1 + A_2,\; B_0 = B_1 x_1 + A_1}
  6. solving the quadratic polynomial with the quadratic formula

If the roots differ by several orders of magnitude, round-off errors may affect the precision of the results (but this is true for cubic solvers based on Cardano's rule, too); it is possible, however, to minimize their effect by careful encoding. A C++ subroutine has been published [34] (see the electronic supplement). If there is only 1 real root, the algorithm outlined above was found to be 1.3–1.7 times faster than a Cardano solver (Cardano's method + 1 post-iteration Newton step); if there are 3 real roots, the speedup is in the range 8–10.

Geometric interpretation of the roots

Three real roots

For the cubic  x^3+bx^2+cx+d=0 with three real roots, the roots are the projection on the x-axis of the vertices A, B, and C of an equilateral triangle. The center of the triangle has the same abscissa as the inflection point.

Viète's trigonometric expression of the roots in the three-real-roots case lends itself to a geometric interpretation in terms of a circle.[22][35] When the cubic is written in depressed form t^3+pt+q=0, as shown above, the solution can be expressed as

t_k=2\sqrt{-\frac{p}{3}}\cos\left(\frac{1}{3}\arccos\left(\frac{3q}{2p}\sqrt{\frac{-3}{p}}\right)-k\frac{2\pi}{3}\right) \quad \text{for} \quad k=0,1,2 \,.

Here \arccos\left(\frac{3q}{2p}\sqrt{\frac{-3}{p}}\right) is an angle in the unit circle; taking \tfrac{1}{3} of that angle corresponds to taking a cube root of a complex number; adding -k\frac{2\pi}{3} for k = 1, 2 finds the other cube roots; and multiplying the cosines of these resulting angles by 2\sqrt{-\frac{p}{3}} corrects for scale.

For the non-depressed case x^3+bx^2+cx+d=0 (shown in the accompanying graph), the depressed case as indicated previously is obtained by defining t such that x=t-\tfrac{b}{3} so t=x+\tfrac{b}{3}. Graphically this corresponds to simply shifting the graph horizontally when changing between the variables t and x, without changing the angle relationships. This shift moves the point of inflection and the centre of the circle onto the y-axis. Consequently, the roots of the equation in t sum to zero.

One real and two complex roots

In the Cartesian plane
The slope of line RA is twice that of RH. Denoting the complex roots of the cubic as g±hi, g = \scriptstyle\overline{OM} (negative here) and h = \scriptstyle\sqrt{\tan ORH} = \scriptstyle\sqrt{\text{slope of line RH}} = \scriptstyle\overline{BE} = \scriptstyle\overline{DA}.

If a cubic is plotted in the Cartesian plane, the real root can be seen graphically as the horizontal intercept of the curve. But further,[36][37][38] if the complex conjugate roots are written as g\pm hi, then g is the abscissa (the positive or negative horizontal distance from the origin) of the tangency point of a line that is tangent to the cubic curve and intersects the horizontal axis at the same place as does the cubic curve; and |h| is the square root of the tangent of the angle between this line and the horizontal axis.

In the complex plane

With one real and two complex roots, the three roots can be represented as points in the complex plane, as can the two roots of the cubic's derivative. There is an interesting geometrical relationship among all these roots.

The points in the complex plane representing the three roots serve as the vertices of an isosceles triangle. (The triangle is isosceles because one root is on the horizontal (real) axis and the other two roots, being complex conjugates, appear symmetrically above and below the real axis.) Marden's Theorem says that the points representing the roots of the derivative of the cubic are the foci of the Steiner inellipse of the triangle—the unique ellipse that is tangent to the triangle at the midpoints of its sides. If the angle at the vertex on the real axis is less than \tfrac{\pi}{3} then the major axis of the ellipse lies on the real axis, as do its foci and hence the roots of the derivative. If that angle is greater than \tfrac{\pi}{3}, the major axis is vertical and its foci, the roots of the derivative, are complex conjugates. And if that angle is \tfrac{\pi}{3}, the triangle is equilateral, the Steiner inellipse is simply the triangle's incircle, its foci coincide with each other at the incenter, which lies on the real axis, and hence the derivative has duplicate real roots.

Omar Khayyám's solution

Omar Khayyám's geometric solution of a cubic equation, for the case a=2, b=16, giving the root 2. The fact that the vertical line intersects the x-axis at the center of the circle is specific to this particular example

As shown in this graph, to solve the third-degree equation x^3 + a^2x = b where b>0, Omar Khayyám constructed the parabola y=x^2/a, the circle which has as a diameter the line segment [0, b/a^2] of the positive x-axis, and a vertical line through the point above the x-axis, where the circle and parabola intersect. The solution is given by the length of the horizontal line segment from the origin to the intersection of the vertical line and the x-axis.

A simple modern proof of the method is the following: multiplying by x the equation, and regrouping the terms gives

\frac{x^4}{a^2}= x\,(\frac{b}{a^2}-x)\,.

The left-hand side is the value of y2 on the parabola. The equation of the circle being y^2+x\,\left(x-\frac{b}{a^2}\right)=0, the right hand side is the value of y2 on the circle.

Collinearities

The tangent lines to a cubic at three collinear points intercept the cubic again at collinear points.[39]:p. 425,#290

Symmetry

A cubic polynomial has 180° rotational or point symmetry about its inflection point.[40] The inflection point of a general cubic polynomial y = f (x ) = ax3 + bx2 + cx + d occurs at <templatestyles src="Sfrac/styles.css" />d2y/dx2 = 6ax + 2b = 0 i.e. xi = − <templatestyles src="Sfrac/styles.css" />b/3a and thus yi = <templatestyles src="Sfrac/styles.css" />2b3/27a2 − <templatestyles src="Sfrac/styles.css" />bc/3a + d. Translating the function so that the inflection point is at the origin, fT(x ) = f (x + xi) - yi = a (x − <templatestyles src="Sfrac/styles.css" />b/3a )3 + b (x − <templatestyles src="Sfrac/styles.css" />b/3a )2 + c (x − <templatestyles src="Sfrac/styles.css" />b/3a ) + d - (<templatestyles src="Sfrac/styles.css" />2b3/27a2 − <templatestyles src="Sfrac/styles.css" />bc/3a + d ) = ax3 + (c − <templatestyles src="Sfrac/styles.css" />b2/3a )x. As all terms are odd powers of x, fT(−x ) = −fT(x ), proving that all cubic functions are rotationally symmetrical about their inflection points.[41]

Applications

Cubic equations arise in various other contexts.

Marden's theorem states that the foci of the Steiner inellipse of any triangle can be found by using the cubic function whose roots are the coordinates in the complex plane of the triangle's three vertices. The roots of the first derivative of this cubic are the complex coordinates of those foci.

Given the cosine (or other trigonometric function) of an arbitrary angle, the cosine of one-third of that angle is one of the roots of a cubic.

The solution of the general quartic equation relies on the solution of its resolvent cubic.

In analytical chemistry, the Charlot equation, which can be used to find the pH of buffer solutions, can be solved using a cubic equation.

The plastic number, 1.3247... , is the sole real solution of the cubic equation x^3=x+1\, .

The eigenvalues of a 3×3 matrix are the roots of a cubic polynomial which is the characteristic polynomial of the matrix.

The characteristic equation of a third-order linear difference equation or differential equation is a cubic equation.

In chemical engineering and thermodynamics, cubic equations of state are used to model the PVT (pressure, volume, temperature) behavior of substances.

Kinematic equations involving changing rates of acceleration are cubic.

See also

Notes

  1. Exceptions include fields of characteristic 2 and 3.
  2. British Museum BM 85200
  3. 3.0 3.1 Lua error in package.lua at line 80: module 'strict' not found.
  4. 4.0 4.1 Van der Waerden, Geometry and Algebra of Ancient Civilizations, chapter 4, Zurich 1983 ISBN 0-387-12159-5
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. Lua error in package.lua at line 80: module 'strict' not found.
  7. Lua error in package.lua at line 80: module 'strict' not found.
  8. Guilbeau (1930, p. 8) states that "the Egyptians considered the solution impossible, but the Greeks came nearer to a solution."
  9. 9.0 9.1 Guilbeau (1930, pp. 8–9)
  10. Lua error in package.lua at line 80: module 'strict' not found.
  11. Lua error in package.lua at line 80: module 'strict' not found.
  12. Lua error in package.lua at line 80: module 'strict' not found.
  13. A paper of Omar Khayyam, Scripta Math. 26 (1963), pages 323–337
  14. In Lua error in package.lua at line 80: module 'strict' not found.. one may read This problem in turn led Khayyam to solve the cubic equation x3 + 200x = 20x2 + 2000 and he found a positive root of this cubic by considering the intersection of a rectangular hyperbola and a circle. An approximate numerical solution was then found by interpolation in trigonometric tables. The then in the last assertion is erroneous and should, at least, be replaced by also. The geometric construction was perfectly suitable for Omar Khayyam, as it occurs for solving a problem of geometric construction. At the end of his article he says only that, for this geometrical problem, if approximations are sufficient, then a simpler solution may be obtained by consulting trigonometric tables. Textually: If the seeker is satisfied with an estimate, it is up to him to look into the table of chords of Almagest, or the table of sines and versed sines of Mothmed Observatory. This is followed by a short description of this alternate method (seven lines).
  15. J. J. O'Connor and E. F. Robertson (1999), Omar Khayyam, MacTutor History of Mathematics archive, states, "Khayyam himself seems to have been the first to conceive a general theory of cubic equations."
  16. Guilbeau (1930, p. 9) states, "Omar Al Hay of Chorassan, about 1079 AD did most to elevate to a method the solution of the algebraic equations by intersecting conics."
  17. Datta and Singh, History of Hindu Mathematics, p. 76,Equation of Higher Degree; Bharattya Kala Prakashan, Delhi, India 2004 ISBN 81-86050-86-8
  18. Lua error in package.lua at line 80: module 'strict' not found..
  19. Lua error in package.lua at line 80: module 'strict' not found.
  20. Lua error in package.lua at line 80: module 'strict' not found.
  21. Lua error in package.lua at line 80: module 'strict' not found.
  22. 22.0 22.1 22.2 Lua error in package.lua at line 80: module 'strict' not found.
  23. Lua error in package.lua at line 80: module 'strict' not found., Chapter 10 ex 10.14.4 and 10.17.4, pp. 154–156
  24. Lua error in package.lua at line 80: module 'strict' not found., Extract of page 179
  25. Jacobson 2009, p. 210
  26. Lua error in package.lua at line 80: module 'strict' not found., §6.2, p. 134
  27. Lua error in package.lua at line 80: module 'strict' not found., Algebra in the Eighteenth Century: The Theory of Equations
  28. 28.0 28.1 Daniel Lazard, "Solving quintics in radicals", in Olav Arnfinn Laudal, Ragni Piene, The Legacy of Niels Henrik Abel, pp. 207–225, Berlin, 2004,. ISBN 3-540-43826-2
  29. Zucker, I. J., "The cubic equation — a new look at the irreducible case", Mathematical Gazette 92, July 2008, 264–268.
  30. Lua error in package.lua at line 80: module 'strict' not found.
  31. These are Formulas (80) and (83) of Weisstein, Eric W. 'Cubic Formula'. From MathWorld—A Wolfram Web Resource. http://mathworld.wolfram.com/CubicFormula.html, rewritten for having a coherent notation.
  32. Holmes, G. C., "The use of hyperbolic cosines in solving cubic polynomials", Mathematical Gazette 86. November 2002, 473–477.
  33. Abramowitz, Milton; Stegun, Irene A., eds. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables, Dover (1965), chap. 22 p. 773
  34. Lua error in package.lua at line 80: module 'strict' not found.
  35. Lua error in package.lua at line 80: module 'strict' not found. See esp. Fig. 2.
  36. Lua error in package.lua at line 80: module 'strict' not found.
  37. Lua error in package.lua at line 80: module 'strict' not found.
  38. Lua error in package.lua at line 80: module 'strict' not found.
  39. Whitworth, William Allen. Trilinear Coordinates and Other Methods of Modern Analytical Geometry of Two Dimensions, Forgotten Books, 2012 (orig. Deighton, Bell, and Co., 1866). http://www.forgottenbooks.com/search?q=Trilinear+coordinates&t=books
  40. Lua error in package.lua at line 80: module 'strict' not found.
  41. Lua error in package.lua at line 80: module 'strict' not found.

References

  • Lua error in package.lua at line 80: module 'strict' not found. Ch. 24.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.

External links