I think you're being somewhat misled by your terminology and by the ambiguity of the standard, compact notation. Note that some texts use only the symbol "$\mathrm{d}/\mathrm{d}\tau$", or only "$\mathrm{D}_\tau$" (or similar symbols), for both operations you mention. Because its meaning is determined by what it is applied to. A general warning here is not to take mathematical notation at face value: as usual it is there to remind us, or give us a summary or signpost, of what we are doing; but its syntax cannot be taken too strictly (as it can in some formal-logic contexts instead). For example, "$\frac{\mathrm{d}x^\mu}{\mathrm{d}\tau}$" is already inaccurate: if $x^\mu$ is a coordinate, it is a function from spacetime to the real numbers, $x^\mu \colon M \to \mathbf{R}$, and therefore any derivative with respect to some parameter $\tau$ makes no sense at all.
Geometrically we are taking the following steps:
We consider a curve into spacetime, that is, a map from the reals (or an interval thereof) into spacetime, $C \colon \mathbf{R}\to M$.
We consider the field of tangent vectors $\pmb{u}$ to the curve; this is what is denoted by "$\frac{\mathrm{d}x^\mu}{\mathrm{d}\tau}$". These vectors embody the effect of deriving a function defined over spacetime, $f \colon M \to \mathbf{R}$, along the curve, which we can write as $\pmb{u}(f)$. This is an operation that we can do because the composite function $f\circ C \colon \mathbf{R} \to \mathbf{R}$ is from the reals to the reals. I personally prefer to avoid the inconsistent notation "$\mathrm{d}x^\mu/\mathrm{d}\tau$", and denote the function mapping the parameter $\tau$ to the coordinate chart by, say, $C^\mu(\tau)$, which is just $x^\mu[c(\tau)]$. It now makes sense to take the derivative of $C^\mu$, and we can denote it with a dot: $\dot{C}{}^\mu := \mathrm{d}C^\mu/\mathrm{d}\tau$. These are the components of the tangent vector $\pmb{u}$, which we can also write in invariant notation: $\pmb{u} = \dot{C}{}^\mu\ \partial_\mu$. The action of this vector on a function is then $\pmb{u}(f) = \dot{C}{}^\mu\ \partial_\mu f$.
We cannot derive any other tensor field along the curve, though, because we would not know how to "move" the tensor from one point on the curve to another nearby, to take their difference, which is necessary if we are considering a derivative. This is also true of the vector field $\pmb{u}$ defined above. In flat space this is done by moving the tensor keeping it parallel to itself, but in a generic manifold there is no notion of parallelism. To move the tensor we must therefore introduce a notion of "nearby parallelism", which is not unique. This is embodied in the choice of a connection. The directional covariant derivative $\nabla_{\pmb{u}}$ expresses the result of infinitesimally moving a tensor along the direction of the vector $\pmb{u}$, keeping the tensor parallel to itself according to the connection chosen, and then taking the difference with the tensor at the end place. We can in particular apply this to $\pmb{u}$ itself: $\nabla_{\pmb{u}}\pmb{u}$.
This shows that we do not use $\nabla_{\pmb{u}}$ because "we need to account for the effect of curvature". Rather, we need to introduce curvature – or properly speaking a connection – if we want to be able to take a derivative of $\pmb{u}$ or of other tensors along the curve.
But I think no answer here can really be a substitute for a good book on differential geometry, especially one applied to relativity. I recommend:
This post imported from StackExchange Physics at 2025-01-21 21:54 (UTC), posted by SE-user pglpm