Actually the ∂μ is a "total derivative", otherwise it wouldn't be a total divergence and we would not be able to get rid of it during the derivation of Lagrange equations
∂μL≡∂L∂ϕ∂μϕ+∂L∂(∂νϕ)∂μ∂νϕ
This is a slightly confusing but in the end somewhat practical convention in classical field theory. For instance, you can see that this very same derivative is used in the Lagrange equations
∂μ(∂L∂(∂μϕ))−∂L∂ϕ=0
etc. This notation is possible only thanks to the fact that the "real partial derivative" of L is postulated to be always zero and that there cannot thus be any ambiguity in what is meant by ∂μL.
Some authors prefer to use notation such as d/dxμ or D/dxμ to underline the "totalness" but this always feels like notation abuse to me. My opinion is that if anything should change then it is the discussion of the "omitted pullback" L(ϕ,...)→L(ϕ(xμ),...), what does ∂/∂(∂μϕ) really mean in term of the pullback and so on.