We discuss how to take a derivative of an implicitly defined function.
Video Lecture
There is a supplemental video on understanding when and why implicit differentiation can be used! You can find the video here:
(Supplemental Videos are included via external link so you don’t have to watch them to earn credit.)
Text and Additional Details
Classically we are used to functions being defined in a way that makes the output obvious. For example; has a clear input variable, , and a clear output variable , and most importantly it is very easy to get the value of when you plug in . In this case we would say the function is “explicitly defined” in terms of .
But not all functions are defined like this. Indeed, as you may have seen in precalculus, you can have implicitly defined functions, which is when the dependent variable (aka output) is no longer isolated like it was in the previous example. For instance, consider the implicitly defined function . Here, is still the output variable, and is still the input variable, but after substituting a value for it takes more effort to determine what the value of would be.
As we move forward in calculus then, we want to know how to handle these kinds of functions as well.
It’s important to realize that implicitly defined functions are still essentially the same kind of object as explicitly defined functions in many ways; indeed the one (rather notable) difference is that our dependent variable isn’t isolated. This difference can take a lot of forms. One possibility is that our dependent variable is only on one side of the equation, but with operations being performed on it - for example . Another possibility is that the dependent and independent variable are intermixed, like in .
This is where our normal notation kind of hurts us though. We write the dependent variable as if it is just a variable... but in reality it is a function of an independent variable (this is what distinguishes the implicitly defined expression from the explicitly defined multivariable expression ... two different things that nonetheless look totally identical - very unfortunate). For this reason, it can be useful to write instead of just to make it clear that is really a function being applied to an input .
Why does this matter for us? Because when we see this should look very similar (and indeed is functionally the same as) ; and we know how to deal with taking a derivative of ... it’s just .
This seemingly simple observation is the heart of implicit differentiation. When we have implicitly defined functions, we can take the derivative as normal, but we need to keep track of where the derivative is being applied to the dependent variable. Let’s look at the example .
We are going to do this problem in two parallel columns to see what it looks like to do the “implicit differentiation” part, versus how we might think of doing it if we replaced “” with the “” style notation. We will use in the “Notes” section as a stand-in for both and notations (so if you see , you can sub in either or to see what we did in the corresponding column for that line).
-Version | -Version | Notes |
Apply Derivative | ||
Derivative of and chain rule. | ||
Apply Product Rule | ||
As we can see, the idea of implicit differentiation is nothing but the chain rule, applied to the fact that the dependent variable is really a disguised version of the more familiar function notation - so when we write something like we really mean .
It is important to note that it is expected of students in this segment to solve for the derivative term. For example, if we continue from where we left off with the version above, we want to isolate ...
From previous work. | |
Subtract from both sides. | |
Factor out from both terms on the left. | |
Divide both sides by . | |
So we can see we have isolated the derivative term (the ). Notice that it still has mixed into the derivative - this is pretty normal. Since we started with an implicitly defined function, it is natural that the derivative would also be implicitly defined.
Finally, you may wonder how to know if a term is implicitly defined or not; meaning, when do you include something like a or not? The good news is, the answer is entirely contained in the notation!
When we read something like , we usually think of this as “the derivative of ”, but this is missing an absolutely vital part of the notation. The derivative operator, the “” symbol part is indeed telling us we want to take a derivative... but it is also telling us what we are taking the derivative with respect to. In particular, the in is telling you that is the variable that gets treated as the independent variable, and any other variable will be treated as a dependent variable. Thus to compute this we would have the following:
Derivative Sum Rule | ||
Since the Derivative was with respect to , the derivative of is . | ||
Since the Derivative was with respect to , we first treat as if it were an “” to get , | ||
but then we have to apply the chain rule and multiply by the derivative of ; which is just . | ||
Simplified the expression. |
So, if the variable in the bottom of the (the “”) matches the variable you are taking a derivative of, you don’t have any chain rule “prime” terms; but if they don’t match, you should end up with the chain-rule giving you a “prime” version of the variable multiplying the expression. Here’s a quick few examples:
The last column may look strange with the since we are use to taking derivatives with respect to , but this just means that the derivative being taken is with respect to a variable other than and so the derivative assumes that is now the dependent variable and not an independent variable.
As we’ve seen, implicit differentiation is nothing more than using the chain rule and recognizing that we sometimes use a dependent variable (like ) in lieu of the functional notation . Using a variable like instead of can make a lot of sense, and if you are interested in where or why you might run into implicitly defined functions in the real world, there is an optional video covering it (linked at the top of the page after the lecture video). However, the real takeaway here is to not let the notation trip you up - you want to keep track of what the derivative is respect to (say, ) and what the function is in terms of (say, , and ). Wherever these two don’t match up, you need to include a leftover prime notation due to the chain rule. So if you are taking , since the derivative is in terms of and the term you are differentiating is ... which is not an , you have a leftover .