How Math Academy Creates its Knowledge Graph

by Justin Skycak (@justinskycak) on November 13, 2024

We do it all manually, entirely by hand.

Want to get notified about new posts? Join the mailing list and follow on X/Twitter.

Been getting the following question a lot:

“How do you create your knowledge graph? How do you map out the topics at the appropriate level of granularity and set the connectivity (prerequisite & encompassing relationships)?”

The answer is actually pretty simple but it tends to frustrate the kinds of people who are always searching for some “secret trick” to avoid work that feels difficult and tedious.

The answer: We do it all manually, entirely by hand.

The secret sauce is really just doing a shit-ton of work to accumulate domain expertise in math and teaching and then doing a shit-ton more work to encode that domain expertise into our knowledge graph structure.

Whenever we plan out a course we take a look at numerous other textbooks and online resources, but ultimately it just comes down to our own best judgement based on our own teaching/tutoring experience and knowledge of the subject matter.

Doing this manually might feel challenging and tedious at the beginning but in my/our experience it comes easier as you gain experience with it.

You just kind of get a “feeling” for what the right chunk size is and what level of detail needs to be addressed within a single chunk.

We’ve also spent years refining the chunks, sometimes realizing that a chunk needs to be split, and after enough of that we’ve gotten to the point where we can pull it off really well on the first try.

How about the connectivity? Each of our ~2500 topics has 3-4 knowledge points, and each knowledge point is linked to one or more (typically several) prerequisites. Each prerequisite has an encompassing weight that says what fraction of the prerequisite topic is encompassed, on average, by solving a problem in the post-requisite topic – in other words, how much “credit” should a simpler topic get for doing an advanced topic where the simpler topic is a component skill.

Yes, that’s a lot of connectivity, and yes, we set that manually too. Alex sets all the prerequisite relationships and I set all the encompassing weights. It was and still is quite a bit of work, but not a prohibitive amount of work. “Right at the edge of human scale” as Jason likes to say.

By the way, the “edge of human scale” does not mean “the edge of what you can do in a day” or “the edge of when you get tired/bored.”

When we were first encoding encompassings into the knowledge graph (after already building up a hefty content base) I spent about 8 hours per day setting encompassings, for a month. No joke, I basically did domain-expert data entry full time for a month:


1500 topics at the time
x 5 prereq relationships per topic
x 2 minutes to estimate the encompassing value for each of those relationships
= 15000 minutes
= 250 hours
~ 8hr/day for 30 days

And yeah, it suuuuuucked! 🤮

But guess what? At the end of that month the task selection model started working really, really well because it had such accurate and comprehensive data to go off of. It was not a fun month but it was 100% worth it.

Like “picking gold off a mountain” (another Jason phrase) where there’s a ton of gold scattered around, and it’s tedious to load it up into the wagon and come up and down the mountain, but it’s insanely valuable and you’d be an idiot not to just go collect it even if it’s not super fun.

In other words… just because something’s tedious doesn’t mean it’s low ROI. Sometimes the highest-ROI things are extremely tedious and you just have to suck it up and do it because it’s a sure path to get you to where you need to be in a reasonable time frame.

Okay okay, so the topics and connectivity are created by hand… what about the questions and solutions?

Yeah, those too. By hand.

We also manually estimate a question time for each question type, which we use to help calibrate how much XP each task is worth.

All of our content, metadata, and exercises are created manually in-house, carefully crafted over many years by a team of math experts.

Natural follow-up question: Why generate questions manually instead of algorithmically?

Well, beyond simple arithmetic, there is so much dimensionality and nuance in math questions that generating questions algorithmically would require a custom algorithm for each question type, which would take far longer to build than just biting the bullet and manually generating a sufficiently large pool of questions. Once you create one question of a type, it’s pretty easy to clone it and tweak it into new, different questions within the same type.

Furthermore, in our case, a “sufficiently large” pool of questions in each knowledge point does not actually have to be that large: because our students work hyper-efficiently, engaging in minimum effective doses of practice across a highly segmented curriculum, we don’t actually need that many questions within each question type to ensure that students are not seeing the same questions over again. About 20 questions per knowledge point is more than sufficient. (A typical course consists of about 500-1,000 knowledge points, requiring a pool of about 10,000-20,000 questions.)

Again, that’s right at the edge of human scale. It’s definitely too much content for a single person to create, but it’s doable when you have a strong content team working hard and efficiently under strong leadership (Alex).

There was a time in my life when I was attracted to the idea of creating general-purpose solutions to a wide class of problems, but I’ve come to realize that the real edge – and, at least for me, the real satisfaction – typically comes from

using your human brain / accumulated experience to grapple with the microstructure of a seemingly intractable problem,
finding an exploit that you can pry open just wide enough to shove yourself through with some elbow grease, and
doing that over and over again on each successive problem, periodically looking up to see that despite how intractable it seemed at the outset, you're making serious progress in the direction that you're trying to go.

Like Matt Damon in the movie The Martian: “You solve one problem and you solve the next one and then the next. And if you solve enough problems, you get to come home.”

Follow-Up Questions

Where do the exercises and content on Math Academy come from? Are they all made in-house or pulled from other materials?

All of our content and exercises are created in-house, carefully crafted over many years by a team of experts led by Alex. We perform curriculum comparisons to ensure that our content is comprehensive, but everything on Math Academy has been carefully crafted by an expert human.

Why generate questions manually instead of algorithmically? Beyond simple arithmetic, there is so much dimensionality and nuance in math questions that generating questions algorithmically would require a custom algorithm for each question type, which would take far longer to build than just biting the bullet and manually generating a sufficiently large pool of questions. Furthermore, a sufficiently large pool of questions does not have to be that large: because Math Academy students work hyper-efficiently, engaging in minimum effective doses of practice across a highly segmented curriculum, we don’t actually need that many questions within each question type to ensure that students are not seeing the same questions over again. About 20 questions per knowledge point is more than sufficient. (A typical course consists of about 500-1,000 knowledge points, requiring a pool of about 10,000-20,000 questions.)

Additionally, all of the information in our knowledge graph – tens of thousands of prerequisite links, estimates for the time it takes to work out questions, and encompassing relations (i.e., how much “credit” should a simpler topic get for doing an advanced topic where the simpler topic is a component skill), have also been carefully crafted by expert humans. (Alex constructs the prerequisite graph, his team sets the question times, and I construct the encompassing graph.)

If that sounds like a ridiculous amount of work, then, well… that’s about right and it gives you an idea of why this kind of system is so difficult to build. Even without all the fancy tech, the amount of content that’s needed is enough to form the basis of a full-fledged publishing company. You could say the same about the software: even without the core base of content, the amount of software that’s needed is enough to form the basis of a full-fledged tech company. Even individual components of the software – the content management system, the student interface, the AI expert system that makes all the complicated behind-the-scenes decisions regarding what the student needs to work on – could each on their own form the basis of a full-fledged company.

Jason often jokes that he and Sandy had no idea what they were signing up for when they started Math Academy. The amount of work it’s taken to get to this point has been absolutely insane.

Want to get notified about new posts? Join the mailing list and follow on X/Twitter.