cat articles/noteable
Using ChatGPT's Noteable and WebPilot plugins to build a notebook that predicts OpenCALM 14B performance
I saw people saying that the Noteable plugin, which lets you create notebooks through conversation on ChatGPT, was impressive. I tried it with ChatGPT, or GPT-4, and it was better than I expected. We have reached a convenient time where AI can write the notebook for small analyses, and you can check the results in ChatGPT without opening the notebook itself.
For example, I asked it to fetch the number of parameters and PPL for OpenCALM models from the Hugging Face page and plot them. The result looked like this:

The PPL performance graph per parameter looked like this:

It also built a model using the parameters from 1B onward, where a linear regression seemed plausible. The predicted graph for a hypothetical OpenCALM-14B model looked like this. Since the 1B, 3B, and 7B models decrease linearly, it feels like performance will continue improving as larger models appear.

先ほど作成した線形回帰モデルの切片と係数は以下の通りです:
切片(Intercept): 10.7928
係数(Coefficient): -0.000383
これは、モデルが以下の形式で表されることを意味します:
Dev ppl = 10.7928 - 0.000383 * Params
つまり、Paramsが1増えると、Dev pplは約0.000383減少すると予測されます。
The ChatGPT conversation looked like this. It is not written in this exchange, but if I ask it to model the data with something nonlinear, such as SVM, a neural network, or a polynomial instead of only linear regression, it will express it that way too. It seems very convenient.
- https://chat.openai.com/share/e3f754bb-0a9f-4bc9-a04e-399296f55884
- ChatGPT recently added share URLs. Noteable images seem to be cached only for a few dozen minutes during the conversation, so they disappear after a while.
The notebook created by Noteable looked like this. The data scraped by WebPilot is placed in the first cell.
Until now, the natural flow was to collect data from a web page, shape it into CSV or Python code, and then analyze it in a notebook using familiar steps. Being able to do that quickly by writing natural language is extremely convenient. If I want to do something more complex, the notebook already exists, so I can continue the analysis by adding a little myself. Having the usual notebook workflow become "mostly let AI do it, then have a human make the final adjustments" is a strong point.
Tedious work keeps disappearing, which feels good.