Convert any PDF into a fillable form with 🪄AI magic. I built a free tool that: (a) takes seconds, (b) outperforms all the commercial solutions, and (c) is open source!
If you've got 308GB to spare and want a high-quality document dataset to add to your VLM tasks, CommonForms is now hosted on @huggingface! It only took 24 hours to upload from my home network. 😭
huggingface.co/datasets/jbarr…
This is such a good paper. I love NLP error analysis papers -- @chrmanning and co do them so well (another great example will always be "Part-of-Speech Tagging from 97% to 100%").
Alright @Dorialexander, the Qwen2.5 VL localization is actually insanely good.
This is 0-shot detection/localization/classification of checkboxes in a random document image I found online (a typically difficult task given the small resolution of the objects).
MLX is great! The Python code base is concise and high quality, making it easy to understand and extend. And it’s *fast* — I didn’t add timing information yet, but on my M2 MacBook Air, BERT is notably faster than the torch equivalent (running on its CPU). A very Apple product.
If you’re unfamiliar with AllenNLP but are a beginning NLP researcher or interested party, I have a “Learn AllenNLP The Hard Way” tutorial on my site: jbarrow.ai/allennlp-the-h…
Hey @awnihannun, I've now got an MLX implementation of phi-2 working! Missing the generate function rn, but will put together a PR tonight or tomorrow morning. Free to help make it faster?
For readability, I've been throwing a `| fold -s -w 100` to the end of this to get nice, readable line breaks (e.g., screenshot is the function I have for Mistral)