🔎 Search the information available on a webpage using natural language instead of an exact string match.
Search the information available on a webpage using natural language instead of an exact string match. Uses MobileBERT fine-tuned on SQuAD via TensorFlowJS to search for answers and mark relevant elements on the web page.
This extension is an experiment. Deep learning models like BERT are powerful but may return unpredictable and/or biased results that are tough to interpret. Please apply best judgement when analyzing search results.
Ctrl-F uses exact string-matching to find information within a webpage. String match is inherently a proxy heuristic for the true content -- in most cases it works very well, but in some cases it can be a bad proxy.
In our example above we search
https://stripe.com/docs/testing, aiming to
understand the difference between test mode and live mode. With string
matching, you might search through some relevant phrases
"test mode", and/or
"difference" and scan through results. With semantic search, you
can directly phrase your question
"What is the difference between live mode and test mode?". We see that the model returns a relevant result, even though
the page does not contain the term "
Every time a user executes a search:
<ol>elements on the page and extracts text from each.
There are three main components that interact via Message Passing to orchestrate the extension:
popup.js): React application that renders the search bar, controls searching and iterating through the results.
content.js): Runs in the context of the current tab, responsible for reading from and manipulating the DOM.
background.js): Background script that loads and executes the TensorFlowJS model on question-context pairs.
src/js/message_types.js contains the messages used to interact between these
Make sure you have these dependencies installed.
The unpacked extension will be placed inside of
build/. See Google Chrome
documentation to load the
unpacked extension into your Chrome browser in development mode.
A zipped extension file ready for upload will be placed inside of