[Websearch] update (#427)
* Fix reuqest body
* update webSearchQueryPromptTemplate
* update generate google query parser
* Add today's date to google search query creator
* crawl top stories if exts; remove answer_box & knowledgeGraph
* Create paragraph chunks from top articles
* flattened paragprah chunks
* update status texts
* add gradio client
* call gradio app for RAG
* Web scrape only "p, li, span" els
* add MAX_N_CHUNKS
* gradio result typing
* parse only <p> elements
* rm dev change
* update typing WebSearch
* buld RAG prompt
* Rm dev change
* change websearch context msg from user to assisntat type
* use hosted gradio app
* fix lint
* prompt engineering
* more prompt engineering
* MAX_N_PAGES_SCRAPE = 10
* better error msg
* more prompt engineering
* revert websearch prompt to previous
* rm `top_stories` from websearch as the results are not good
* Stop using gradio client, use regular fetch
* chore
* Rm websearchsummary references as it is no longer used
* update readme
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <julien@huggingface.co>
* Use tfjs to do embeddings in server node
* fix websearch component disapperar after finishing generation
* Show sources of closest embeddings used in RAG
* fix prompting and also add current date
* add comment
* comment for search query
* sources
* hide www
* using hostname direclty
* Show successful web pages instead of failed ones
* rm noisy messages
* google query generation using previous messaages as context
* handle falcon generation
* bring back Browsing webpage msg
---------
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Victor Mustar <victor.mustar@gmail.com>