unstructured
Feat: Bag of words for testing metric
#1650
Merged

mallorih merged 49 commits into main from feat/bag-of-words
mallorih
local embedding model from huggingface
fe1767d0
Merge branch 'main' of https://github.com/Unstructured-IO/unstructured
a339690c
Merge branch 'main' of https://github.com/Unstructured-IO/unstructure…
c0808f82
add arguments
672bc8d0
begin coding bag of words
a6f9fbbe
bag of words function
8511de1e
fix syntax
2722e09d
mallorih Merge branch 'main' into feat/bag-of-words
fc2754fd
format
ed42bc18
Merge branch 'feat/bag-of-words' of https://github.com/Unstructured-I…
0ae04ea9
remove unwanted file
332c70ad
mallorih requested a review from shreyanid 2 years ago
mallorih Merge branch 'main' into feat/bag-of-words
ced5db6f
mallorih removed review request from shreyanid 2 years ago
Merge branch 'main' of https://github.com/Unstructured-IO/unstructure…
4f1c9ec4
Merge branch 'feat/bag-of-words' of https://github.com/Unstructured-I…
394e3bb8
update changelog and version
81ba8759
mallorih Merge branch 'main' into feat/bag-of-words
866e8e3e
fix test
c4114f71
Merge branch 'feat/bag-of-words' of https://github.com/Unstructured-I…
bdefeae7
shreyanid
added test
71b5656e
redo logic for bag of words
2e041198
update tests
5d1769a4
remove funky words
f8ecffad
update version
010477a4
shreyanid commented on 2023-10-06
Klaijan Merge branch 'main' into feat/bag-of-words
b8518624
mallorih
shreyanid
Klaijan
Merge branch 'main' of https://github.com/Unstructured-IO/unstructure…
34334b37
fix bag of words and move code to correct files
b36a310d
conflict
7da1314a
formatting
7e060545
mallorih Merge branch 'main' into feat/bag-of-words
21bd5fdc
fix typing
c5128fc2
Merge branch 'feat/bag-of-words' of https://github.com/Unstructured-I…
ca30d9da
restore core.py file
f1d32cbf
correct typing
fbd1abb4
fix syntax
58a670a3
shreyanid commented on 2023-10-09
shreyanid commented on 2023-10-09
add new condition
dcd053f6
shreyanid commented on 2023-10-09
remove additional code
e86da521
shreyanid commented on 2023-10-10
removes hyphens at the beginning of sentence
88ba596c
formatted
bd462039
shreyanid adding test for dash and hyphen
1838b956
add test
128ea22a
Merge branch 'feat/bag-of-words' of https://github.com/Unstructured-I…
9ad8073a
mallorih Merge branch 'main' into feat/bag-of-words
8d8dcde9
removed test
8dd9b06a
Merge branch 'feat/bag-of-words' of https://github.com/Unstructured-I…
9a1aaa00
fix logic to remove punctuation with spaces around it.
999cfc85
fix test
adfec61e
shreyanid Merge branch 'main' into feat/bag-of-words
dc690c4f
shreyanid approved these changes on 2023-10-10
mallorih Merge branch 'main' into feat/bag-of-words
0a47804c
mallorih Merge branch 'main' into feat/bag-of-words
b699f9b9
mallorih enabled auto-merge 2 years ago
mallorih merged a5d7ae46 into main 2 years ago
mallorih deleted the feat/bag-of-words branch 2 years ago
