I ran create_dataset.py on a PDF and got Datatest_parsed.bkp. When I then run summarize.py, it fails with: ValueError: attempt to get argmax of an empty sequence
Complete Error:
2022-06-20 15:59:26,118 - root - INFO: Dataset - Parsing Test Data
0 / 10000
multiprocess.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/multiprocess/pool.py", line 121, in worker
result = (True, func(*args, **kwds))
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/multiprocess/pool.py", line 44, in mapstar
return list(map(*args))
File "/home/mayyankg/Desktop/newspulse/summarization/SumTO_financial_summarization/components/Dataset.py", line 118, in job_each_file_test
d["raw_text"] = open(self.test_dir + k, "r", encoding="utf-8").read()
FileNotFoundError: [Errno 2] No such file or directory: './Data/test_articles/Annual Report for the year ended 31 December 2021.txt'
"""
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "create_dataset.py", line 28, in <module>
data.parse_test_data()
File "/home/mayyankg/Desktop/newspulse/summarization/SumTO_financial_summarization/components/Dataset.py", line 133, in parse_test_data
p.map(self.job_each_file_test, list_keys)
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/multiprocess/pool.py", line 268, in map
return self._map_async(func, iterable, mapstar, chunksize).get()
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/multiprocess/pool.py", line 657, in get
raise self._value
FileNotFoundError: [Errno 2] No such file or directory: './Data/test_articles/Annual Report for the year ended 31 December 2021.txt'
(finance_summ) mayyankg@D562:~/Desktop/newspulse/summarization/SumTO_financial_summarization$ python create_dataset.py
2022-06-20 16:00:35,075 - root - INFO: Dataset - Parsing Test Data
(finance_summ) mayyankg@D562:~/Desktop/newspulse/summarization/SumTO_financial_summarization$ python create_dataset.py
2022-06-20 16:01:37,503 - root - INFO: Dataset - Parsing Test Data
(finance_summ) mayyankg@D562:~/Desktop/newspulse/summarization/SumTO_financial_summarization$ python summarize.py
2022-06-20 16:01:56,973 - root - INFO: Summarizer - initializing summarizer
2022-06-20 16:01:56,973 - root - INFO: Summarizer - Loading model (auto)
Traceback (most recent call last):
File "summarize.py", line 25, in <module>
summy = Summarizer(test_set, "morenolq/SumTO_FNS2020")
File "/home/mayyankg/Desktop/newspulse/summarization/SumTO_financial_summarization/Summarizer.py", line 41, in __init__
free_gpu = int(self.get_freer_gpu())
File "/home/mayyankg/Desktop/newspulse/summarization/SumTO_financial_summarization/Summarizer.py", line 53, in get_freer_gpu
return np.argmax(memory_available)
File "<array_function internals>", line 6, in argmax
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/numpy/core/fromnumeric.py", line 1195, in argmax
return _wrapfunc(a, 'argmax', axis=axis, out=out)
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/numpy/core/fromnumeric.py", line 54, in _wrapfunc
return _wrapit(obj, method, *args, **kwds)
File "/home/mayyankg/.conda/envs/finance_summ/lib/python3.7/site-packages/numpy/core/fromnumeric.py", line 43, in _wrapit
result = getattr(asarray(obj), method)(*args, **kwds)
ValueError: attempt to get argmax of an empty sequence
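
From the last traceback, get_freer_gpu ends up calling np.argmax on an empty memory_available, which would be the case if no GPU memory readings were collected (for example on a machine without a visible GPU; that is my guess, not something the log confirms). Below is a minimal sketch of a guard for that case. The names pick_freest_gpu and choose_device are hypothetical and not part of the repository, and the CPU fallback is an assumption about what a sensible default would be, not how Summarizer.py actually behaves.

```python
from typing import Optional, Sequence


def pick_freest_gpu(memory_available: Sequence[int]) -> Optional[int]:
    """Return the index of the GPU with the most free memory.

    Returns None when no readings are available, instead of raising
    "attempt to get argmax of an empty sequence".
    """
    if len(memory_available) == 0:
        return None
    # For a flat list this matches np.argmax: first index of the maximum.
    return max(range(len(memory_available)), key=lambda i: memory_available[i])


def choose_device(memory_available: Sequence[int]) -> str:
    """Hypothetical helper: fall back to CPU when no GPU is reported."""
    gpu_index = pick_freest_gpu(memory_available)
    return "cpu" if gpu_index is None else f"cuda:{gpu_index}"
```

With a guard like this, an empty list would lead to a "cpu" device string rather than the ValueError shown above.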