Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,100 @@
|
|
1 |
-
---
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: en
|
3 |
+
tags:
|
4 |
+
- tessar
|
5 |
+
- table-question-answering
|
6 |
+
- svector
|
7 |
+
- neural-sql-executor
|
8 |
+
datasets:
|
9 |
+
- Stanford/wikitablequestions
|
10 |
+
license: mit
|
11 |
+
pipeline_tag: table-question-answering
|
12 |
+
---
|
13 |
+
|
14 |
+
# Tessar (Large-Sized Model)
|
15 |
+
|
16 |
+
Tessar is an advanced table reasoning model developed by SVECTOR, building upon the groundbreaking research and pushing the boundaries of neural table understanding.
|
17 |
+
|
18 |
+
## Model Description
|
19 |
+
|
20 |
+
Tessar (**Te**xtual **S**QL **A**nalysis and **R**easoning) is a sophisticated neural model designed to excel in table-based question answering. Tessar implements an innovative neural SQL executor that can interpret and reason over complex tabular data with remarkable precision.
|
21 |
+
|
22 |
+
The model is constructed using the BART architecture, featuring a bidirectional encoder and an autoregressive decoder. This design allows Tessar to capture intricate contextual relationships within tabular data and generate accurate, contextually relevant answers.
|
23 |
+
|
24 |
+
### Key Features
|
25 |
+
- Advanced neural SQL execution capabilities
|
26 |
+
- State-of-the-art performance on complex table question answering
|
27 |
+
- Robust handling of nuanced and multi-step queries
|
28 |
+
- Fine-tuned on the WikiTableQuestions dataset
|
29 |
+
|
30 |
+
## Intended Uses
|
31 |
+
|
32 |
+
Tessar is particularly powerful for solving complex table-based questions across various domains. Here are some example questions the model can effectively address:
|
33 |
+
|
34 |
+
| Question | Example Answer |
|
35 |
+
|:---: |:---:|
|
36 |
+
| According to the table, what is the last title produced? | Specific Title |
|
37 |
+
| What is the difference in a specific comparative metric? | Numerical Difference |
|
38 |
+
| Which entity had the most significant impact in a given context? | Identified Entity |
|
39 |
+
| What were the first and last entries in a specific column? | Comparative Entries |
|
40 |
+
|
41 |
+
### How to Use
|
42 |
+
|
43 |
+
Here's a comprehensive example of using Tessar with the Transformers library:
|
44 |
+
|
45 |
+
```python
|
46 |
+
from transformers import TessarTokenizer, BartForConditionalGeneration
|
47 |
+
import pandas as pd
|
48 |
+
|
49 |
+
# Load Tessar model and tokenizer
|
50 |
+
tokenizer = TessarTokenizer.from_pretrained("SVECTOR-CORPORATION/Tessar-largest")
|
51 |
+
model = BartForConditionalGeneration.from_pretrained("SVECTOR-CORPORATION/Tessar-largest")
|
52 |
+
|
53 |
+
# Prepare sample table data
|
54 |
+
data = {
|
55 |
+
"year": [1896, 1900, 1904, 2004, 2008, 2012],
|
56 |
+
"city": ["athens", "paris", "st. louis", "athens", "beijing", "london"]
|
57 |
+
}
|
58 |
+
table = pd.DataFrame.from_dict(data)
|
59 |
+
|
60 |
+
# Ask a specific query
|
61 |
+
query = "In which year did beijing host the Olympic Games?"
|
62 |
+
encoding = tokenizer(table=table, query=query, return_tensors="pt")
|
63 |
+
|
64 |
+
# Generate answer
|
65 |
+
outputs = model.generate(**encoding)
|
66 |
+
|
67 |
+
# Decode and print result
|
68 |
+
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
|
69 |
+
# Expected output: [' 2008.0']
|
70 |
+
```
|
71 |
+
|
72 |
+
### Evaluation
|
73 |
+
|
74 |
+
For comprehensive evaluation scripts and benchmarking, please refer to the SVECTOR documentation and research repositories.
|
75 |
+
|
76 |
+
### Performance Highlights
|
77 |
+
- Exceptional accuracy on complex table reasoning tasks
|
78 |
+
- Robust handling of multi-step and contextual queries
|
79 |
+
- State-of-the-art performance on WikiTableQuestions dataset
|
80 |
+
|
81 |
+
### Citation
|
82 |
+
|
83 |
+
If you use Tessar in your research the SVECTOR implementation:
|
84 |
+
|
85 |
+
```bibtex
|
86 |
+
@inproceedings{svector2025tessar,
|
87 |
+
title={{Tessar}: Advanced Neural Table Reasoning},
|
88 |
+
author={{SVECTOR Team}},
|
89 |
+
year={2025},
|
90 |
+
institution={SVECTOR Research}
|
91 |
+
}
|
92 |
+
```
|
93 |
+
|
94 |
+
### Contact and Support
|
95 |
+
|
96 |
+
For further information, support, or collaboration opportunities, please contact SVECTOR's research team at [email protected].
|
97 |
+
|
98 |
+
### License
|
99 |
+
|
100 |
+
This model is released under the MIT License. Please review the licensing terms before use.
|