
Support Long context for DocSum #1255

Merged: 24 commits merged into opea-project:main on Dec 20, 2024
Conversation

@XinyaoWa (Collaborator) commented Dec 17, 2024

Description

Support Long context for DocSum with five modes (a minimal sketch follows the list):
• Auto (default mode): use "stuff" mode if the input token count is below max_input_tokens, otherwise switch to "refine" mode
• Stuff: pass the actual input tokens through as-is; increase max_input_tokens if you want to use a large context
• Truncate: truncate the tokens that exceed the limit
• Map_reduce: split the input into multiple chunks, map each chunk to an individual summary, then consolidate those summaries into a single global summary
• Refine: split the input into multiple chunks, generate a summary for the first one, combine it with the second, then loop over every remaining chunk to get the final summary
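
A minimal, self-contained sketch of the five modes in Python. The helpers here (count_tokens, split_into_chunks, llm_summarize) are hypothetical stand-ins, not the actual DocSum implementation: token counting is a whitespace approximation, and llm_summarize is a stub where a real deployment would call the LLM serving microservice.

```python
def count_tokens(text: str) -> int:
    # Crude whitespace proxy for a real tokenizer.
    return len(text.split())

def split_into_chunks(text: str, max_tokens: int) -> list[str]:
    # Fixed-size word chunks; a real implementation would split on
    # semantic boundaries and may overlap chunks.
    words = text.split()
    return [" ".join(words[i:i + max_tokens]) for i in range(0, len(words), max_tokens)]

def llm_summarize(text: str) -> str:
    # Stub: a real implementation would call the LLM serving endpoint.
    return text[:200]

def summarize(text: str, mode: str = "auto", max_input_tokens: int = 2048) -> str:
    if mode == "auto":
        # Default: "stuff" when the input fits, otherwise "refine".
        mode = "stuff" if count_tokens(text) < max_input_tokens else "refine"
    if mode == "stuff":
        return llm_summarize(text)  # full input; raise max_input_tokens for large contexts
    if mode == "truncate":
        return llm_summarize(" ".join(text.split()[:max_input_tokens]))
    if mode == "map_reduce":
        # Map each chunk to a partial summary, then reduce to a global one.
        partials = [llm_summarize(c) for c in split_into_chunks(text, max_input_tokens)]
        return llm_summarize("\n".join(partials))
    if mode == "refine":
        # Summarize the first chunk, then fold in each remaining chunk.
        chunks = split_into_chunks(text, max_input_tokens)
        if not chunks:
            return ""
        summary = llm_summarize(chunks[0])
        for chunk in chunks[1:]:
            summary = llm_summarize(summary + "\n" + chunk)
        return summary
    raise ValueError(f"unknown summary mode: {mode!r}")
```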

Related PRs: opea-project/GenAIComps#981, opea-project/GenAIComps#1046

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change as below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

List any newly introduced 3rd-party dependency, if one exists.

Tests

Describe the tests that you ran to verify your changes.
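
For example, a hypothetical smoke test against a deployed DocSum gateway could look like the sketch below. The port (8888), endpoint path (/v1/docsum), and the "summary_type" field are assumptions based on the related GenAIComps PRs, not confirmed by this PR text; adjust them to the actual deployment.

```python
# Hypothetical smoke test: host, port, path, and payload fields are assumptions.
import requests

resp = requests.post(
    "http://localhost:8888/v1/docsum",
    json={
        "messages": "Text of the document to summarize ...",
        "summary_type": "auto",  # assumed field name; one of the five modes
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json())
```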

github-actions bot commented Dec 17, 2024

Dependency Review

✅ No vulnerabilities or license issues found.


@eero-t (Contributor) commented Dec 17, 2024

@lvliang-intel The Dockerfile check fails due to a pre-existing "MultimodalQnA" docs issue; this PR does not touch those files:

Missing Dockerfile: GenAIComps/comps/retrievers/multimodal/redis/langchain/Dockerfile (Referenced in GenAIExamples/./MultimodalQnA/docker_compose/intel/cpu/xeon/README.md:127)
Missing Dockerfile: GenAIComps/comps/retrievers/multimodal/redis/langchain/Dockerfile (Referenced in GenAIExamples/./MultimodalQnA/docker_compose/intel/hpu/gaudi/README.md:78)
Error: Process completed with exit code 1.

"DocSum, xeon" CI test failed to:

...
[ docsum-xeon-backend-server ] Content is as expected.
...
[ docsum-gaudi-backend-server ] Content is as expected.
...
[ docsum-gaudi-backend-server ] Content is as expected.
...
[ docsum-gaudi-backend-server ] Content does not match the expected result: 
Error response from daemon: No such container: docsum-gaudi-backend-server

That is rather surprising, given that this test is supposed to run on Xeon, and there is a separate test that runs the same things.

"DocSum, gaudi" test fails due to backend error exit:

...
[ docsum-gaudi-backend-server ] Content is as expected.
...
[ docsum-gaudi-backend-server ] Content is as expected.
...
[ docsum-gaudi-backend-server ] Content does not match the expected result: 
 Error: Process completed with exit code 1.

"DocSum, rocm" test fails to something that looks like CI issue:
aiohttp.client_exceptions.ClientConnectorError: Cannot connect to host 0.0.0.0:7079 ssl:default [Connect call failed ('0.0.0.0', 7079)]

Signed-off-by: Xinyao Wang <[email protected]>
@XinyaoWa (Collaborator, Author) commented:

CICD pending for this PR: opea-project/GenAIComps#1046

@XinyaoWa mentioned this pull request on Dec 20, 2024
@lvliang-intel lvliang-intel merged commit 50dd959 into opea-project:main Dec 20, 2024
29 checks passed
@mkbhanda (Collaborator) commented Jan 8, 2025

@XinyaoWa thank you for this PR! I am curious about a couple of issues:
1) What happens in map_reduce mode if the summaries together total more input tokens than max_input_tokens: hierarchical map-reduce, or a fall-back to refine? (A sketch of one option follows below.)
2) If possible, could you also explain the formulas used (the 50, etc.)?
3) Are there speed/accuracy trade-offs to using smaller chunks?
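
For context on question 1, one possible answer is a hierarchical reduce that re-chunks and re-summarizes until the combined summaries fit in a single call. This is a hedged sketch reusing the hypothetical helpers from the earlier mode sketch, not a claim about the PR's actual behavior:

```python
# Hypothetical hierarchical reduce; reuses count_tokens, split_into_chunks,
# and llm_summarize from the earlier sketch. Assumes each llm_summarize call
# shrinks its input, so the loop terminates.
def reduce_summaries(partials: list[str], max_input_tokens: int) -> str:
    combined = "\n".join(partials)
    # Keep collapsing until the combined summaries fit in one LLM call.
    while count_tokens(combined) >= max_input_tokens:
        chunks = split_into_chunks(combined, max_input_tokens)
        combined = "\n".join(llm_summarize(c) for c in chunks)
    return llm_summarize(combined)
```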
