Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TA] Add analyze tasks feature support #17267

Merged
Merged
Show file tree
Hide file tree
Changes from 22 commits
Commits
Show all changes
25 commits
Select commit Hold shift + click to select a range
7b0e24c
TA-Healthcare
mssfang Nov 5, 2020
446bb42
remove checkstyle supppresion
mssfang Nov 5, 2020
5ac3e30
regenerate swagger ater a new change merged in
mssfang Nov 5, 2020
c03aa75
transfer to laptop
mssfang Nov 5, 2020
968acb9
cancellation is working now
mssfang Nov 6, 2020
1b5dbc8
address mari's most feedbacks
mssfang Nov 6, 2020
53e0d0e
replace a wrong json file for async pagination test
mssfang Nov 6, 2020
53654d6
init analyze tasks
mssfang Nov 6, 2020
8d79350
removed xxxTaskState class and TaskStete
mssfang Nov 6, 2020
5c7f4ce
add changelog
mssfang Nov 6, 2020
8c57c6c
resolve conflict and updates changes
mssfang Nov 6, 2020
37c3bf1
resolved conflict and add more tests
mssfang Nov 11, 2020
febe0be
regenerate code base on swagger 5ef5a597b3f2342bfd254ed79b97b2fe160e50a1
mssfang Nov 13, 2020
9a25060
address feedbacks
mssfang Nov 13, 2020
ee4ae09
improve PLAYBACK test speed
mssfang Nov 13, 2020
cd58ae8
checekstyle issue
mssfang Nov 13, 2020
d1c02b5
update Analyze API endpoint
mssfang Nov 17, 2020
c94420c
add issue link to TODO lists
mssfang Nov 17, 2020
8802740
renaming some APIs and address feedbacks
mssfang Nov 18, 2020
ae88549
make final class if possible
mssfang Nov 18, 2020
047f2b5
make HealthcareEntityCollection final class
mssfang Nov 18, 2020
201e025
service already fixed the nextLink to @nextLink
mssfang Nov 18, 2020
d52decd
add NPE and add live test setup
mssfang Nov 19, 2020
668650b
address last feedbacks
mssfang Nov 19, 2020
b819e4f
update readme content and links
mssfang Nov 19, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
24 changes: 13 additions & 11 deletions sdk/textanalytics/azure-ai-textanalytics/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,9 @@
# Release History
## 5.1.0-beta.3 (Unreleased)
**New features**
- Added support for Healthcare analysis, it is a long-running operation, and the cancellation supported.
### New features
- Added support for healthcare analysis feature. It is represented as a long-running operation. Cancellation supported.
- Added support for analyze tasks feature, It analyzes multiple tasks (such as, entity recognition, PII entity recognition
and key phrases extraction) simultaneously in a list of document.

## 5.1.0-beta.2 (2020-10-06)
### Breaking changes
Expand All @@ -17,7 +19,7 @@
- `length` is the number of characters in the text of these models
- `offset` is the offset of the text from the start of the document

**New features**
### New features
- Updated Text Analytics SDK's default service API version to `v3.1-preview.2` from `v3.0`.
- Added support for Personally Identifiable Information(PII) entity recognition feature.
To use this feature, you need to make sure you are using the service's v3.1-preview.1 API.
Expand All @@ -38,14 +40,14 @@ about the returned entity.
- First stable release of `azure-ai-textanalytics`.

## 1.0.0-beta.5 (2020-05-27)
**New features**
### New features
- Added Text property and `getText()` to `SentenceSentiment`.
- `Warnings` property added to each document-level response object returned from the endpoints. It is a list of `TextAnalyticsWarnings`.
- Added `CategorizedEntityCollection`, `KeyPhrasesCollection`, `LinkedEntityCollection` for having `getWarnings()` to retrieve warnings.
- Added a new enum value `ADDRESS` to `EntityCategory`.
- Text analytics SDK update the service to version `v3.0` from `v3.0-preview.1`.

**Breaking changes**
### Breaking changes
- Removed pagination feature, which removed `TextAnalyticsPagedIterable`, `TextAnalyticsPagedFlux` and `TextAnalyticsPagedResponse`
- Removed overload methods for API that takes a list of String, only keep max-overload API that has a list of String, language or country hint, and `TextAnalyticsRequestOption`.
- Renamed `apiKey()` to `credential()` on TextAnalyticsClientBuilder.
Expand Down Expand Up @@ -73,18 +75,18 @@ about the returned entity.
## 1.0.0-beta.4 (2020-04-07)
- Throws an illegal argument exception when the given list of documents is an empty list.

**Breaking changes**
### Breaking changes
- Renamed all input parameters `text` to `document`, and `inputTexts` to `documents`.
- Removed all PII endpoints and update with related changes, such as remove related models, samples, codesnippets, docstrings, etc from this library.
- Replaced `TextAnalyticsApiKeyCredential` with `AzureKeyCredential`.

## 1.0.0-beta.3 (2020-03-10)
**New features**
### New features
- Introduced `TextAnalyticsPagedFlux`, `TextAnalyticsPagedIterable`, and `TextAnalyticsPagedResponse` type. Moved `modelVersion` amd `TextDocumentBatchStatistics` into `TextAnalyticsPagedResponse`. All collection APIs are return `TextAnalyticsPagedFlux` and `TextAnalyticsPagedIterable` in the asynchronous and synchronous client, respectively. So `DocumentResultCollection` is no longer required. Most of existing API surface are changes. Please check up `TextAnalyticsAsyncClient` and `TextAnalyticsClient` for more detail.
- Introduced `EntityCategory` class to support major entity categories that the service supported.
- Added `getDefaultCountryHint()`, `getDefaultLanguage()` and `getServiceVersion()` to `TextAnalyticsClient`

**Breaking changes**
### Breaking changes
- Supported `Iterable<T>` instead of `List<T>` text inputs.
- Default language and country hint can only be assigned value when building a Text Analytics client.
- Renamed `showStatistics()` to `isIncludeStatistics()` in the `TextAnalyticsRequestOptions`.
Expand All @@ -97,7 +99,7 @@ about the returned entity.

## 1.0.0-beta.2 (2020-02-12)

**Breaking changes**
### Breaking changes

- The single text, module-level operations return an atomic type of the operation result. For example, `detectLanguage(String text)` returns a `DetectedLanguage` rather than a `DetectLanguageResult`.

Expand Down Expand Up @@ -137,11 +139,11 @@ about the returned entity.
- `getLinkedEntities()` to `getEntities()` and variable `linkedEntities` to `entities`.
- Added suffix of `batch` to all operations' method name that takes a collection of input.

**New features**
### New features

- Credential class `TextAnalyticsApiKeyCredential` provides an `updateCredential()` method which allows you to update the API key for long-lived clients.

**Fixes and improvements**
### Breaking changes

- If you try to access a result attribute on a `DocumentError` object, a `TextAnalyticsException` is raised with a custom error message that provides the document ID and error of the invalid document.

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -35,6 +35,7 @@
import com.azure.core.util.polling.PollingContext;
import reactor.core.publisher.Mono;

import java.time.Duration;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
Expand All @@ -44,7 +45,7 @@
import java.util.stream.Collectors;

import static com.azure.ai.textanalytics.TextAnalyticsAsyncClient.COGNITIVE_TRACING_NAMESPACE_VALUE;
import static com.azure.ai.textanalytics.implementation.Utility.DEFAULT_POLL_DURATION;
import static com.azure.ai.textanalytics.implementation.Utility.DEFAULT_POLL_INTERVAL;
import static com.azure.ai.textanalytics.implementation.Utility.inputDocumentsValidation;
import static com.azure.ai.textanalytics.implementation.Utility.parseModelId;
import static com.azure.ai.textanalytics.implementation.Utility.parseNextLink;
Expand Down Expand Up @@ -74,12 +75,16 @@ PollerFlux<TextAnalyticsOperationResult, PagedFlux<HealthcareTaskResult>> beginA
try {
inputDocumentsValidation(documents);
String modelVersion = null;
Duration pollInterval = DEFAULT_POLL_INTERVAL;
if (options != null) {
modelVersion = options.getModelVersion();
pollInterval = options.getPollInterval();
samvaity marked this conversation as resolved.
Show resolved Hide resolved
}
final Boolean finalIncludeStatistics = options == null ? null : options.isIncludeStatistics();
final Integer finalTop = options == null ? null : options.getTop();
final Integer finalSkip = options == null ? null : options.getSkip();
return new PollerFlux<>(
DEFAULT_POLL_DURATION,
pollInterval,
activationOperation(service.healthWithResponseAsync(
new MultiLanguageBatchInput().setDocuments(toMultiLanguageInput(documents)),
modelVersion,
Expand All @@ -92,12 +97,12 @@ PollerFlux<TextAnalyticsOperationResult, PagedFlux<HealthcareTaskResult>> beginA
parseModelId(healthResponse.getDeserializedHeaders().getOperationLocation()));
return textAnalyticsOperationResult;
})),
pollingOperation(jobId -> service.healthStatusWithResponseAsync(jobId, null, null,
finalIncludeStatistics, context)),
pollingOperation(healthcareTaskId -> service.healthStatusWithResponseAsync(healthcareTaskId,
finalTop, finalSkip, finalIncludeStatistics, context)),
(activationResponse, pollingContext) ->
monoError(logger, new RuntimeException("Use the `beginCancelHealthcareJob` to cancel the job")),
fetchingOperation(resultId -> Mono.just(getHealthcareFluxPage(resultId,
finalIncludeStatistics == null ? false : finalIncludeStatistics, context)))
fetchingOperation(resultId -> Mono.just(getHealthcareFluxPage(resultId, finalTop, finalSkip,
finalIncludeStatistics, context)))
);
} catch (RuntimeException ex) {
return PollerFlux.error(ex);
Expand All @@ -109,12 +114,16 @@ PollerFlux<TextAnalyticsOperationResult, PagedIterable<HealthcareTaskResult>> be
try {
inputDocumentsValidation(documents);
String modelVersion = null;
Duration pollInterval = DEFAULT_POLL_INTERVAL;
if (options != null) {
modelVersion = options.getModelVersion();
pollInterval = options.getPollInterval();
}
final Boolean finalIncludeStatistics = options == null ? null : options.isIncludeStatistics();
final Integer finalTop = options == null ? null : options.getTop();
final Integer finalSkip = options == null ? null : options.getSkip();
return new PollerFlux<>(
DEFAULT_POLL_DURATION,
pollInterval,
mssfang marked this conversation as resolved.
Show resolved Hide resolved
activationOperation(service.healthWithResponseAsync(
new MultiLanguageBatchInput().setDocuments(toMultiLanguageInput(documents)),
modelVersion,
Expand All @@ -127,36 +136,37 @@ PollerFlux<TextAnalyticsOperationResult, PagedIterable<HealthcareTaskResult>> be
parseModelId(healthResponse.getDeserializedHeaders().getOperationLocation()));
return textAnalyticsOperationResult;
})),
pollingOperation(jobId -> service.healthStatusWithResponseAsync(jobId, null, null,
finalIncludeStatistics, context)),
pollingOperation(healthcareTaskId -> service.healthStatusWithResponseAsync(healthcareTaskId, null,
null, finalIncludeStatistics, context)),
(activationResponse, pollingContext) ->
monoError(logger, new RuntimeException("Use the `beginCancelHealthcareJob` to cancel the job")),
fetchingOperationIterable(resultId -> Mono.just(new PagedIterable<>(getHealthcareFluxPage(resultId,
finalIncludeStatistics == null ? false : finalIncludeStatistics, context))))
finalTop, finalSkip, finalIncludeStatistics, context))))
);
} catch (RuntimeException ex) {
return PollerFlux.error(ex);
}
}

PagedFlux<HealthcareTaskResult> getHealthcareFluxPage(UUID jobID, boolean showStats, Context context) {
PagedFlux<HealthcareTaskResult> getHealthcareFluxPage(UUID healthcareTaskId, Integer top, Integer skip,
Boolean showStats, Context context) {
return new PagedFlux<>(
() -> getPage(null, jobID, showStats, context),
continuationToken -> getPage(continuationToken, jobID, showStats, context));
() -> getPage(null, healthcareTaskId, top, skip, showStats, context),
continuationToken -> getPage(continuationToken, healthcareTaskId, top, skip, showStats, context));
}

Mono<PagedResponse<HealthcareTaskResult>> getPage(String continuationToken, UUID jobID,
boolean showStats, Context context) {
Mono<PagedResponse<HealthcareTaskResult>> getPage(String continuationToken, UUID healthcareTaskId, Integer top,
Integer skip, Boolean showStats, Context context) {
try {
if (continuationToken != null) {
final Map<String, Integer> continuationTokenMap = parseNextLink(continuationToken);
final Integer topValue = continuationTokenMap.getOrDefault("$top", null);
final Integer skipValue = continuationTokenMap.getOrDefault("$skip", null);
return service.healthStatusWithResponseAsync(jobID, topValue, skipValue, showStats, context)
return service.healthStatusWithResponseAsync(healthcareTaskId, topValue, skipValue, showStats, context)
.map(this::toTextAnalyticsPagedResponse)
.onErrorMap(Utility::mapToHttpResponseExceptionIfExist);
} else {
return service.healthStatusWithResponseAsync(jobID, null, null, showStats, context)
return service.healthStatusWithResponseAsync(healthcareTaskId, top, skip, showStats, context)
.map(this::toTextAnalyticsPagedResponse)
.onErrorMap(Utility::mapToHttpResponseExceptionIfExist);
}
Expand All @@ -174,8 +184,9 @@ private PagedResponse<HealthcareTaskResult> toTextAnalyticsPagedResponse(
final List<TextAnalyticsError> errors = healthcareJobState.getErrors();

final HealthcareTaskResult healthcareTaskResult = new HealthcareTaskResult(
// TODO: change back to UUID after service support it.
healthcareJobState.getJobId().toString(),
// TODO: [Service-Bug] change back to UUID after service support it.
// https://github.com/Azure/azure-sdk-for-java/issues/17629
healthcareJobState.getJobId(),
healthcareJobState.getCreatedDateTime(),
healthcareJobState.getLastUpdateDateTime(),
toJobState(healthcareJobState.getStatus()),
Expand All @@ -196,12 +207,13 @@ private PagedResponse<HealthcareTaskResult> toTextAnalyticsPagedResponse(
null);
}

PollerFlux<TextAnalyticsOperationResult, Void> beginCancelAnalyzeHealthcare(UUID jobId, Context context) {
PollerFlux<TextAnalyticsOperationResult, Void> beginCancelAnalyzeHealthcare(String healthTaskId,
RecognizeHealthcareEntityOptions options, Context context) {
try {
Objects.requireNonNull(jobId, "'jobId' is required and cannot be null.");
Objects.requireNonNull(healthTaskId, "'healthTaskId' is required and cannot be null.");
return new PollerFlux<>(
DEFAULT_POLL_DURATION,
activationOperation(service.cancelHealthJobWithResponseAsync(jobId,
options == null ? DEFAULT_POLL_INTERVAL : options.getPollInterval(),
activationOperation(service.cancelHealthJobWithResponseAsync(UUID.fromString(healthTaskId),
samvaity marked this conversation as resolved.
Show resolved Hide resolved
context.addData(AZ_TRACING_NAMESPACE_KEY, COGNITIVE_TRACING_NAMESPACE_VALUE))
.map(healthResponse -> {
final TextAnalyticsOperationResult textAnalyticsOperationResult =
Expand Down Expand Up @@ -286,7 +298,7 @@ private Mono<PollResponse<TextAnalyticsOperationResult>> processAnalyzeModelResp
logger.info("LongRunningOperation-Cancelling");
status = LongRunningOperationStatus.IN_PROGRESS;
break;
case NOTSTARTED:
case NOT_STARTED:
case RUNNING:
status = LongRunningOperationStatus.IN_PROGRESS;
break;
Expand Down
Loading