With the emergence of large e-infrastructures and the widespread adoption of the Service Oriented Architecture (SOA) paradigm, more and more language technology is being made available through web services. Extending such services to linguistic processing pipelines, tool evaluation or LR production and validation involves considering both the methodologies and technical aspects specific to the application domains.
Distributed architectures such as web services allow communication and data exchange between applications. They are a suitable instrument for automatic, less often semi-automatic, tool evaluation as well as resource production processes both for practical and conceptual reasons. At a practical level, web services support quick results, centralised data storage, remote access etc.; at a conceptual level, they allow for the combination of more than one processing components that may be located on different sites. Such processing pipelines are set up to tackle a particular analysis task. To support these, new techniques have to be developed that organise well-established practices into workflows and support the exchange of data by standards and open tool architectures.
The workshop focuses on current uses and best practices for the deployment of web services and web interfaces in the HLT domain, including processing pipelines, LR production and validation, and evaluation of tools. It highlights relevant aspects for the integration of linguistic or evaluation web services within infrastructures (e.g. authorisation and authentication, service registries) and infrastructural requirements (e.g. interface harmonisation, metadata generation). The workshop also aims at demonstrating different approaches on how to combine linguistic web services into a composite web service.
The expected outcome of the workshop is a comparison of the practices in architectures and processing pipelines that people build and discussion of the issues involved. Topics of interest include, but are not limited to:
Technical aspects: approaches, protocols, management of huge amounts of data, data structures and formats, performance, manual components (e.g. annotation or evaluation), composition and configuration, interoperability, security, monitoring and recovery strategies, standardisation of APIs, tools and frameworks supporting HLT services deployment, architectures.
Scientific aspects: influence of web services on evaluation or resource production, meta-evaluation / validation of architectures, annotation agreements, needs for tools evaluation and resource production, status of the data produced.
Commercial aspects: licensing, privacy, advertising, brokering, business possibilities, challenges, exploitation of the resulting data.