<?xml 
version="1.0" encoding="utf-8"?>
<rss version="2.0" 
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
>

<channel xml:lang="en">
	<title>LTPD 2012 Workshop</title>
	<link>http://workshops.elda.org/ltpd2012/</link>
	
	<language>en</language>
	<generator>SPIP - www.spip.net</generator>




<item xml:lang="en">
		<title>Main Conference</title>
		<link>http://workshops.elda.org/ltpd2012/Main-Conference,7</link>
		<guid isPermaLink="true">http://workshops.elda.org/ltpd2012/Main-Conference,7</guid>
		<dc:date>2009-12-17T15:59:52Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>WSPP</dc:creator>


		<dc:subject>ouvert_rubrique</dc:subject>

		<description>The Workshop on Language Technology for Patent Data: Language Resources and Evaluation is held in conjunction with the LREC 2012 conference organised by ELRA.

-
&lt;a href="http://workshops.elda.org/ltpd2012/-Main-Conference-" rel="directory"&gt;Main Conference&lt;/a&gt;

/ 
&lt;a href="http://workshops.elda.org/ltpd2012/+-ouvert_rubrique,5-+" rel="tag"&gt;ouvert_rubrique&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;The Workshop on &lt;i&gt;Language Technology for Patent Data: Language Resources and Evaluation&lt;/i&gt; is held in conjunction with the &lt;a href='http://www.lrec-conf.org/lrec2012/' class='spip_out' rel='external'&gt;LREC 2012 conference&lt;/a&gt; organised by ELRA.&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Call for Papers</title>
		<link>http://workshops.elda.org/ltpd2012/Call-for-Papers,6</link>
		<guid isPermaLink="true">http://workshops.elda.org/ltpd2012/Call-for-Papers,6</guid>
		<dc:date>2009-12-17T15:50:32Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>WSPP</dc:creator>


		<dc:subject>ouvert_rubrique</dc:subject>

		<description>In the last few years, the use of patents in automatic processing has shown a growing interest in the NLP community. This has been particularly the case in the context of Machine Translation (MT) or Cross-Lingual Information Retrieval (CLIR). Nowadays this has become a major topic and besides the development of the technology itself, some key points remain regarding the resources available and the way of evaluating the quality of the technology. A large number of language resources is (...)

-
&lt;a href="http://workshops.elda.org/ltpd2012/-Description,1-" rel="directory"&gt;Call for Papers&lt;/a&gt;

/ 
&lt;a href="http://workshops.elda.org/ltpd2012/+-ouvert_rubrique,5-+" rel="tag"&gt;ouvert_rubrique&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;In the last few years, the use of patents in automatic processing has shown a growing interest in the NLP community. This has been particularly the case in the context of Machine Translation (MT) or Cross-Lingual Information Retrieval (CLIR). Nowadays this has become a major topic and besides the development of the technology itself, some key points remain regarding the resources available and the way of evaluating the quality of the technology.&lt;/p&gt; &lt;p&gt;A large number of language resources is already available for the community, but the development of systems, in particular the statistical ones, always requires more and more data. As there is a growing interest for patents and their processing, a workshop on the topic which gathers all those involved in the different aspects concerned is a good opportunity to move forward.&lt;/p&gt; &lt;p&gt;The domain of patents itself is increasing and the amount of potential material does not cease to increase. It is this potential material that gives hope to the community for improving the systems. For instance, in China, the number of patents have been multiplied by 3 in 5 years and they exceed 1 million published documents per year by now. EPO (the European Patent Office) uses more than 150 translation pairs per day. Every patent office receives more and more patents every day, needs a daily use of automatic tools to translate the documents, looks for existing patents and their translation, manages complex content, etc. As we can see, this is a domain in considerable demand and since the content of the patents is technical and needs high skills in a specific domain, providing documents that are sufficiently understandable to the end users is very complex. This is a real challenge for all NLP developers.&lt;/p&gt; &lt;p&gt;Above all, this challenge is about corpora and their management. The main topic concerns their acquisition and how to collect useful data. For most of the researchers, this consists in harvesting web pages, cleaning them, getting the useful content according to a specific task, aligning the sentences, etc. The acquisition task may also be done using OCR tools on PDF. Monolingual corpora are easier to retrieve (e.g. from databases) compared to parallel corpora. However, parallel translations exist and aligned corpora as well, or corpora that could be easily aligned. Following the question of the acquisition of such documents, there is that of database management. One could say that all these questions are not only related to patent data, however this workshop would like focus on this particular domain and make some effort to improve things.&lt;/p&gt; &lt;p&gt;Currently, the corpora are mainly used for MT. For a technical end-user in a patent office, the end goal is to manage to understand the content of a document. This may not require a very high quality translation since this person only needs to grasp the relevance of the document. However, in MT, we still need to measure quantitatively the performance of the systems. This is basically made using automatic and/or human measures, while most of the system developers are using typical automatic metrics such as BLEU to get their results. Even if the drawbacks of such metrics are well-known, it could be still relevant, for instance, to compare different versions of a system. However, even when using BLEU, the content of patent documents is very particular, which implies that different kinds of linguistic specificity need to be tackled: these include the already expected terminological level, but also a syntactic level, a semantic one, and even the structure of the documents may be different from that of other documents (for instance, patents typically comprise of a title, an abstract, a technical description of the invention, and a list of novel claims). Human measures may be also difficult to apply as patent documents are written in a way which makes them difficult to read for the layman. Furthermore, both automatic and human evaluations should have the chance to realise a deep analysis of the results, which is not trivial working with patents. However, given the often formulaic nature of the text found in patents &#8211; which is enforced on the author due to legal constraints &#8211; there may be opportunities to exploit this for evaluation. For instance, claims are constructed as a single sentence with an introductory phrase and a body linked by frequently occurring terms such as &#8220;in a certain embodiment&#8221;, &#8220;consisting essentially of&#8221;, and clauses and lists introduced using colons, e.g. &#8220;comprising: &#8230;&#8221;&lt;/p&gt; &lt;p&gt;The use of patents in CLIR suffers from the same kind of issues, either for the evaluation of systems or for the collection of corpora. Sentence alignment may also have specific issues related to the content of the documents, and many other types of tools may have their own thoughts using patents. Through all those technologies, one can see their usage implies several challenges, such as the integration of tools into patent information applications. The different tools should help end-users to search, examine or classify patent documents, most of the time from translations and not available in English. Web services should also be an extension of the tools and web services should be connected through workflows, helping end-users in their daily work.&lt;/p&gt; &lt;p&gt;Among all the topics previously mentioned, we would like to contribute to the improvement of the challenging patent field, by sharing the knowledge from the whole community.&lt;/p&gt; &lt;p&gt;The different topics addressed during the workshop will be (but are not limited to):&lt;/p&gt; &lt;p&gt;&lt;img src=&quot;http://workshops.elda.org/ltpd2012/sites/ltpd2012/local/cache-vignettes/L8xH11/puce-32883.gif&quot; width='8' height='11' class='puce' alt=&quot;-&quot; style='height:11px;width:8px;' /&gt; Corpora aspects: collecting data, cleaning, alignment, parallel corpora, etc.;
&lt;br /&gt;&lt;img src=&quot;http://workshops.elda.org/ltpd2012/sites/ltpd2012/local/cache-vignettes/L8xH11/puce-32883.gif&quot; width='8' height='11' class='puce' alt=&quot;-&quot; style='height:11px;width:8px;' /&gt; Evaluation of technologies: definition of metrics, patent specificity;
&lt;br /&gt;&lt;img src=&quot;http://workshops.elda.org/ltpd2012/sites/ltpd2012/local/cache-vignettes/L8xH11/puce-32883.gif&quot; width='8' height='11' class='puce' alt=&quot;-&quot; style='height:11px;width:8px;' /&gt; Integration of patent applications: web services, end-user applications;
&lt;br /&gt;&lt;img src=&quot;http://workshops.elda.org/ltpd2012/sites/ltpd2012/local/cache-vignettes/L8xH11/puce-32883.gif&quot; width='8' height='11' class='puce' alt=&quot;-&quot; style='height:11px;width:8px;' /&gt; IPR issues and licensing.&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Submissions</title>
		<link>http://workshops.elda.org/ltpd2012/Call-for-Papers,4</link>
		<guid isPermaLink="true">http://workshops.elda.org/ltpd2012/Call-for-Papers,4</guid>
		<dc:date>2009-12-17T15:49:21Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>WSPP</dc:creator>


		<dc:subject>ouvert_rubrique</dc:subject>

		<description>Full papers up to 8 pages should be formatted according to LREC 2010 guidelines and be submitted through the online submission form on START. The templates for paper are published on the LREC web site When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this new initiative, please refer (...)

-
&lt;a href="http://workshops.elda.org/ltpd2012/-Call-for-Papers-" rel="directory"&gt;Submissions&lt;/a&gt;

/ 
&lt;a href="http://workshops.elda.org/ltpd2012/+-ouvert_rubrique,5-+" rel="tag"&gt;ouvert_rubrique&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;Full papers up to 8 pages should be formatted according to LREC 2010 guidelines and be submitted through the &lt;a href='https://www.softconf.com/lrec2012/PATENT2012/' class='spip_out' rel='external'&gt;online submission form&lt;/a&gt; on START.&lt;/p&gt; &lt;p&gt;The templates for paper are published on the &lt;a href='http://www.lrec-conf.org/lrec2010/?Author-s-Kit-and-Templates' class='spip_out' rel='external'&gt;LREC web site&lt;/a&gt;&lt;/p&gt; &lt;p&gt;When submitting a paper through the START page, authors will be kindly asked to provide relevant information about the resources that have been used for the work described in their paper or that are the outcome of their research. For further information on this new initiative, please refer to &lt;a href='http://www.lrec-conf.org/lrec2012/?LRE-Map-2012' class='spip_out' rel='external'&gt;http://www.lrec-conf.org/lrec2012/?LRE-Map-2012&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Important Dates</title>
		<link>http://workshops.elda.org/ltpd2012/Important-Dates,3</link>
		<guid isPermaLink="true">http://workshops.elda.org/ltpd2012/Important-Dates,3</guid>
		<dc:date>2009-12-17T15:25:59Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>WSPP</dc:creator>


		<dc:subject>ouvert_rubrique</dc:subject>

		<description>Extended deadline for submission: Friday 2 March 2012 Notification of acceptance: Friday 23 March 2012 Final version due: Friday 30 March 2012 Workshop : 27 May 2012 (afternoon)

-
&lt;a href="http://workshops.elda.org/ltpd2012/-Important-Dates-" rel="directory"&gt;Important Dates&lt;/a&gt;

/ 
&lt;a href="http://workshops.elda.org/ltpd2012/+-ouvert_rubrique,5-+" rel="tag"&gt;ouvert_rubrique&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;&lt;span style=&quot;color:red;&quot;&gt;Extended deadline for submission: Friday 2 March 2012&lt;/span&gt;&lt;/p&gt; &lt;p&gt;Notification of acceptance: Friday 23 March 2012&lt;/p&gt; &lt;p&gt;Final version due: Friday 30 March 2012&lt;/p&gt; &lt;p&gt;Workshop : 27 May 2012 (afternoon)&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Committees</title>
		<link>http://workshops.elda.org/ltpd2012/Committees,2</link>
		<guid isPermaLink="true">http://workshops.elda.org/ltpd2012/Committees,2</guid>
		<dc:date>2009-12-17T15:24:07Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>WSPP</dc:creator>


		<dc:subject>ouvert_rubrique</dc:subject>

		<description>Organizing Committee Olivier Hamon (ELDA &#8211; Evaluations and Language resources Distribution Agency, France) John Tinsley (PLUTO - Patent Language Translations Online, Ireland) Heidi Depraetere (Crosslang, Belgium) &lt;p&gt; &lt;/p&gt; Programme Committee Victoria Arranz (ELDA &#8211; Evaluations and Language resources Distribution Agency, France) Karim Boudhamane (DGA, France) Alexandru Ceasusu (PLUTO - Patent Language Translations Online, Ireland) Khalid Choukri (ELDA, France) Terumasa Ehara (Yamanashi (...)

-
&lt;a href="http://workshops.elda.org/ltpd2012/-Committees-" rel="directory"&gt;Committees&lt;/a&gt;

/ 
&lt;a href="http://workshops.elda.org/ltpd2012/+-ouvert_rubrique,5-+" rel="tag"&gt;ouvert_rubrique&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;div class=&quot;cs_sommaire cs_sommaire_avec_fond&quot; id=&quot;outil_sommaire&quot;&gt; &lt;div class=&quot;cs_sommaire_inner&quot;&gt; &lt;div class=&quot;cs_sommaire_titre_avec_fond&quot;&gt; Table of contents &lt;/div&gt; &lt;ul&gt; &lt;li&gt;&lt;a title=&quot;Organizing Committee&quot; href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend#outil_sommaire_0&quot;&gt;Organizing Committee&lt;/a&gt;, p1&lt;/li&gt;&lt;li&gt;&lt;a title=&quot;Programme Committee&quot; href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend&amp;artpage=2-2#outil_sommaire_1&quot;&gt;Programme Committee&lt;/a&gt;, p2&lt;/li&gt; &lt;/ul&gt; &lt;/div&gt; &lt;/div&gt;&lt;div id='decoupe_haut' class='pagination decoupe_haut'&gt;
&lt;img class=&quot;no_image_filtrer&quot; alt=&quot;Previous page&quot; title=&quot;Previous page&quot; src=&quot;http://workshops.elda.org/ltpd2012/plugins/auto/couteau_suisse/img/decoupe/precedent_off.gif&quot;/&gt; &lt;span class=&quot;cs_pagination_off&quot;&gt;1&lt;/span&gt; &lt;a title=&quot;Page 2: Programme Committee&quot; href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend&amp;artpage=2-2&quot;&gt;2&lt;/a&gt; &lt;a href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend&amp;artpage=2-2&quot;&gt;&lt;img class=&quot;no_image_filtrer&quot; alt=&quot;Next page&quot; title=&quot;Next page&quot; src=&quot;http://workshops.elda.org/ltpd2012/plugins/auto/couteau_suisse/img/decoupe/suivant.gif&quot;/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;h3 class=&quot;spip&quot; id=&quot;outil_sommaire_0&quot;&gt;&lt;a title=&quot;Table of contents&quot; href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend#outil_sommaire&quot; class=&quot;sommaire_ancre&quot;&gt; &lt;/a&gt;Organizing Committee&lt;/h3&gt;
&lt;ul class=&quot;spip&quot;&gt;&lt;li&gt; Olivier Hamon (ELDA &#8211; Evaluations and Language resources Distribution Agency, France)&lt;/li&gt;&lt;li&gt; John Tinsley (PLUTO - Patent Language Translations Online, Ireland)&lt;/li&gt;&lt;li&gt; Heidi Depraetere (Crosslang, Belgium)&lt;/li&gt;&lt;/ul&gt;&lt;div id='decoupe_bas' class='pagination decoupe_bas'&gt;
&lt;img class=&quot;no_image_filtrer&quot; alt=&quot;Previous page&quot; title=&quot;Previous page&quot; src=&quot;http://workshops.elda.org/ltpd2012/plugins/auto/couteau_suisse/img/decoupe/precedent_off.gif&quot;/&gt; &lt;span class=&quot;cs_pagination_off&quot;&gt;1&lt;/span&gt; &lt;a title=&quot;Page 2: Programme Committee&quot; href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend&amp;artpage=2-2&quot;&gt;2&lt;/a&gt; &lt;a href=&quot;http://workshops.elda.org/ltpd2012/spip.php?page=backend&amp;artpage=2-2&quot;&gt;&lt;img class=&quot;no_image_filtrer&quot; alt=&quot;Next page&quot; title=&quot;Next page&quot; src=&quot;http://workshops.elda.org/ltpd2012/plugins/auto/couteau_suisse/img/decoupe/suivant.gif&quot;/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;/div&gt;
		
		</content:encoded>


		

	</item>
<item xml:lang="en">
		<title>Contact</title>
		<link>http://workshops.elda.org/ltpd2012/Contact,1</link>
		<guid isPermaLink="true">http://workshops.elda.org/ltpd2012/Contact,1</guid>
		<dc:date>2009-12-17T14:33:35Z</dc:date>
		<dc:format>text/html</dc:format>
		<dc:language>en</dc:language>
		<dc:creator>WSPP</dc:creator>


		<dc:subject>ouvert_rubrique</dc:subject>

		<description>For further queries, please contact Olivier Hamon at hamon_at_elda_dot_org.

-
&lt;a href="http://workshops.elda.org/ltpd2012/-Contact-" rel="directory"&gt;Contact&lt;/a&gt;

/ 
&lt;a href="http://workshops.elda.org/ltpd2012/+-ouvert_rubrique,5-+" rel="tag"&gt;ouvert_rubrique&lt;/a&gt;

		</description>


 <content:encoded>&lt;div class='rss_texte'&gt;&lt;p&gt;For further queries, please contact Olivier Hamon at hamon_at_elda_dot_org.&lt;/p&gt;&lt;/div&gt;
		
		</content:encoded>


		

	</item>



</channel>

</rss>
