Free documents: Scientific reports


Scientific report: A logical treatment of semi-free word order and bounded discontinuous

This is simply a restatement of the notion of clause union or scrambling familiar from transformational analyses. Our proposal is to provide a feature-value logic with a rigorous semantics and with sufficient expressive power to allow the encoding of even syntactic structure within the single formalism. This means that the work of encoding syntactic structure is carried by the feature-value logic and not by formal language-theoretic devices (i.e., phrase structure rules).

8/30/2018 3:08:09 AM +00:00

Scientific report: FOCUS AND ACCENT IN A DUTCH TEXT-TO-SPEECH SYSTEM

In this paper we discuss an algorithm for the assignment of pitch accent positions in text-to-speech conversion. The algorithm is closely modeled on current linguistic accounts of accent placement, and assumes a surface syntactic analysis of the input. It comprises a small number of heuristic rules for determining which phrases of a sentence are to be focussed upon; the exact location of a pitch accent within a focussed phrase is determined mainly on the basis of the syntactic relations holding between the elements of the phrase...


Scientific report: User studies and the design of Natural Language Systems

This paper presents a critical discussion of the various approaches that have been used in the evaluation of Natural Language systems. We conclude that previous approaches have neglected to evaluate systems in the context of their use, e.g. solving a task requiring data retrieval. This raises questions about the validity of such approaches. In the second half of the paper, we report a laboratory study using the Wizard of Oz technique to identify NL requirements for carrying out this task. We evaluate the demands that task dialogues collected using this technique place upon a prototype Natural Language system. We...


Scientific report: TENSE GENERATION IN AN INTELLIGENT TUTOR FOR FOREIGN LANGUAGE TEACHING: SOME ISSUES IN THE DESIGN OF THE VERB EXPERT

The paper presents some of the results obtained within a research project aimed at developing ET (English Tutor), an intelligent tutoring system which supports Italian students in learning English verbs. We concentrate on one of the most important modules of the system, the domain (i.e. verb) expert, which is devoted to generating, in a cognitively transparent way, the right tense for the verb(s) appearing in the exercises presented to the student. An example which highlights the main capabilities of the verb expert is provided. A prototype version of ET has been fully implemented. ...


Scientific report: Collocations in Multilingual Generation

We present a proposal for the structuring of collocation knowledge in the lexicon of a multilingual generation system and show to what extent it can be used in the process of lexical selection. This proposal is part of Polygloss, a new research project on multilingual generation, and it has been inspired by work carried out in the SEMSYN project (see e.g. [RÖSNER 1988]). The descriptive approach presented in this proposal is based on a combination of results from recent lexicographical research and the application of Meaning-Text-Theory (MTT) (see e.g. [MEL'CUK et al. 1981], [MEL'CUK et al. 1984])...


Scientific report: LEXICAL ACQUISITION IN THE CORE LANGUAGE ENGINE

The SRI Core Language Engine (CLE) is a general-purpose natural language front end for interactive systems. It translates English expressions into representations of their literal meanings. This paper presents the lexical acquisition component of the CLE, which allows the creation of lexicon entries by users with knowledge of the application domain but not of linguistics or of the detailed workings of the system. It is argued that the need to cater for a wide range of types of back end leads naturally to an approach based on eliciting grammaticality judgments from the user. This approach, which has been used...


Scientific report: It Would Be Much Easier

The paper proposes a paradigmatic approach to morphological knowledge acquisition. It addresses the problem of learning, from examples, rules for word-form analysis and synthesis. These rules, established by generalizing the training data sets, are effectively used by a built-in interpreter which acts consequently as a morphological processor within the architecture of a natural language question-answering system. The PARADIGM system has no a priori knowledge which should restrict it to a particular natural language, but instead builds up the morphological rules...


Scientific report: PLAN REVISION IN PERSON-MACHINE DIALOGUE

Dialogue is a cooperative process in which each speech act of the participants contributes to the overall purpose of the dialogue. Participating in a full dialogue implies understanding at each point of the dialogue session the role of each speech act with respect to the rest of the dialogue. We concentrate in this paper on speech acts that diverge from the straightforward unfolding of the dialogue. Such speech acts represent dialogue deviations.

Example dialogue fragments: "02, U: I'd like to buy a car in two years." "08, S: Ok." "Normally an investment plan comprises four subparts: an emergency fund, for unforeseen expenses; a..."


Scientific report: REMARKS ON PLURAL ANAPHORA

The interpretation of plural anaphora often requires the construction of complex reference objects (RefOs) out of RefOs which were formerly introduced not by plural terms but by a number of singular terms only. Often, several complex RefOs can be constructed, but only one of them is the preferred referent for the plural anaphor in question. As a means of explanation for preferred and non-preferred interpretations of plural anaphora, the concept of a Common Association Basis (CAB) for the potential atomic parts of a complex object is introduced in the following. CABs pose conceptual constraints on the formation of complex...


Scientific report: ENHANCING EXPLANATION COHERENCE WITH RHETORICAL STRATEGIES (MARK T. MAYBURY)

This paper discusses the application of a previously reported theory of explanation rhetoric (Maybury, 1988b) to the task of explaining constraint violations in a hybrid rule/frame-based system for resource allocation (Dawson et al., 1987). This research illustrates how discourse strategies of explanation, textual connectives, and additional justification knowledge can be applied to enhance the cohesiveness, structure, and clarity of knowledge-based system explanations.

INTRODUCTION: Recent work in text generation includes emphasis on producing textual presentations of the explanations of reasoning in knowledge-based systems. ...


Scientific report: Expressing generalizations in unification-based grammar formalisms

This paper shows how higher levels of generalization can be introduced into unification grammars by exploiting methods for typing grammatical objects. We discuss the strategy of using global declarations to limit possible linguistic structures, and sketch a few unusual aspects of our type-checking algorithm. We also describe the sort system we use in our semantic representation language and illustrate the expressive power gained by being able to state global constraints over these sorts. Finally, we briefly illustrate the sort system by applying it to some agreement phenomena and to problems of adjunct resolution.

... recurs in different lexical items, combination...


Scientific report: A rich environment for experimentation with unification grammars

The development of ud arose out of the need to have available a full set of prototyping and development tools for a number of different research projects in computational linguistics, all involving extensive text coverage in... In many respects relational abstractions resemble Prolog procedures.


Scientific report: LAMBEK THEOREM PROVING AND FEATURE UNIFICATION

... it follows as a theorem from a set of axioms and inference rules. Especially through the work of Van Benthem (1986) and Moortgat (1988), this view, which, following Moortgat (1987a), we will call Lambek Theorem Proving (LTP; Lambek, 1958), has become popular among a number of linguists. The descriptive power of LTP can be extended if unification (Shieber, 1986) is added. Several theories have been developed that combine categorial formalisms and unification-based formalisms. Within Unification Categorial Grammar (UCG, Calder et al., 1988, Zeevat et al., 1986) unification is the only operation over grammatical objects (Calder et al. 1988,...


Scientific report: A FORMAL REPRESENTATION OF PROPOSITIONS

The topic of the paper is the introduction of a formalism that permits a homogeneous representation of definite temporal adverbials, temporal quantifications (such as frequency and duration), temporal conjunctions and tenses, and of their combinations with propositions. This unified representation renders it possible to show how these components refer to each other and interact in creating temporal meanings. The formal representation is based on the notions phase-set and phase-operator, and it involves an interval logic. Furthermore, logical connections are used, but the (always troublesome) logical quantifications may be avoided. ...


Scientific report: COMPUTATIONAL SEMANTICS

Although the formalisms normally used for describing the semantics of natural languages are far from computationally tractable, it is possible to isolate particular semantic phenomena and interpret them within simpler formal systems. Quantified mass noun phrases are one such phenomenon. We describe a simple formal system suitable for the interpretation of quantified mass noun phrases. The main issue of this paper is to develop an algorithm for deciding the validity of sentences in the formal system and hence for deciding the validity of natural language inferences where all the involved noun phrases are quantified mass noun phrases. The...


Scientific report: EXTENDED GRAPH UNIFICATION

We propose an apparently minor extension to Kay's (1985) notation for describing directed acyclic graphs (DAGs). The proposed notation permits concise descriptions of phenomena which would otherwise be difficult to describe, without incurring significant extra computational overheads in the process of unification. We illustrate the notation with examples from a categorial description of a fragment of English, and discuss the computational properties of unification of DAGs specified in this way. ...


Scientific report: A MODULAR APPROACH TO STORY GENERATION

One way of characterising texts is in terms of the discourse structures on which they appear to be built. Each type of text, or genre, e.g. the sports report, the recipe, the sermon, the proverb, will have associated with it a characteristic organisation of units. In this paper, a general model of the structure of one text type, the story, is described. This model forms the basis of a program which combines the general story structure principles with rules governing a particular sub-genre, the Old French epic, in order to generate story summaries.

INTRODUCTION: Over the last twelve years,...


Scientific report: SUBJECT ERASING AND PRONOMINALIZATION IN ITALIAN TEXT GENERATION

Certain Romance languages such as Italian, Spanish and Portuguese allow the subject to be erased in tensed clauses. This paper studies subject erasing in the framework of a text generation system for Italian. We will prove that it is first necessary to try to pronominalize the subject. Therefore, we are led to study the synthesis of subject and complement personal pronouns. In Romance languages, personal pronouns raise many syntactic problems, whose solution is complex in a generation system. We will see that pronominalization plays a fundamental role in the order in which the elements of a clause are synthesized,...


Scientific report: AN ALGORITHM FOR GENERATION IN UNIFICATION CATEGORIAL GRAMMAR

We present an algorithm for the generation of sentences from the semantic representations of Unification Categorial Grammar. We discuss a variant of Shieber's semantic monotonicity requirement and its utility in our algorithm. We indicate how the algorithm may be extended to other grammars obeying the same requirement. Appendices contain a full listing of the program and a trace of execution of the algorithm.


Scientific report: Interactive Incremental Chart Parsing

This paper presents an algorithm for incremental chart parsing, outlines how this could be embedded in an interactive parsing system, and discusses why this might be useful. Incremental parsing here means that input is analysed in a piecemeal fashion, in particular allowing arbitrary changes of previous input without exhaustive reanalysis. Interactive parsing means that the analysis process is prompted immediately at the onset of new input, and possibly that the system then may interact with the user in order to resolve problems that occur. ...


Scientific report: FRENCH ORDER WITHOUT ORDER

To account for the semi-free word order of French, Unification Categorial Grammar is extended in two ways. First, verbal valencies are contained in a set rather than in a list. Second, type-raised NPs are described as two-sided functors. The new framework does not overgenerate, i.e., it accepts all and only the sentences which are grammatical. This follows partly from the elimination of false lexical ambiguities (i.e., ambiguities introduced in order to account for all the possible positions a word can occupy within a sentence) and partly from a system of features constraining the possible combinations. ...


Scientific report: THE SYNTACTIC REGULARITY OF ENGLISH NOUN PHRASES

Approximately 10,000 naturally occurring noun phrases taken from the LOB corpus were used, firstly, to evaluate the NP component of the Alvey ANLT grammar (Grover et al., 1987, 1989) and, secondly, to retest Sampson's (1987a) claim that these data provide evidence for the lack of a clear-cut distinction between grammatical and 'deviant' examples. The examples were sorted and classified on the basis of the lexical and syntactic analysis undertaken as part of the LOB corpus project (Sampson, 1987b). ...


Scientific report: A Descriptive Framework for Translating Speaker's Meaning: Towards a Dialogue Translation System between Japanese and English

Classification of utterances concerning speaker's meaning. DPs present background information of language use in order to derive an appropriate expression from speaker's meaning. In Japanese, IFTs can be derived automatically through syntactical constraints. To generate appropriate expressions, language-specific communication strategies related to DP values should be given a priori. The whole process is performed in a unification-based framework.


Scientific report: TRANSLATION BY STRUCTURAL CORRESPONDENCES

We sketch and illustrate an approach to machine translation that exploits the potential of simultaneous correspondences between separate levels of linguistic representation, as formalized in the LFG notion of codescriptions. The approach is illustrated with examples from English, German and French where the source and the target language sentence show noteworthy differences in linguistic analysis.

INTRODUCTION: In this paper we sketch an approach to machine translation that offers several advantages compared to many of the other strategies currently being pursued. ...


Scientific report: A NEW VIEW ON THE PROCESS OF TRANSLATION

In this paper we describe a framework for research into translation that draws on a combination of two existing and independently constructed technologies: an analysis component developed for German by the EUROTRA-D (ET-D) group of IAI and the generation component developed for English by the Penman group at ISI. We present some of the linguistic implications of the research and the promise it bears for furthering understanding of the translation process.


Scientific report: EMPIRICAL STUDIES OF DISCOURSE REPRESENTATIONS FOR NATURAL LANGUAGE INTERFACES

We present the results from a series of experiments aimed at uncovering the discourse structure of man-machine communication in natural language (Wizard of Oz experiments). The results suggest the existence of different classes of dialogue situations, requiring computational discourse representations of varying complexity. Important factors seem to be the number of different permissible tasks in the system and the extent to which the system takes initiative in the dialogue.


Scientific report: AN APPROACH TO SENTENCE-LEVEL ANAPHORA IN MACHINE TRANSLATION

Theoretical research in the area of machine translation usually involves the search for and creation of an appropriate formalism. An important issue in this respect is the way in which the compositionality of translation is to be defined. In this paper, we will introduce the anaphoric component...

The CAT framework (e.g. Arnold et al. 1986) was meant to be an answer to the same question, this time for a transfer system, viz. the Eurotra system. The MiMo formalism is a reaction to the CAT framework and tries to solve several translation problems by formulating an alternative definition of...


Scientific report: SITUATION SEMANTICS AND MACHINE TRANSLATION

Situation Semantics is one of the most recent and controversial theories in formal semantics. Machine Translation (MT) is a highly complex application domain, in which research is expensive in both time and resources. On the surface, the space for interaction between these two fields would seem fairly limited, and in practice the application of formal semantics in MT has been very limited, a notable exception being the Rosetta project (Landsbergen 1982, 1987).


Scientific report: ON FORMALISMS AND ANALYSIS, GENERATION AND SYNTHESIS IN MACHINE TRANSLATION

A formalism is a set of notations with well-defined semantics (namely for the interpretation of the symbols used and their manipulation), by means of which one formally expresses certain domain knowledge, which is to be utilised for specific purposes. In this paper, we are interested in formalisms which are being used or have applications in the domain of machine translation (MT). These can range from specialised languages for linguistic programming (SLLPs) in MT, like ROBRA in the ARIANE system and GRADE in the Mu-system, to linguistic formalisms like those of the Government and Binding theory and the Lexical Functional...


Scientific report: ABSTRACT CONTROL STRUCTURES AND THE SEMANTICS OF QUANTIFIERS

Intuitively, a quantifier is any word or phrase that expresses a meaning that answers one of the questions How many? or How much? Typical English examples include all, no, many, few, some but not many, all but at most a very few, wherever, whoever, whoever there is, and also, it can be argued, only (Keenan, 1971), also (Cushing, 1978b), and the (Chomsky, 1977). In this paper we review an empirically motivated analysis of such meanings (Cushing, 1976; 1982a) and draw out its computational significance. For purposes of illustration, we focus our attention on the meanings expressed by the English...
