Philological Sciences / 7. Language, speech, speech communication
Cand. Sci. (Philology) G. F. Dyachenko, Z. I. Kashirina, S. L. Mykhailiuk
Odesa National Polytechnic University
DESCRIPTION OF THE LIST OF WORD CLASSES OF THE TEXT CORPUS "ACOUSTICS AND ULTRASONIC TECHNOLOGY" (A&UT)
This paper describes research carried out within one of the most productive trends in linguistics, corpus linguistics, and in particular its theoretical aspect.
At present, many scholars hold that the sole task of corpus linguistics is to compile text corpora and store them. However, modern corpus linguistics is not limited to creating corpora: it presupposes fundamental research on languages based on corpora, including corpus-based studies of grammar and vocabulary [5].
Any grammatical analysis usually begins with defining the object of research. Here the object is a particular part of speech (or class of words) recorded in text corpora of different functional styles. The paper is therefore devoted to dividing the whole vocabulary of a text corpus into groups, that is, parts of speech or classes of words. The classes of words were analyzed not only from the grammatical point of view but also in terms of their lexical features.
The material was drawn from the text corpus of one sublanguage of scientific communication, "Acoustics and Ultrasonic Technology" (A&UT). The corpus is based on articles taken from scientific periodicals of the USA and the UK: IEEE International Conference on Acoustics, Speech, and Signal Processing; The Journal of the Acoustical Society of America; Acoustics Letters; Journal of the Audio Engineering Society; Acustica.
Distributing lexical units into classes of words (parts of speech) gives a realistic picture of how the classes correlate in the sublanguage under investigation. The lexical units of the A&UT text corpus were assigned to classes of words (see the table) on the basis of the experience of researchers who have classified words by parts of speech, the distribution of words [7], the frequency (F) of word occurrence in certain positions, and the nature of texts in this subject domain.
Distribution of lexical units in the text corpus "Acoustics and Ultrasonic Technology" into the classes of words

| No. | Classes of words | Number of word usages | Number of words | Text coverage, % |
|---|---|---|---|---|
| 1. | Nouns | 46 561 | 876 | 23.28 |
| 2. | Verbs | 34 685 | 465 | 17.34 |
| 3. | Prepositions | 23 555 | 30 | 11.78 |
| 4. | Articles | 23 110 | 2 | 11.56 |
| 5. | Adjectives | 19 534 | 413 | 9.77 |
| 6. | Adverbs | 11 438 | 171 | 5.72 |
| 7. | Conjunctions | 9 208 | 17 | 4.6 |
| 8. | Pronouns | 7 009 | 20 | 3.5 |
| 9. | Particles | 2 352 | 2 | 1.18 |
| 10. | Numerals | 549 | 4 | 0.27 |
| | Total | 178 001 | 2000 | 89 |
It follows from the table that 10 parts of speech function in the "Acoustics and Ultrasonic Technology" text corpus.
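The figures in the table can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the paper does not state the total size of the corpus, so the value of 200,000 word usages is an assumption, chosen here because it reproduces the published percentages. The names `TABLE`, `ASSUMED_CORPUS_SIZE`, and `coverage` are hypothetical.

```python
# Word usages and different words per class, copied from the table.
TABLE = {
    "Nouns": (46561, 876),
    "Verbs": (34685, 465),
    "Prepositions": (23555, 30),
    "Articles": (23110, 2),
    "Adjectives": (19534, 413),
    "Adverbs": (11438, 171),
    "Conjunctions": (9208, 17),
    "Pronouns": (7009, 20),
    "Particles": (2352, 2),
    "Numerals": (549, 4),
}

# ASSUMPTION: the corpus size is not given in the paper; 200,000 word usages
# is inferred because it is consistent with the published percentages.
ASSUMED_CORPUS_SIZE = 200_000


def coverage(usages: int, corpus_size: int = ASSUMED_CORPUS_SIZE) -> float:
    """Share of the corpus covered by a class, in percent."""
    return 100.0 * usages / corpus_size


total_usages = sum(usages for usages, _ in TABLE.values())
total_words = sum(words for _, words in TABLE.values())
coverage_by_class = {name: coverage(usages) for name, (usages, _) in TABLE.items()}

if __name__ == "__main__":
    for name, share in coverage_by_class.items():
        print(f"{name:<13} {share:6.2f} %")
    print(f"Total: {total_usages} usages, {total_words} words, "
          f"{coverage(total_usages):.2f} % of the texts")
```

Under this assumption the ten classes sum to 178 001 word usages and 2000 different words, about 89% of the texts, and particles come out at roughly 1.18%, the figure given for them in the text.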
Nouns give the greatest coverage of the sampled texts: they account for the largest number of both word usages and different words. In assigning nouns to a separate class, the main feature of the class was taken into account, namely that a noun designates an object (substance) in the wide sense of the word. This includes the names of individual objects: array (F=1082), paper (F=559), system (F=410), antenna (F=384), transducer (F=348), plate (F=342), etc.; abstract concepts: frequency (F=1102), signal (F=899), response (F=489), noise (F=428), function (F=421), source (F=341), etc.; and the names of different substances: aluminum (F=35), lead (F=31), air (F=26), carbon (F=20), epoxide (F=13), rubber (F=12), etc.
Verbs, which cover 17.34% of all word usages in the sampled texts, rank second both in the number of word usages (34 685 units) and in the number of different words (465 units). The common feature of all verbs is that they designate a process (an action or a state).
In the A&UT sublanguage, 30 prepositions were registered: of (F=8088), in (F=2934), for (F=2275), to (F=1786), by (F=1652), at (F=1207), with (F=1195), from (F=1164), on (F=946), over (F=924), between (F=272), above (F=110), etc. By number of word usages (23 555 units), prepositions rank third in the table and cover 11.78% of the sampled texts. Prepositions were distinguished by the feature of expressing a relation to a noun or a substantive pronoun. They are mainly prepositions of place and direction (in, to, at, from, on, between, within, out, under, etc.) and prepositions expressing grammatical relations within the sentences of the text (of, by, with, for).
The data in the table show that articles account for a considerably high share of the sampled texts in the A&UT specialty, 11.56%, although the class contains only two different words. Their absolute frequencies are as follows: the (F=16 883), a/an (F=6 227). Articles were grouped into a separate class as special determiner words used before nouns and substantive word combinations.
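The contrast between a class's share of running text and its size in the vocabulary can be made explicit by dividing the number of word usages by the number of different words. This ratio is not reported in the paper; it is introduced here only as an illustration, using the figures from the table, and the names `COUNTS` and `mean_usages_per_word` are hypothetical.

```python
# (word usages, different words) per class, copied from the table.
COUNTS = {
    "Nouns": (46561, 876),
    "Verbs": (34685, 465),
    "Prepositions": (23555, 30),
    "Articles": (23110, 2),
    "Adjectives": (19534, 413),
    "Adverbs": (11438, 171),
    "Conjunctions": (9208, 17),
    "Pronouns": (7009, 20),
    "Particles": (2352, 2),
    "Numerals": (549, 4),
}


def mean_usages_per_word(usages: int, words: int) -> float:
    """Average number of occurrences of one word of the class."""
    return usages / words


ratios = {name: mean_usages_per_word(u, w) for name, (u, w) in COUNTS.items()}

# Classes ordered from the most to the least frequently repeated words.
ranked = sorted(ratios.items(), key=lambda item: item[1], reverse=True)

if __name__ == "__main__":
    for name, ratio in ranked:
        print(f"{name:<13} {ratio:10.1f}")
```

With these figures each of the two articles occurs 11 555 times on average, whereas a noun occurs about 53 times, which is why a class of two words can cover more than a tenth of the texts.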
The next class of words investigated is adjectives, which rank fifth in frequency of use. The A&UT text corpus contains 413 adjectives, which cover 9.77% of the texts of this subject domain. Units of this part of speech fall into semantic groups such as the following. Size: large (F=233), high (F=203), low (F=195), mean (F=127), average (F=67), etc. Material that objects are made from: polynomial (F=46), synthetic (F=39), ceramic (F=32), piezoceramic (F=12). Location in space: near (F=69), upper (F=62), far (F=40), inside (F=40), internal (F=31). Properties of objects and phenomena: acoustic (F=420), plane (F=170), experimental (F=161), complex (F=106), constant (F=98), general (F=84), etc.
By structure, adjectives in the sampled texts can be simple units, for example, large, high, low, great, single, bright, etc.; derivatives, for example, subjective, electronic, hydrostatic, representative, temporal, etc.; or compounds, for example, one-dimensional, right-hand, time-dependent, light-weight, two-component, etc.
In sampling adverbs, the feature taken into account was that they denote a characteristic of an action or the circumstances in which an action takes place. The class of adverbs is represented by 171 units, which cover 5.72% of all the A&UT texts investigated. Adverbs are another class of words that were grouped by lexical meaning, since they depend on the verbs they combine with. Thus, the following groups of adverbs were distinguished: of time, for example, then (F=279), further (F=109), now (F=74), before (F=44), etc.; of direction, for example, outside (F=530), directly (F=50); of place, for example, there (F=173), here (F=87), closely (F=45), elsewhere (F=19), etc.; of manner, for example, respectively (F=90), approximately (F=61), readily (F=48), relatively (F=42), accordingly (F=32), etc.; of frequency, for example, often (F=29), never (F=18), sometimes (F=18), etc.; of degree, for example, significantly (F=38), considerably (F=23), entirely (F=23), greatly (F=22), etc.
Conjunctions were grouped according to the function they perform in a sentence. They number only 17 units, for example, and (F=4695), as (F=1264), or (F=730), that (F=569), than (F=488), if (F=299), etc. Conjunctions do not perform an independent function in sentences but serve to express relations between words, word combinations, and sentences. Therefore, according to the character of the relations expressed, they are divided into two types: coordinating conjunctions and subordinating ones. Of the 17 conjunctions functioning in the A&UT text corpus, three are coordinating: and, or, while; they connect homogeneous, grammatically equal parts that have the same function. The other fourteen connect subordinate clauses and conjunctional phrases: as, that, than, if, since, very, however, because, then, although, what, whether, though, whenever. Conjunctions cover 4.6% of the sampled text corpus.
Pronouns in the investigated texts form a distinctive class of words comprising units with abstract meanings, for example, this (F=1482), that (F=1245), which (F=985), it (F=930), each (F=376), we (F=267), etc. In our frequency list, pronouns are represented by 20 units, whose share of the A&UT texts is 3.5%. Pronouns were grouped by function: personal (it, we, they); possessive (its); demonstrative (this, that, same, those); attributive (each, all, both, either, another, every); indefinite (any, some, somewhat); relative (which, whose).
Two particles were recorded in the sample: “to” (F=1642), the marker of the infinitive, and the negative “not” (F=710). They cover 1.18% of the sampled text.
Numerals written out in words give the lowest coverage of the sampled text corpus, only 0.27%. They are represented by four words. By their features, all of them are cardinal numerals: two (F=387), three (F=96), four (F=52), hundred (F=14).
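For the three classes whose members are listed in full (articles, particles, and numerals), the per-word frequencies quoted in the text can be summed and compared with the class totals in the table. The sketch below shows one way such a tagged frequency list could be aggregated. The paper does not describe how the authors stored or processed their list, so the tuple layout and the names `FREQUENCY_LIST` and `summarize` are assumptions made for illustration.

```python
from collections import defaultdict

# Entries (word, class, F) quoted in the text for the three classes
# whose members are all listed. The tuple layout is an assumption.
FREQUENCY_LIST = [
    ("the", "Articles", 16883),
    ("a/an", "Articles", 6227),
    ("to", "Particles", 1642),
    ("not", "Particles", 710),
    ("two", "Numerals", 387),
    ("three", "Numerals", 96),
    ("four", "Numerals", 52),
    ("hundred", "Numerals", 14),
]


def summarize(entries):
    """Return {class: (word usages, different words)} for a frequency list."""
    usages = defaultdict(int)
    words = defaultdict(set)
    for word, word_class, frequency in entries:
        usages[word_class] += frequency
        words[word_class].add(word)
    return {name: (usages[name], len(words[name])) for name in usages}


summary = summarize(FREQUENCY_LIST)

if __name__ == "__main__":
    for name, (class_usages, class_words) in summary.items():
        print(f"{name:<10} {class_usages:>6} usages, {class_words} words")
```

The sums obtained this way (23 110 for articles, 2 352 for particles, 549 for numerals) agree with the word-usage counts reported for these classes in the table.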
Thus, the paper not only describes the classes of words found in the A&UT text corpus but also gives their percentage distribution, i.e., it makes it possible to trace the share each class has in the analyzed text corpus.
The data described in this paper are of undoubted linguistic interest. They have been used in a number of studies devoted to the comparative analysis of how different parts of speech function in various text corpora of scientific communication [1-4; 8-11]. Quantitative information about parts of speech has also made it possible to propose new methods of teaching English at non-linguistic (technical) higher education institutions [6].
References
1. Borisenko T. I., Lebedeva E. V., Vorobjova E. V. Determination of first constituent of modal verbal constructions functioning in texts of scientific communication / T. I. Borisenko, E. V. Lebedeva, E. V. Vorobjova // Międzynarodowej naukowi-praktycznej konferencji «Dynamika naukowych badań - 2015». Volume 3. Filologiczne nauki. Psychologia i socjologia. Historia. Muzyka i życie. – Przemyśl: Nauka i studia. – 88 str. – P. 39-43.
2. Nevreva M. N., Dyachenko G. F., Shapa L. N. Genesis of nominal prefix morphemes in English texts of scientific functional style / M. N. Nevreva, G. F. Dyachenko, L. N. Shapa // Journal of Kharkiv National University named after V. N. Karazin. "Romano-germanic philology series. Methods of foreign language teaching". – Kharkiv: KNU. – 2014. – No. 1103. – P. 155-159.
3. Nevreva M. N. Genesis of nominal suffix morphemes in scientific communication texts (on the material of the English sublanguages of Electrical Engineering, Chemical and Process Engineering, and Motor Industry) / M. N. Nevreva, L. E. Tsapenko, M. V. Tsinova // Odessa Linguistic Bulletin. – Odessa: NUOLA, 2014. – No. 4. – P. 332-335.
4. Nevreva M. N., Borisenko T. I., Kapinus E. L. Analysis of suffix morphemes in the low frequency area of probabilistic statistic samples of sublanguages of scientific communication / M. N. Nevreva, T. I. Borisenko, E. L. Kapinus // XI international scientific conference "The Advanced Achievements in European Science – 2015". Volume 9. – Philological sciences. – Republic of Bulgaria, Sofia. – 2015. – P. 59-63 (reg. No. 197820).
5. RAS Bureau Program of Fundamental Research "Corpus Linguistics" [Electronic resource]. – M.: RAS, 2012.
6. Shapa L. N., Dantsevich L. G., Dyachenko G. F. Introducing the results of theoretical research to the process of training the translation of English texts of scientific communication in the technical high schools / L. N. Shapa, L. G. Dantsevich, G. F. Dyachenko // Actual scientific achievements, Czech Republic, reg. No. 196339.
7. Stepanova M. D. Transition of parts of speech and distributive analysis (on the material of modern German) / M. D. Stepanova // Foreign languages at higher school. – M.: Higher school, 1964. – No. 3. – P. 89-95.
8. Tsinovaya M. V. Typology of models with the verb of must at structural semantic level (based on English material of technology sublanguages) / M. V. Tsinovaya // Scientific Bulletin of International Humanitarian University. "Philology" series. – Odesa: IHU. – 2015. – No. 14. – P. 269-272.
9. Tsinovaya M. V. Forms and contents of syntactic constructions with the verb of may/might in texts of scientific communication / M. V. Tsinovaya // Scientific Bulletin of International Humanitarian University. "Philology" series. – Odesa: IHU. – 2015. – No. 14. – P.2. – P. 92-96.
10. Tsinovaya M. V. Influence of extralinguistic factor on formation of modal verbal construction pattern "modal verb + to be + adjective" (on material of texts of scientific and technical discourse) / M. V. Tsinovaya // Odesa Linguistic magazine. – Odesa: ONLA. – 2015. – No. 5. – P. 269-272.
11. Tsapenko L. E., Dyachenko G. F., Shapa L. N. Comparative analysis of the verbal word forms in the texts of scientific communication / L. E. Tsapenko, G. F. Dyachenko, L. N. Shapa // Nauka i studia (Polska).