Computers and Technology

As some of you know well, and others of you may be interested to learn, a number of languages (including chinese and japanese) are written without spaces between the words. consequently, software that works with text written in these languages must address the word segmentation problem--inferring li_kely boundaries between consecutive words in the exercises text. if english were written without spaces, the analogous problem would consist of taking a string like "meetateight" and deciding that the best segmentation is "meet at eight" (and not "me et at eight," or "meet ate ight," or any of a huge number of even less plausible alternatives). how could we automate this process? a simple approach that is at least reasonably effective is to find a segmentation that simply maximizes the cumulative "quality" of its individual constituent words. thus, suppose you axe given a black box that, for any string of letters x = xk, will return a number quality(x). this number can be either positive or negative; larger numbers correspond to more plausible english words. (so quaiity("rne") would be positive, while quality("ght") would be negative.) given a long string of letters y = yly2 ¯" "yn, a segmentation of y is a partition of its letters into contiguous blocks of letters; each block corresponds to a word in the segmentation. the total quality of a segmentation is determined by adding up the qualities of each of its blocks. (so we’d get the right answer above provided that quaiity("rneet") + quality("at") + quality(" eight") was greater than the total quality of any other segmentation of the string.) give an efficient algorithm that takes a string y and computes a segmentation of maximum total quality. (you can treat a single call to the black box computing quality(x) as a single computational step.) (a final note, not necessary for solving the problem: to achieve better performance, word segmentation software in practice works with a more complex formulation of the problem--for example, incorporating the notion that solutions should not only be reasonable at the word level, but also form coherent phrases and sentences. if we consider the example "theyouthevent," there are at least three valid ways to segment this into common english words, but one constitutes a much more coherent phrase than the other two. if we think of this in the terminology of formal languages, this broader problem is like searching for a segmentation that also can be parsed well according to a grammar for the underlying language. but even with these additional criteria and constraints, dynamic programming approaches lie at the heart of a number of successful segmentation systems.)

answer
Answers: 2

Other questions on the subject: Computers and Technology

image
Computers and Technology, 22.06.2019 17:40, math31343
Gabe wants to move text from one document to another document. he should copy the text, paste the text, and open the new document highlight the text, select the cut command, move to the new document, make sure the cursor is in the correct location, and select the paste command select the save as command, navigate to the new document, and click save highlight the text, open the new document, and press ctrl and v
Answers: 1
image
Computers and Technology, 24.06.2019 13:50, jaystarr9395
Write a program that performs a simple n-body simulation, called "jumping leprechauns." this simulation involves n leprechauns, numberd 1 to n. it maintains a gold value g_i for each leprechaun i, which begins with each leprechaun starting out with a million dollars worth of gold, that is, g_i = 1000000 for each i = 1,. in addition, the simulation also maintains, for each leprachaun, i, a place on the horizon, which is represented as a double-precision floating point number, x_i. in each iteration of the simulation, the simulation processes the leprachauns in order. processing a leprachaun i during its iteration begins by computing a new place on the horizon for i, which is determined by the assignment:
Answers: 3
image
Computers and Technology, 24.06.2019 17:00, rosepetals2938
Carlos, an algebra teacher, is creating a series of powerpoint presentations to use during class lectures. after writing, formatting, and stylizing the first presentation, he would like to begin writing the next presentation. he plans to insert all-new content, but he wants to have the same formatting and style as in the first one. what would be the most efficient way for carlos to begin creating the new presentation? going under the file tab and opening the first presentation, deleting all content from each page, and adding new content going under the file tab and clicking on new in the left pane, then choosing new from existing going under the design tab and clicking on themes, then selecting the theme that was used for the first template going under the design tab and opening the template that was created for the first presentation
Answers: 2
image
Computers and Technology, 25.06.2019 04:30, almaromeroo13
Consider a list of full names formatted “firstname lastname”, like ["jules verne", "alexandre dumas", "maurice druon"]. write a list comprehension that produces a list with the full names in the format “lastname, firstname”. the resulting list should look like ['verne, jules', 'dumas, alexandre', 'druon, maurice']. the simplest solution may involve a nested comprehension: [ …. for … in [ … for … in … ]].
Answers: 3
Do you know the correct answer?
As some of you know well, and others of you may be interested to learn, a number of languages (inclu...

Questions in other subjects:

Konu
Mathematics, 12.10.2020 19:01