The Null Device

Posts matching tags 'cs'

2010/5/21

Three Israeli computer scientists have developed an algorithm for detecting sarcasm in online comments. Named SASI (Semi-Supervised Algorithm for Sarcasm Identification), the algorithm looks at linguistic features of the sentences to guess whether or not they're sarcastic. It was developed with Amazon product reviews as training data. Potential applications of such algorithms could include systems that gauge the positivity or negativity of public opinion about a subject from online discussions.

ai cs natural language processing sarcasm [no comments] Share

2010/2/23

Jazari is essentially an automated, electromechanical percussion ensemble, controlled using two Nintendo Wii controllers. It consists of a MacBook, a bunch of Arduino boards and a room full of drums fitted with solenoids and motors, and software written in MAX and Java which parses input from the Wii controls and plays the drums. The software is also capable of improvising with the human operator, by imitating, riffing off and mutating what he plays.

Jazari was developed by a guy named Patrick Flanagan, who had been playing around with algorithmic composition, only to discover that people don't want to hear about algorithms, but do want to see a good live show. Anyway, here there are two videos: one of a Jazari performance (think robot samba float, conducted by a guy waving Wiimotes around; the music has a distinctly Afro-Brazilian feel to it), and one of Flanagan explaining how it works.

(via Lef) ai awesome computer music cs music robots [no comments] Share

2010/2/18

Artificial Flight and Other Myths (a reasoned examination of A.F. by top birds), a satire of arguments against the possibility of strong artificial intelligence:

Current A.F. is limited to unpowered gliding; a technical marvel, but nowhere near the sophistication of a bird. Gliding simplifies our lives, and no bird (including myself) would discourage advancing this field, but it is a far cry from synthesizing the millions of cells within the wing alone to achieve Strong A.F. Strong A.F., as it is defined by researchers, is any artificial flier that is capable of passing the Tern Test (developed by A.F. pioneer Alan Tern), which involves convincing an average bird that the artificial flier is in fact a flying bird.
There are religious birds who believe God made Bird in His own image, and while I do not share in most of these beliefs, I do think there’s something to be said about the motivation behind creating Strong A.F. Perhaps, as we are the only creatures on Earth capable of flight, we want to push forward past our current capabilities, perhaps even augmenting our own flying capacities if independent A.F. is an impossibility. This could be interpreted as noble, but I would argue that there’s very little utility in replicating what nature has essentially perfected. Why spend millions on an artificial flier when there are so many birds out of work?

(via Boing Boing) ai cs satire [3 comments] Share

2009/6/19

The editor-in-chief of a commercial academic journal has resigned after the journal accepted for publication a nonsensical, computer-generated article:

Bentham confirmed receipt of my submission the very next day (January 30, 2009). Nearly four months later, I received a response — the article was accepted. The acceptance letter read:

"This is to inform you that your submitted article has been accepted for publication after peer-reviewing process in TOISCIJ. I would be highly grateful to you if you please fill and sign the attached fee form and covering letter and send them back via email as soon as possible to avoid further delay in publication."

The letter was written by a Ms. Sana Mokarram, the Assistant Manager of Publication. She included a fee schedule and confirmation that I would pay US$800, to be sent to a post office box in the SAIF Zone, a tax-free complex in the United Arab Emirates.

The journal, "The Open Information Science Journal", is published by a company named Bentham, out of an office in a tax-free zone in the United Arab Emirates, and charges authors to publish papers, whilst making the journals freely available. The ostensible difference between this and a vanity publisher is that TOISCIJ ostensibly subjects its submissions to a peer review process, thus ensuring that, for example, a charlatan couldn't burnish their credentials merely by writing a cheque. Unfortunately, it appears that the peer review process seems to resemble the papers sitting in a pile for a few months; consequently, those who have had papers published in the journal have probably wasted US$800 in doing so.

The paper in question ("Deconstructing Access Points", by "David Phillips" and "Andrew Kent" of the "Center for Research in Applied Phrenology"), incidentally, may be downloaded here. It contains howlers such as:

Our implementation of our methodology is pseudorandom, wearable, and collaborative. We have not yet implemented the centralized logging facility, as this is the least private component of our method.
Gaussian electromagnetic disturbances in our mobile telephones caused unstable experimental results. Note that vacuum tubes have less jagged effective floppy disk throughput curves than do autogenerated robots.

cs détournement pranks pseudoscience random text scams [1 comment] Share

2009/1/12

Germany's Potsdam University is offering its computer science master's degree students a new subject: a practical course in flirting skills, ostensibly designed to improve students' social skills and ability to operate in the real world:

The 440 students enrolled in the master's degree course will learn how to write flirtatious text messages and emails, impress people at parties and cope with rejection.
Philip von Senftleben, an author and radio presenter who will teach the course, summed up his job as teaching how to "get someone else's heart beating fast while yours stays calm."
(Ah, those Germans: they even make the sport of love sound like a duelling society...)

Of course, the idea of flirting courses for compulsively-systematising geeks is not a new one, though they have usually consisted of sneaky hacks for getting laid, typically boiling down to embedding subliminal messages in one's speech, surreptitiously pointing to one's crotch and going to bars wearing ridiculously flamboyant boots and a LED belt buckle to get attention. It's not clear whether the Potsdam course will follow in this vein.

(via /.) cs education geek germany sex society [no comments] Share

2009/1/8

The latest computer science research being funded by the US military includes a bot that will impersonate parents in the service and talk to their children when they're off fighting wars:

The challenge is to design an application that would would allow a child to receive comfort from being able to have simple, virtual conversations with a parent who is not aivailable "in-person". We are looking for innovative applications that explore and harness the power of “advanced” interactive multimedia computer technologies to produce compelling interactive dialogue between a Service member and their families via a pc- or web-based application using video footage or high-resolution 3-D rendering. The child should be able to have a simulated conversation with a parent about generic, everyday topics. For instance, a child may get a response from saying "I love you", or "I miss you", or "Good night mommy/daddy." This is a technologically challenging application because it relies on the ability to have convincing voice-recognition, artificial intelligence, and the ability to easily and inexpensively develop a customized application tailored to a specific parent.

(via Boing Boing) ai bizarre cs military usa wtf [1 comment] Share

2008/12/10

The latest experimental technology to emerge from Google's labs is something called Native Client. This is an experimental means of running web content in native machine code in a web browser. It's X86-specific (so users of PowerPC Macs and the numerous ARM-based portable devices are out of luck here), though other than that, completely portable; the binaries are in a special format, and get a limited number of system calls standardised across Linux, OSX and Windows. There is even a version of Quake which will run in a browser, in any of these systems, should you have the plugin enabled.

Of course, by now, you're probably thinking "Are they crazy? That's the worst idea since nuclear-powered airliners". Google, though, claim that they have a robust security model. The instruction set available is restricted, with constraints placed on the format of the code, allowing a code inspection process to detect any dangerous instructions. Google argue their case in a research paper; I'm not sufficiently familiar with recent x86 assembly language to verify their claims, but it looks like they certainly put some thought into it. Of course, there are a lot of very bright people in places like Russia, Romania and China who would also put a lot of thought into it, to entirely different ends, so there are reasons to be concerned.

Why are Google doing this? Well, firstly, it must be said that this is an experimental project, and not a finished product. However, I doubt that this is to allow better animated web ads. For most user interface content (video, animation, &c.), Flash and such are sufficient. Where this is a big win is in CPU-intensive processing tasks, which are too expensive to do in JavaScript (even if compiled to native code through Google Chrome's V8 just-in-time compiler) or Flash. (At the moment, with fast machines, one can just about do audio synthesis/processing like Hobnox Audiotool in ActionScript; however, this is quite expensive in terms of resources.)

Where something like Native Client would really come in useful would be for coding web-based applications that do real heavy lifting without handing tasks off to a well-resourced server or relying on them being coded into Flash; for example, image editing or video editing as a web service. Google's paper presents a diagram of how such services would look; the front end would be written in JavaScript and/or Flash, whilst the native x86 code would sit in a separate, sandboxed Native Client process, performing the gruntwork on demand: rendering graphics, processing video, synthesising sound, animating the exploding heads of zombies or whatever is required. C/C++, in this case, is kept firmly under the stairs, with the UI code being left to higher-level languages.

Of course, such an idea opens all sorts of strategic possibilities for Google; if it works, it would reduce the desktop operating system to a commodity. If any kind of application can be used as a web service, why buy a copy of Windows (or a PC with the Microsoft Tax in the price)? In fact, why bother installing a full-scale Linux? They're already starting to make PCs with cut-down instant-on operating systems (typically Linux-based) in the ROM, so that if you can't wait for your Vista box to finish booting, you can boot into the instant OS and get a web browser. Now, imagine a box like this, only with the OS being able to run web apps at native speed, perhaps in an application-oriented browser like Chrome. Could this be the much talked about "Google OS"?

cs google native client security tech web [1 comment] Share

2008/11/30

This is pretty cool: a researcher at Adobe has developed software for manipulating moving objects in video in real time. The software works by tracking points and assigning them to objects, and as the demo shows, allows, among other things, moving objects to be automatically "graffitied" with text or annotated with automatically moving speech/thought bubbles. It also maps the degrees of freedom of a point in an object, and by clicking and dragging on it, automatically moves the video to a frame which has that point going in the desired direction, allowing video to be "scrubbed" by dragging moving objects. He even has it compositing frames of video with different objects (people, in this case) at different positions.

(via Engadget) adobe cs nifty research video [no comments] Share

2008/10/10

Academic paper of the day: The Dining Freemasons, or a look at the mechanics and problems unique to secret societies from the perspective of (computer) security protocols:

To a first approximation, a secret society has three functions:
  • to recruit the worthy,
  • to pass on a secret doctrine,
  • and to reward its members.
Each area presents intriguing challenges, but crucial to each aspect is membership testing – society members must be able to identify each other in order to pass on the doctrine, to confer rewards and to consider new applicants.
The paper talks about steganographic broadcasts (i.e., transmitting your affiliation in a coded form; the drawing of a fish by early Christians is one famous example), plausible deniability, and suggests various protocols using the semantic meanings of bodies of knowledge known to the society, including coding challenges and responses (or even small amounts of information) in the truth value of statements about the shared text.

Also from the same authors: A Pact With The Devil, or a hypothetical outline of how a genuinely nasty form of malware could use various forms of persuasion and blackmail to spread itself.

(via hairyears) cs deception secret societies security [no comments] Share

2008/9/2

Google are apparently working on a new web browser. Named Chrome, the browser is designed more like an operating system which happens to be based on web technologies like DOM and JavaScript than a traditional browser; different pages are separate processes (and about time, too) and privileges are compartmentalised to fortify security. Meanwhile, the web rendering implementation is based on WebKit (the Apple/KDE open-source web engine), with a JIT-compiling JavaScript engine optimised for application from a Danish company named V8 (I wonder how it compares to Apple's Squirrelfish and Mozilla's new engine). Alas, there's no code available (and the URL given returns a 404); instead, Google have given us a beautifully drawn 38-page comic by Scott McCloud, illuminating the technical innovations and the reasons for them, in great detail and with no small amount of humour:

That's all the detail that seems to exist so far. There is a possibility that it's just an elaborate feint; Google could, in theory, have paid McCloud some huge sum to draw a comic to specification, peppered with technical versimilitude, purely in order to send Microsoft/Apple/Yahoo!/whoever's development teams on a wild goose chase. Though I suspect that there is an actual product there. For one, Google are known to use WebKit on Android. More importantly, though, a browser designed as a web application operating system (with the expectations of performance and stability that implies), rather than an information viewer with programmability grafted on as an afterthought (as is the case with current browsers), would line up rather nicely with Google's strategy to make the web into a first-class application platform.

There are no details on what platforms Chrome will run; it is open-source (and other projects, or those willing to fork those, will probably have a field day with this), and the comic does mention Windows in one place, so presumably a Windows version is planned. I'm guessing that Google aren't doing this to help Microsoft sell Windows licences, though, so presumably this is not the only version planned. A Linux desktop version, running on top of X, is probably likely. Another possibility is it running over something lighter than the average Linux desktop, making a robust web-browsing appliance on which the browser meets the conventional definitions of an operating system; either Android or some other lightweight OS.

The other option, of course, is that this is an elaborate hoax, akin to the Photoshopped "spy photos" of new Apple Mac tablets and other fantastic gear that are a regular feature of gadget blogs. The fact that Google's Chrome page doesn't yet exist (at time of writing) does suggest this possibility. Though this would imply that the hoaxers had an enormous amount of time on their hands, excellent comic drawing skills and an uncanny mastery of the drawing style of Scott McCloud.

Update: Google have confirmed Chrome. It's initially a Windows product (presumably to win market share before IE8 comes along and shuts off Google's oxygen with its advertising cookie blocker), though Mac and Linux versions are in the works. The Windows version will apparently be out tomorrow.

(via /.) chrome comics cs design google javascript scott mccloud tech vaporware web [3 comments] Share

2008/7/30

Researchers at Columbia University have developed a new system for protecting the privacy of people who appear in photographs. Rather than blurring or erasing faces, the new software replaces facial features with others from a library:

The software randomly selects 33,000 photos of faces from picture-sharing sites like Flickr.com, then picks the most suitable faces for each person in shot. Only the eyes, nose and mouth are used, resulting in a composite image of the two people. "It matches subject pose, lighting conditions and image resolution," says Kumar. "The selected faces are aligned to common 3D coordinates, corrected for colour and lighting, and blended into the target image."
Aside from Street View, the system could be used to obscure the faces of military personnel or eyewitnesses to crime. It could also allow amateur photographers to improve group shots, by replacing frowning faces with better photos of the same people.

(via Boing Boing) anonymity cs photography privacy research tech [no comments] Share

2008/3/14

This looks really impressive. It seems that Celemony (the makers of pitch/time-correcting sound editor Melodyne) have cracked one of the hard problems of digital audio processing: how to extract and modify individual notes in recorded chords. The video demonstrates this technique being used to transpose and re-edit recorded guitar chords as if they were MIDI scores, or even to play chords on a MIDI keyboard and have them played out using a sample. Which looks amazing, though, alas, it won't be with us until autumn (in the northern hemisphere).

audio celemony melodyne cs dsp music tech want [2 comments] Share

2007/11/5

Another use for all those neatly tagged photos of tourist attractions on websites like Flickr: synthesising accurate 3D models from millions of photos, taken by amateurs from random angles:

To make the 3D digital model, the researchers first download photos of a landmark. For instance, they might download the roughly 60,000 pictures on Flickr that are tagged with the words "Statue of Liberty." The computer finds photos that it will be able to use in the reconstruction and discards pictures that are of low quality or have obstructions. Photo Tourism, a tool developed at the UW, then calculates where each person was standing when he or she took the photo. By comparing two photos of the same object that were taken from slightly different perspectives, the software applies principles of computer vision to figure out the distance to each point.
"We don't quite get the accuracy of a laser scanner, but we're in the ballpark," Seitz said. The recreations of Notre Dame show individual figures carved into the stone facade. A model of The Duomo in Pisa, Italy, a building about 160 feet tall, is accurate to within a few inches. The resolution of the 3D model mostly depends on the resolution of the original photos.
The next step in the research will be to create a detailed 3D model of a city entirely from automatically sorted photos from the internet; Rome has been chosen as the city to thus recreate.

(via /.) cs flickr graphics machine vision tech [no comments] Share

2007/4/26

Science News has an article about recent advances in computer music processing. There has been success in creating software which understands recorded music, to the point of being able to extract note information from a (polyphonic, multitimbral, acoustically imperfect) recording. This has been achieved not by programming in rules of musical theory but by using machine learning techniques, setting up a learning system and training it from examples to infer its own rules of music:

He started with a program that had no information about how music works. He then fed into his computer 92 recordings of piano music and their scores. Each recording and score had been broken into 100-millisecond bits so that the computer program could associate the sounds with the written notes. Within those selections, the computer would receive an A note, for example, in the varying contexts in which it occurred in the music. The software could then search out the statistical similarities among all the provided examples of A.
In the process, the system indirectly figured out rules of music. For example, it found that an A is often played simultaneously with an E but seldom with an A-sharp, even though the researchers themselves never programmed in that information. Ellis says that his program can take advantage of that subtle pattern and many others, including some that people may not be aware of.
The software thus developed got impressively good results in music transcription tests (68% accuracy, with the runner-up, a traditional rule-based system, getting 47%). There are numerous applications of such a technology, from automated accompanyists to "musical spellcheckers" to ways of "decompiling" recordings to a score:
Score-alignment programs could be used after a musician records a piece of music to do the kind of fine-tuning that's now performed painstakingly by recording studios, fixing such problems as notes that are slightly off pitch or come in late. "It'll be kind of like a spell-check for music," says Roger Dannenberg, a computer scientist at Carnegie Mellon University in Pittsburgh who is developing the technology.
Christopher Raphael begins the third movement of a Mozart oboe quartet. As his oboe sounds its second note, his three fellow musicians come in right on cue. Later, he slows down and embellishes with a trill, and the other players stay right with him. His accompanists don't complain or tire when he practices a passage over and over. And when he's done, he switches them off.
Not everybody's happy with this, though; musicians' unions, which have opposed "virtual orchestras", are about as keen on it as buggy-whip manufacturers were on the automobile.

(via Boing Boing) ai computer music cs music tech [no comments] Share