• Non ci sono risultati.

IBM  Watson  Technologies

N/A
N/A
Protected

Academic year: 2021

Condividi "IBM  Watson  Technologies"

Copied!
28
0
0

Testo completo

(1)

IBM  Watson  Technologies  

A  workshop  over  6  days  and  9  hours   by  Leo  Liber?  and  Olivier  Wang  

3-­‐4,  10-­‐11,  17-­‐18  Feb  2015   PC17  Ecole  Polytechnique  

(2)

Aims  

Walk  out  of  here  knowing:  

that  this  thing  exists  

•  what  you  can  expect  from  it  

•  more  or  less  how  you  can  use  it  

how  to  find  more  informa?on  about  it  

•  something  about  Natural  Language  Processing  

 

(3)

Organiza?on  of  the  workshop  

•  None  

•  We  don't  know  much  about  it  

– (Olivier  knows  more  than  me)  

•  Most  IBM  people  don't  know  too  much  either  

•  I  also  hope  to  learn  something  

•  Interrupt  whenever  you  wish  

– French  or  English  understood/spoken  

(4)

The  name  

•  Source  of  endless  confusion  within  IBM  

•  TJ  Watson  is  the  name  of  the  first  two  CEOs  

•  IBM  names  most  things  a]er  them  

•  I  worked  2  years  at  the  IBM  TJ  Watson   Research  Labs  (NY,  USA)  

•  Now  they  have  a  Watson  Division  in  NYC  

•  "Watson"  was  born  at  the  TJ  Watson  labs    

(5)

What  is  it?  

•  Difficult  to  answer  

•  Best  answer:  "it's  an  ecosystem"  

•  Whenever  some  Watson  researcher/

engineer  codes  some  new  cool  program,  it   becomes  part  of  that  "ecosystem"  

•  It  is  an  aeempt  to  have  computers  deal   with  natural  language  

(6)

Languages  for  computers  

•  A  programming  language  is  a  formal  language  

•  Formal  languages  have  unambiguous   grammars  

•  Sentences  can  be  automa?cally  verified  to  be   part  of  the  language  or  not  ("syntac?cally  

valid")  

•  Interpreta?ons  ("seman?cs")  of  formal   languages  are  also  unambiguous  

 

(7)

Languages  for  people  

•  The  language  we  use  is  a  natural  language  

syntac?cally  ambiguous   seman?cally  ambiguous  

mul?-­‐media:  communica?on  occurs  by  

•  wri?ng,  speaking  (ueering)  

•  looking,  touching,  moving  

•  even  not  saying  something  some?mes  has  meaning  

•  Hence  the  misunderstandings  

•  Link  between  language  and  emo?ons  

(8)

Natural  Language  Processing  

•  Why  can't  my  mum  work  a  computer?  

She'd  try  and  talk  to  it  using  her  natural  language   The  computer  wouldn't  even  turn  itself  on  

•  Challenge:  can  we  program  a  computer  to  

"understand"  natural  language?  

•  Algorithms  for  "understanding"  natural   language:  NLP  

(9)

Understanding  

•  Can  we  even  define  "understanding"?  

– Func6onal  defini6on:  the  interconnec:ons  of  all   neurons  of  the  brain,  with  their  set  of  weights  and   thresholds  

– Not  descrip:ve  –  even  if  we  have  the  right  neuron   graph,  weights  and  thresholds  

•  As  with  all  philosophical  ques?ons,  a  complete   and  fully  sa?sfactory  answer  is  impossible  

•  Aim  at  par?al  answers  

(10)

My  own  defini?on  

You  need  to  state  your  goal  and   narrow  the  defini?on  to  that  goal  

For  example:  

•  recognize  named  en66es  from  text  (Delta:COMPANY,   JFK:AIRPORT,  FlightStats:WEBSITE)  

•  extract  rela6ons  from  text  (subj  IN  obj,  subj  DOES  obj,  ...)  

•  find  the  correct  parsing  tree  (we'll  discuss  this  later)  

•  for  a  text  about  x,  decide  whether  it  states  a  posi?ve  or   nega?ve  opinion  (sen6ment  analysis)  

•  supply  correct  answers  to  natural  language  ques?ons  similar   to  a  list  of  ques?ons  on  a  list  (ques6ons  &  answers)  

•  win  a  Jeopardy!  game  

(11)

A  long-­‐term  challenge  

•  A  conversa?ons  between  people:  

may  have  one  or  many  goals   they  may  change  dynamically  

it  may  not  have  any  goal  whatsoever  

•  Watson  cannot  do  this  (yet)  

•  Computers  s?ll  need  to  be  supplied  goals  by   humans  

•  Define  "understanding"  according  to  goal  

(12)

Texts  have  purposes  

•  Text  could  be:  

a  body  of  laws   company  policies   a  novel  

– ...  

•  Watson  aims  to  be  a  commercial  product  

•  Don't  expect  it  to  be  good  at  parsing  The  Lord   of  the  Rings  (speaking  by  experience  here...)  

(13)

How  IBM  sells  this  

•  Domain  focus  

healthcare  (diagnos?cs)   tourism  (travel  advice)  

finance  (investment  advice)  

•  General  tools  

– engagement  advisor   discovery  advisor  

data  explorer  

•  Interna?onaliza?on:    

– Chinese,  Spanish,  some  French  

(14)

What's  in  Watson  for  you?  

•  If  you  go  work  for  IBM  Watson:  

– research  in  cogni?ve  compu?ng  

•  broader  defini?ons  of  "understanding"  

•  automated  goal  decisions  

development  

•  so]ware  using  Watson  services  

sales/commercial  

•  new  broad  applica?on  fields  

•  If  you  go  work  elsewhere:  

many  firms  want  Watson  NOW  

few  know  what  it  is  or  how  to  use  it  

•  make  yourself  indispensable  

(15)

Demos  (some  possible  goals)  

•  Concept  expansion  [watson-­‐ce-­‐demo.mybluemix.net]  

•  Language  iden?fica?on  [ibmlaser.mybluemix.net/liddemo.html]  

•  Machine  transla?on  [ibmlaser.mybluemix.net/mtdemo.html]  

•  Message  resonance  [watson-­‐mr-­‐demo.mybluemix.net]  

•  Ques?ons  and  answers  [watson-­‐qa-­‐demo.mybluemix.net]  

•  Rela?onship  extrac?on  [ibmlaser.mybluemix.net/siredemo.html]  

•  User  modelling  [watson-­‐um-­‐demo.mybluemix.net]  

(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)

Algorithmics  for  NLP  

•  Frequency  analysis  

words  in  sentences   – words  in  paragraphs  

– tuple  of  words  

•  Parsing    

– recursive  syntac?cal  analysis  

seman?cs  from  combinatorial  structure    

•  Machine  Learning  

– neural  networks   – classifiers  

efficient  op?miza?on  methods  

(25)

The  NLP  pipeline  

A  flowchart  describing  how  the  basic  algorithms  are  put  together  

(26)

How  to  use  Watson  Services  

•  Watson  Engagement  Advisor  

web  interface   Java  API  

•  BlueMix  

NodeRed   – Java  

...  

(27)

How  to  set  up  your  worksta?on  

•  Get  an  IBM  ID    

–  www.ibm.com,  "Register"  

•  Get  a  month-­‐long  bluemix  trial    

–  bluemix.net,  "Sign  Up"  

•  Download  and  install  Python  2.7.9  

–  heps://www.python.org/downloads/  

•  Download  and  install  NLTK  3.0  

–  hep://www.nltk.org/  

•  Download  and  install  paeern/CLiPS  

–  hep://www.clips.ua.ac.be/paeern  

(28)

Announcement  

•  Workshop  with  IBM,  HEC  and  Ecole42  

– 21/22  march  2015   – compe??on  +  prizes  

– good  showcase  for  your  coding  abili?es  

Riferimenti

Documenti correlati

Rimborso delle spese sostenute per servizi di trasporto pubblico locale, regionale e interregionale:. ◼

 Evoluzione delle tecnologie dell’informazione e della comunicazione, che richiedono la capacità di comprendere la reale importanza del ruolo della tecnologia all’interno

IBM si riserva il diritto di modificare eventualmente le presenti Condizioni di Utilizzo e di rendere conto di qualsiasi Aggiornamento che IBM può fornire o rendere disponibile per

La recente classifica Green500 si propone come alternativa alla più nota Top500, nel senso che ordina i principali 500 supercomputer al mondo non in base alla velocità di

Ciò mette a disposizione un singolo file system unificato in tutta l'organizzazione, eliminando i silos dati, semplificando la gestione dello storage e garantendo prestazioni

È necessario ottenere titolarità sufficienti per ogni Istanza dell'offerta IBM SaaS resa disponibile per l'accesso e l'utilizzo durante il periodo di misurazione specificato nella

Se il Cliente ha sottoscritto l'abbonamento ai servizi IBM Deal Management, IBM Deal Management: Bill Distribution Module, IBM Shopper Insights, o IBM Assortment Optimization,

Il servizio Premium Foundation Data Import fornisce un (1) feed di importazione di Dati di Base da un (1) sistema HRIS in BrassRing, che consente il caricamento dei dati di base