Download presentation
Presentation is loading. Please wait.
1
SIP as infrastructure Henning Schulzrinne Dept. of Computer Science, Columbia University, New York hgs@cs.columbia.edu SIP 2007 (upperside.fr) Paris, France February 2007
2
2 Outline Scaling SIP to the real world: emergency calling Scaling SIP to very large deployments –some measurements for designing large servers –congestion control and dealing with avalanche restart –P2P SIP –failure discovery The state of SIP standardization, year 11 –developments in 2006 & upcoming highlights –trouble in standards land
3
February 20073 Roadmap Introduction Emergency calling Server scaling P2P SIP End-to-end management Standardization and interoperability
4
February 20074 Evolution of VoIP “amazing – the phone rings” “does it do call transfer?” “How can I make it stop ringing?” 1996-20002000-20032004-2005 catching up with the digital PBX long-distance calling, ca. 1930 going beyond the black phone 2006- “Can it really replace the phone system?” replacing the global phone system
5
February 20075 IETF VoIP efforts SIP (protocol) SIPPING (usage, requirements) ECRIT (emergency calling) AVT (RTP, SRTP, media) ENUM (E.164 translation) IPTEL (tel URL) SIMPLE (presence) GEOPRIV (geo + privacy) usesmay use uses provides usually used with IETF RAI area MMUSIC (SDP, RTSP, ICE) XCON (conf. control) SPEERMINT (peering) uses SPEECHSC (speech services) SIGTRAN (signaling transport) uses
6
February 20076 Roadmap Introduction Emergency calling Server scaling P2P SIP End-to-end management Standardization and interoperability
7
February 20077 VoIP emergency communications emergency call dispatch civic coordination emergency alert (“inverse 911”) Contact well- known number or identifier Route call to location- appropriate PSAP Deliver precise location to call taker to dispatch emergency help nowtransitionall IP 112 911 112 911 112, 911 urn:service:sos SR VPC LoST phone number location (ALI lookup) in-band key location in-band
8
February 20078 IETF ECRIT working group Emergency Contact Resolution with Internet Technologies Solve four major pieces of the puzzle: –location conveyance (with SIP & GEOPRIV) –emergency call identification –mapping geo and civic caller locations to PSAP URI –discovery of local and visited emergency dial string Not solving –location discovery --> GEOPRIV –inter-PSAP communication and coordination –citizen notification Current status: –finishing general and security requirements –agreement on mapping protocol (LoST) and identifier (sos URN) –working on overall architecture and UA requirements
9
February 20079 ECRIT: Options for location delivery GPS L2: LLDP-MED (standardized version of CDP + location data) –periodic per-port broadcast of configuration information –currently implementing CDP L3: DHCP for –geospatial (RFC 3825) –civic (RFC 4676) L7: proposals for retrievals: HELD, RELO, LCP, SIP, … –for own IP address or by third party (e.g., ISP to infrastructure provider) –by IP address –by MAC address –by identifier (conveyed by DHCP or PPP) –HELD, RELO: both HTTP-based
10
February 200710 ECRIT: Finding the correct PSAP Which PSAP should the e-call go to? –Usually to the PSAP that serves the geographic area –Sometimes to a backup PSAP –If no location, then ‘default’ PSAP –solved by LoST
11
February 200711 ECRIT: LoST Functionality Civic as well as geospatial queries –civic address validation Recursive and iterative resolution Fully distributed and hierarchical deployment –can be split by any geographic or civic boundary –same civic region can span multiple LoST servers Germany Bavaria Munich Neu Perlach 96 urn:service:sos.police Indicates errors in civic location data debugging –but provides best-effort resolution Can be used for non-emergency services: –directory and information services –pizza delivery services, towing companies, …
12
February 200712 LoST: Location-to-URL Mapping cluster serves VSP 2 NY US NJ US Bergen County NJ US 123 Broad Ave Leonia Bergen County NJ US cluster serving VSP 1 replicate root information search referral root nodes Leonia NJ US sip:psap@leonianj.gov VSP 1 LoST
13
February 200713 LoST Architecture T1 (.us) T2 (.de) T3 (.dk) G G G G G broadcast (gossip) T1:.us T2:.de resolver seeker 313 Westview Leonia, NJ US Leonia, NJ sip:psap@leonianj.gov tree guide
14
February 200714 Roadmap Introduction Emergency calling Server scaling P2P SIP End-to-end management Standardization and interoperability
15
February 200715 SIP server overload Proxies will return 503 --> retry elsewhere Just adds more load Retransmissions exacerbate the problem overloaded INVITE 503 Springsteen tickets!! earthquake vote for your favorite…
16
February 200716 Avalanche restart Large number of terminals all start at once Typically, after power outage Overwhelms registrar Possible loss of registrations due to retransmission time-out #300,000 #1 reboot after power outage REGISTER
17
February 200717 Overload control Current discussion in design team Feedback control: rate-based or window-based Avoid congestion collapse Deal with multiple upstream sources capacity goodput offered load
18
February 200718 Scaling servers & TCP Need TCP –TLS support: customer privacy, theft of service, … particularly for WiFi –many SIP messages now exceed reasonable UDP size (fragmentation) e.g., INVITE for IMS: 1182 bytes Concern: UA support –improving: 82% of systems at recent SIPit’19 had TCP support –only 45% support TLS Concern: TCP (and TLS) much less efficient than UDP –running series of tests to identify differences –difference mainly in connection setup cost message splitting (may need pre- parsing or incremental parsers) thread count (one per socket?) Our model: –300,000 customers/servers 0.1 Erlang, 180 sec/call –600,000 BHCA --> 167 req/sec –300,000 registrations --> 83 req/sec –$0.001/subscriber
19
February 200719 Performance evaluation results Pentium 4 server, 3 GHz –4 GB memory –Linux 2.6.16 echo server Kumiko Ono
20
February 200720 SIP server measurements Initial INVITE measurements –OpenSER –400 calls/sec for TCP –roughly 260 calls/sec for TLS sipd REGISTER test Kumiko Ono, Charles Shen, Erich Nahum TCP
21
February 200721 Roadmap Introduction Emergency calling Server scaling P2P SIP End-to-end management Standardization and interoperability
22
February 200722 LAN P2P SIP Why? –no infrastructure available: emergency coordination –don’t want to set up infrastructure: small companies –Skype envy :-) P2P technology for –user location only modest impact on expenses but makes signaling encryption cheap –NAT traversal matters for relaying –services (conferencing, …) how prevalent? New IETF working group just formed –likely, multiple DHTs –common control and look-up protocol? P2P provider A P2P provider B p2p network traditional provider DNS zeroconf generic DHT service
23
February 200723 P2P SIP -- components Multicast-DNS (zeroconf) SIP enhancements for LAN –announce UAs and their capabilities Client-P2P protocol –GET, PUT mappings –mapping: proxy or UA P2P protocol –get routing table, join, leave, … –independent of DHT? –replaces DNS for SIP, not proxy
24
February 200724 Roadmap Introduction Emergency calling Server scaling P2P SIP End-to-end management Standardization and interoperability
25
February 200725 VoIP user experience Only 95-99.5% call attempt success –“Keynote was able to complete VoIP calls 96.9% of the time, compared with 99.9% for calls made over the public network. Voice quality for VoIP calls on average was rated at 3.5 out of 5, compared with 3.9 for public-network calls and 3.6 for cellular phone calls. And the amount of delay the audio signals experienced was 295 milliseconds for VoIP calls, compared with 139 milliseconds for public- network calls.” (InformationWeek, July 11, 2005) Mid-call disruptions common Lots of knobs to turn –Separate problem: manual configuration
26
February 200726 Open issues: Configuration Ideally, should only need a user name and some credential –password, USB key, host identity (MAC address), … More than DHCP: device needs to get –SIP-level information (outbound proxy, timers) –policy information (“sorry, no video”) Multiple sources of configuration information –local network (hotel proxy) –voice service provider (off- network) Configuration information may change Needs to allow no-touch deployment of thousands of devices SIP configuration framework –has been languishing for years –currently being rewritten to reduce complexity
27
February 200727 Circle of blame OS VSP app vendor ISP must be a Windows registry problem re-install Windows probably packet loss in your Internet connection reboot your DSL modem must be your software upgrade probably a gateway fault choose us as provider
28
February 200728 Traditional network management model SNMP X “management from the center”
29
February 200729 Old assumptions, now wrong Single provider (enterprise, carrier) –has access to most path elements –professionally managed Problems are hard failures & elements operate correctly –element failures (“link dead”) –substantial packet loss Mostly L2 and L3 elements –switches, routers –rarely 802.11 APs Problems are specific to a protocol –“IP is not working” Indirect detection –MIB variable vs. actual protocol performance End systems don’t need management –DMI & SNMP never succeeded –each application does its own updates
30
February 200730 Management element inspection configuration fault location network understanding we’ve only succeeded here what causes the most trouble?
31
February 200731 Managing the protocol stack RTP UDP/TCP IP SIP no route packet loss TCP neg. failure NAT time-out firewall policy protocol problem playout errors media echo gain problems VAD action protocol problem authorization asymmetric conn (NAT)
32
February 200732 Proposal: “Do You See What I See?” Each node has a set of active and passive measurement tools Use intercept (NDIS, pcap) –to detect problems automatically e.g., no response to HTTP or DNS request –gather performance statistics (packet jitter) –capture RTCP and similar measurement packets Nodes can ask others for their view –possibly also dedicated “weather stations” Iterative process, leading to: –user indication of cause of failure –in some cases, work-around (application-layer routing) TURN server, use remote DNS servers Nodes collect statistical information on failures and their likely causes
33
February 200733 Management architecture “not working” (notification) inspect protocol requests (DNS, HTTP, RTCP, …) “DNS failure for 15m” orchestrate tests contact others ping 127.0.0.1 can buddy reach our resolver? notify admin (email, IM, SIP events, …) request diagnostics
34
February 200734 Roadmap Introduction Emergency calling Server scaling P2P SIP End-to-end management Standardization and interoperability
35
February 200735 SIP, SIPPING & SIMPLE –00 drafts includes draft-ietf-*-00 and draft-personal-*-00
36
February 200736 RFC publication
37
February 200737 IETF WG: SIP in 2006 & 2007 ~ 44 SIP-related RFCs published in 2006 –BFCP, conferencing –SDP revision –rich presence Activities: –hitchhiker’s guide –infrastructure: GRUUs (random identifiers) URI lists XCAP configuration SIP MIB –services: rejecting anonymous requests consent framework location conveyance session policy –security: end-to-middle security certificates SAML sips clarification –NAT: connection re-use SIP outbound ICE (in MMUSIC) see http://tools.ietf.org/wg/sip’/
38
February 200738 IETF WG: SIPPING 31 RFCs published in 2006 Policy –media policy –SBC functions Services –service examples –call transfer –configuration framework –spam and spit –text-over-IP –transcoding Testing and operations –IPv6 transition –race condition examples –IPv6 torture tests –SIP offer-answer examples –overload requirements –configuration –voice quality reporting
39
February 200739 Interoperability Generally no interoperability problems for basic SIP functionality –basic call, digest registration, call transfer, voice mail Weaker in advanced scenarios and backward compatibility –handling TCP, TLS –NAT support (symmetric RTP, ICE, STUN,...) –multipart bodies –SIP torture tests –call transfer, call pick-up –video and voice codec interoperability (H.264, anything beyond G.711) SIPit useful, but no equivalent of WiFi certification –most implementations still single-vendor (enterprise, carrier) or vendor- supplied (VSP) –SFTF (test framework) still limited Need profiles to guide implementers
40
February 200740 Trouble in Standards Land Proliferation of transition standards: 2.5G, 2.6G, 3.5G, … –true even for emergency calling… Splintering of standardization efforts across SDOs –primary: IEEE, IETF, W3C, OASIS, ISO –architectural: PacketCable, ETSI, 3GPP, 3GPP2, OMA, UMA, ATIS, … –specialized: NENA –operational, marketing: SIP Forum, IPCC, … OASIS IEEE IETF W3C ISO (MPEG) L2.5-L7 protocols data exchange data formats L1-L2 3GPP PacketCable
41
February 200741 IETF issues SIP WGs: small number (dozen?) of core authors (80/20) –some now becoming managers… –or moving to other topics IETF: research engineering maintenance –many groups are essentially maintaining standards written a decade (or two) ago DNS, IPv4, IPv6, BGP, DHCP; RTP, SIP, RTSP constrained by design choices made long ago –often dealing with transition to hostile & “random” network –network ossification Stale IETF leadership –often from core equipment vendors, not software vendors or carriers fair amount of not-invented-here syndrome late to recognize wide usage of XML and web standards late to deal with NATs security tends to be per-protocol (silo) –some efforts such as SAML and SASL tendency to re-invent the wheel in each group
42
February 200742 IETF issue: timeliness Most drafts spend lots of time in 90%- complete state –lack of energy (moved on to new -00) –optimizers vs. satisfiers multiple choices that have non- commensurate trade-offs Notorious examples: –SIP request history: Feb. 2002 – May 2005 (RFC 4244) –Session timers: Feb. 1999 – May 2005 (RFC 4028) –Resource priority: Feb. 2001 – Feb 2006 (RFC 4412) New framework/requirements phase adds 1-2 years of delay Three bursts of activity/year, with silence in-between –occasional interim meetings IETF meetings are often not productive –most topics gets 5-10 minutes lack context, focus on minutiae –no background same people as on mailing list –5 people discuss, 195 people read email No formal issue tracking –some WGs use tools, haphazardly Gets worse over time: –dependencies increase, sometimes undiscovered –backwards compatibility issues –more background needed to contribute
43
February 200743 IETF issues: timeliness WG chairs run meetings, but are not managing WG progress –very little control of deadlines e.g., all SIMPLE deadlines are probably a year behind –little push to come to working group last call (WGLC) –limited timeliness accountability of authors and editors –chairs often provide limited editorial feedback IESG review can get stuck in long feedback loop –author – AD – WG chairs –sometimes lack of accountability (AD-authored documents) RFC editor often takes 6+ months to process document –dependencies; IANA; editor queue; author delays –e.g., session timer: Aug. 2004 – May 2005
44
February 200744 Conclusion Moving from lab and trials to large-scale deployments Planning horizon includes turning off circuit-switched phones –in large enterprises –in some carriers From emphasis on features to global scale: –interoperation –configuration –peer-to-peer systems –emergency services –overload behavior –failure detection across networks and protocol layers Integration of advanced features (IM, presence, video, programmable services) still lacking Current standardization processes slow and complexity-inducing
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.